Asmus Freytag <asmusf_at_ix.netcom.com> wrote:
|On 7/30/2013 12:26 PM, Doug Ewell wrote:
|> Buck Golemon <buck at yelp dot com> replied to Richard Wordingham
|> <richard dot wordingham at ntlworld dot com>:
|>
|>>>> There are no Unicode code pages.
|>>> Just to be pedantic, there are several on Windows. They encode the
|>>> coding form (Unicode codes being best thought of as an assignment of
[…]
|> Most Windows .NET developers who are concerned about proper character
|> handling would know this information existed, though they might not have
|> the numbers memorized.
|>
|> Jukka was right, though: Unicode itself does not have code pages.
|> Rather, at least one vendor has defined some of the Unicode encoding
|> schemes as if they were code pages. A code page is not, in general, the
|> same as an encoding scheme.
|What is, then, the proper definition of a "code page"?
|
|When Unicode was first introduced, it was seen as the one thing that
|wasn't a "code page", because the way the Win32 API associated one of
|the traditional code pages with Unicode (giving rise the "A" and "W"
|versions of all the APIs).
|
|Later, it was realized that in order to specify what encoding data were
|in or, for example, to specify a conversion from UTF-7 and UTF-8 to
|UTF-16 (native encoding scheme) one needed some suitable ID number to
|identify the mapping. Well, extending the code page id was the most
|natural way to do that, because, on several platforms, the use of a
|numerical ID from the IBM code page registry was established practice.
IANA however records „MIBenum“ values for that purpose:
The MIBenum value is a unique value for use in MIBs to identify
coded character sets.
See [1] (but also RFC 2278, section 3.7).
[1] <http://www.iana.org/assignments/character-sets>
MIB enums values:
Reserved
0 - 2
Set By Standards Organizations
3 - 999
Unicode / ISO 10646
1000 - 1999
Vendor
2000 - 2999
|A./
--steffen
Received on Tue Jul 30 2013 - 15:52:40 CDT
This archive was generated by hypermail 2.2.0 : Tue Jul 30 2013 - 15:52:41 CDT