Re: Anyone know what SOPS is?

From: Jianping Yang (JIYANG@us.oracle.com)
Date: Fri Dec 04 1998 - 14:03:15 EST


Yes, Oracle supports it. Here is the decsription about SOPS. Hope this will
help you.
 
1. Background Information
   SOPS is the abbreviation for Service Operation Process
   System and is used by Telecom Taiwan for its "telephone
   installation, disconnection and transfer information
   system". It is based on the CNS 11643-1986, the Chinese
   Code for Information Interchange (CCCII) announced in 1986.
   The related information of both CNS 11643-1986 and SOPS
   will be described in following sections.
 
 
2. CNS 11643-1986
2.1 Structure of Character Set
    There are sixteen planes in the CCCII and only plane one
    and two are occupied by characters, the plane three to
    eleven are left blank and reserved for future use, plane
    twelve to sixteen are for user defined characters.
2.2 Rules of Encoding
    The character in each plane is defined by two bytes.
    According to CNS 5205 and CNS 7654, the control code area
    in 7 bits is not used. It means 0x00 to 0x20 and ox7f are
    not used for encoding. So, only 0x21 to 0x7e are used for
    encoding and there are 94 code point in each byte and the
    maximum number of characters in one plane is 94 by 94,
    which is 8836.
 
2.3 Code Range
    In the first plane, there are 684 symbols, its code range
    is from 0x2121 to 0x427e, and 5401 Chinese characters, its
    code range is from 0x4421 to 0x7d48.
    In the second plane, there are 7650 Chinese characters, its
    code range is from 0x2121 to 0x7244.
 
 
3. SOPS Encoding
3.1 Structure of Character Set
    In addition to the plane 1 and 2 of CNS 11643-1986, the
    plane 12 and 13 for user defined character collected by
    Electronic Data Process Center (EDPC) of Executive Yuan
    are also included.
 
3.2 Rules of Encoding and Code Range
    In the first plane, each symbol and character is encoded
    by adding 0x8080 to the original 7-bit CNS code. Its code
    range is from 0xa1a1 to 0xfdc8 for the symbols and characters.
 
    In the second plane, each character is encoded by adding
    0x8000 to the original 7-bit CNS code, its code range is
    from 0xa121 to 0xf244.
 
    In the third plane, each character is encoded by adding
    0x8080 to the original 7-bit CNS code with leading code
    0xc2cb. It is four-byte encoding.
 
    In the fourth plane, each character is encoded by adding
    0x8080 to the original 7-bit CNS code with leading code
    0xc2cc. It is four-byte encoding.
 
Regards,
***************************************************************************
 Jianping Yang Phone: (650) 506-4865
 Development Manager Fax: (650) 506-7225
 Server Globalization Technology Office: 4op948
 Oracle Corporation E-mail: jiyang@us.oracle.com
 


attached mail follows:



John Jenkins wrote:
>
> >
> >I have a customer in Taiwan, he is using a Unicode mapping for BIG5, he
> >claims that a lot of his data is in encoding he calls "SOPS". He claims
> >this is related to BIG5, but will give no indication of how. I have
> >never heard of this encoding, he will not provide any help like an ISO,
> >CNS standards number or IBM CCSID. Has anyone heard of this encoding?,
> >and if so do they have a Unicode mapping?, or even a BIG5 mapping so I
> >could construct one?
> >
>
> I've never heard of SOPS. There are certainly, however, a large number of
> proprietary extensions to the Big 5 out there.

Oracle lists support for this character set, on their Oracle 8 support
page:

http://ntsolutions.oracle.com/products/o8/html/nls_ds.htm

It mentions that it's a 32 bit character set. Hmmmmmm....



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:43 EDT