Hi,
I'm in an off-site meeting from Tuesday to Friday, and I'll reply your email
after I'm back. For any NLSRTL issues, please contact Chao(chaowang,
6-0626),
for any RDBMS NLS issues, please contact Josef(jhasenbe, 6-5829),
and for any management issues, please contact Michael(myau, 6-0730).
Thanks,
***************************************************************************
Jianping Yang Phone: (415) 506-4865
Sever Globalization Technology Fax: (415) 506-7225
Oracle Corporation E-mail: jiyang@oracle.com
attached mail follows:
Yes, Oracle supports it. Here is the decsription about SOPS. Hope this will
help you.
1. Background Information
SOPS is the abbreviation for Service Operation Process
System and is used by Telecom Taiwan for its "telephone
installation, disconnection and transfer information
system". It is based on the CNS 11643-1986, the Chinese
Code for Information Interchange (CCCII) announced in 1986.
The related information of both CNS 11643-1986 and SOPS
will be described in following sections.
2. CNS 11643-1986
2.1 Structure of Character Set
There are sixteen planes in the CCCII and only plane one
and two are occupied by characters, the plane three to
eleven are left blank and reserved for future use, plane
twelve to sixteen are for user defined characters.
2.2 Rules of Encoding
The character in each plane is defined by two bytes.
According to CNS 5205 and CNS 7654, the control code area
in 7 bits is not used. It means 0x00 to 0x20 and ox7f are
not used for encoding. So, only 0x21 to 0x7e are used for
encoding and there are 94 code point in each byte and the
maximum number of characters in one plane is 94 by 94,
which is 8836.
2.3 Code Range
In the first plane, there are 684 symbols, its code range
is from 0x2121 to 0x427e, and 5401 Chinese characters, its
code range is from 0x4421 to 0x7d48.
In the second plane, there are 7650 Chinese characters, its
code range is from 0x2121 to 0x7244.
3. SOPS Encoding
3.1 Structure of Character Set
In addition to the plane 1 and 2 of CNS 11643-1986, the
plane 12 and 13 for user defined character collected by
Electronic Data Process Center (EDPC) of Executive Yuan
are also included.
3.2 Rules of Encoding and Code Range
In the first plane, each symbol and character is encoded
by adding 0x8080 to the original 7-bit CNS code. Its code
range is from 0xa1a1 to 0xfdc8 for the symbols and characters.
In the second plane, each character is encoded by adding
0x8000 to the original 7-bit CNS code, its code range is
from 0xa121 to 0xf244.
In the third plane, each character is encoded by adding
0x8080 to the original 7-bit CNS code with leading code
0xc2cb. It is four-byte encoding.
In the fourth plane, each character is encoded by adding
0x8080 to the original 7-bit CNS code with leading code
0xc2cc. It is four-byte encoding.
Regards,
***************************************************************************
Jianping Yang Phone: (650) 506-4865
Development Manager Fax: (650) 506-7225
Server Globalization Technology Office: 4op948
Oracle Corporation E-mail: jiyang@us.oracle.com
attached mail follows:
John Jenkins wrote:
>
> >
> >I have a customer in Taiwan, he is using a Unicode mapping for BIG5, he
> >claims that a lot of his data is in encoding he calls "SOPS". He claims
> >this is related to BIG5, but will give no indication of how. I have
> >never heard of this encoding, he will not provide any help like an ISO,
> >CNS standards number or IBM CCSID. Has anyone heard of this encoding?,
> >and if so do they have a Unicode mapping?, or even a BIG5 mapping so I
> >could construct one?
> >
>
> I've never heard of SOPS. There are certainly, however, a large number of
> proprietary extensions to the Big 5 out there.
Oracle lists support for this character set, on their Oracle 8 support
page:
http://ntsolutions.oracle.com/products/o8/html/nls_ds.htm
It mentions that it's a 32 bit character set. Hmmmmmm....
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:43 EDT