[Unicode]  Ideographic Variation Database Home | Site Map | Search
 

PRI 98: Combined registration of the Adobe-Japan1 collection and of sequences in that collection

Introduction

A submission for the "Combined registration of the Adobe-Japan1 collection and of sequences in that collection" has been received by the IVD registrar. Because of the comments received during the review period, the submission was not included in the IVD, but instead a revised version has been submitted as PRI 108.

This page remains available for archival purposes.

Submission details

The content of this section has been provided by the submitter.

Submitter

  • Name and address of registrant: Adobe Systems Incorporated, 345 Park Avenue, San Jose, CA 95110-2704 USA
  • Names and email addresses of representatives: Ken Lunde (lunde@adobe.com) & Eric Muller (emuller@adobe.com)
  • URL of the web site describing the collection: Adobe Tech Note #5078
  • Suggested identifier for the collection: Adobe-Japan1
  • Pattern for the sequence identifiers: CID\+[0-9]+

The Adobe-Japan1 Character Collection

The Adobe-Japan1 Character Collection is widely used as the basis for the glyph complement of fonts for the Japanese market. Successive versions of the collection are identified by appending a supplement number, e.g. Adobe-Japan1-6 (the most current supplement).

Adobe-Japan1-6 enumerates 23,058 glyphs, specifically CIDs 0 through 23057, contains nearly 15,000 kanji (aka, ideographs). The actual number of kanji is 14,664, and are the following CIDs and CID ranges: 656, 1125-7477, 7633-7886, 7961-8004, 8266-8267, 8284, 8285, 8359-8717, 13320-15443, 16779-20316, and 21071-23057.

Of the 14,664 kanji in Adobe-Japan1-6, only the following seven are unencoded and lack a parent form, meaning that they cannot be treated through the use of IVSes: 15429, 15431, 15434, and 20068-20071

The Adobe-Japan1-6 character collection is defined in Adobe Tech Note #5078.

Adobe-Japan1 IVD Collection & Assignments

The purpose of the proposed Adobe-Japan1 IVD collection is to support in Unicode plain text the distinctions which are made by the Adobe-Japan1 Character collection.

This submission covers the content of Adobe-Japan1-6. Future supplements of the Adobe-Japan1 Character Collection, if any, will be addressed by corresponding future submissions to the IVD.

Click here to view the proposed Adobe-Japan1 IVS collection. Click here to view the proposed Adobe-Japan1 IVS sequences. The second file contains 14,658 lines (excluding the four header lines).

Note that all Adobe-Japan1-6 kanji, except those seven pointed out above, are given IVS assignments, including those that have only one form assigned. This is to ensure that each Adobe-Japan1-6 kanji can be uniquely and explicitly identified without referencing their default (IVS-less) encoding, and because kanji may be added in future Adobe-Japan1 Supplements that may be variants of such kanji. As an example, observe the single line for the kanji assigned to U+3405 in the collections file: 3405 E0100; Adobe-Japan1; CID+15387

Also note that the highest IVS that is used in U+E010E. It is used with U+9089, meaning that fifteen kanji are assigned to it, and are to be distinguished through the use of IVSes. Below are all fifteen lines for U+9089:

9089 E0100; Adobe-Japan1; CID+6930

9089 E0101; Adobe-Japan1; CID+13407

9089 E0102; Adobe-Japan1; CID+14241

9089 E0103; Adobe-Japan1; CID+14242

9089 E0104; Adobe-Japan1; CID+14243

9089 E0105; Adobe-Japan1; CID+14244

9089 E0106; Adobe-Japan1; CID+14245

9089 E0107; Adobe-Japan1; CID+14246

9089 E0108; Adobe-Japan1; CID+14247

9089 E0109; Adobe-Japan1; CID+14248

9089 E010A; Adobe-Japan1; CID+14249

9089 E010B; Adobe-Japan1; CID+14250

9089 E010C; Adobe-Japan1; CID+14251

9089 E010D; Adobe-Japan1; CID+14252

9089 E010E; Adobe-Japan1; CID+20233

Adobe-Japan1 CIDs are represented as decimal integers.

Representative Glyph Charts

Representative glyphs for the submitted sequences are available in PDF format, in two sets of charts. Both sets show the sequences indexed by their base character, in code point order. The complete charts (4.4MB) show all the submitted sequences. The partial charts (776KB) show only the characters for which multiple sequences are submitted.

Comments received

This page may be updated from time to time to inform reviewers of some of the comments received.

December 20, 2006: the submitters have provided representative glyph charts for the submitted sequences (see above).