Ideographic Variation Database

PRI 521: Combined registration of the CAAPH collection and of sequences in that collection

Introduction

A submission for the “Combined registration of the CAAPH collection and of sequences in that collection” has been received by the IVD Registrar. This submission is currently under review according to the procedures of UTS #37, Unicode Ideographic Variation Database, with an expected closing date of 2025-07-11.

Review Instructions

Reviewers are encouraged to comment on any aspect of the submissions, but more particularly on:

  • whether the glyphic subset corresponding to a proposed sequence is indeed a glyphic subset of the base character for the sequence
  • whether the proposed sequences are congruent with the scope of their collection, or whether a new collection may be more appropriate

All comments should be sent via the reporting form and will be forwarded to the submitter. The content of the submission may be adjusted during the review period to account for the comments received.

Submission Details

The content of this section has been provided by the submitter.

Submitter

  • Name and address of the registrant: Culture and Art Publishing House Co., Ltd., No. 52, Dongsi Batiao Alley, Dongcheng District, Beijing City, PRC
  • Name and email address of the representative: Eiso Chan (eisoch@126.com or chenyc@caaph.com)
  • URL of the website describing the collection: https://www.unicode.org/ivd/caaph/
  • Suggested identifier for the collection: CAAPH
  • Pattern for the sequence identifiers: [A-Z]\d{4}
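The identifier pattern above can be checked mechanically. The following is a minimal sketch, assuming the pattern is meant to match the whole identifier and that `\d` means the ASCII digits 0 through 9 only; the function name and the sample identifiers are hypothetical and are not taken from the submission.

```python
import re

# Pattern copied from the submission. re.ASCII limits \d to 0-9, which is
# an assumption about the intended meaning of the pattern.
SEQUENCE_ID = re.compile(r"[A-Z]\d{4}", re.ASCII)


def is_valid_sequence_id(identifier: str) -> bool:
    """Return True if the whole string is one uppercase letter plus four digits."""
    return SEQUENCE_ID.fullmatch(identifier) is not None


if __name__ == "__main__":
    # Hypothetical identifiers, used only to exercise the pattern.
    for candidate in ["J0001", "j0001", "J001", "J00012"]:
        print(candidate, is_valid_sequence_id(candidate))
```

Using `fullmatch` rather than `search` rejects identifiers that merely contain a matching substring, such as one with a fifth digit.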

Culture and Art Publishing House

Since 2015, Culture and Art Publishing House (CAAPH, 文化艺术出版社) has gradually established a series of digital projects, including, but not limited to, its official website, an official WeChat account, and Weidian, Weibo, Douyin, Bilibili, and Xiaohongshu accounts. All publications are released simultaneously in digital form, including ePub, Mobi, XML, and other formats. Most are published as composite works (复合出版物): QR codes are printed on the books, and readers can scan them to use the digital resources in CAAPH's databases. In 2017, CAAPH established a long-term project to digitize some traditional Chinese musical notations. In 2021, it undertook the task of establishing and operating the website and database for the Chinese National Center for Contemporary Art of the Chinese Academy of Arts (中国艺术研究院国家当代艺术研究中心), which is also a long-term project. Since 2025, it has been establishing a newer and larger database that provides more information on Chinese culture and Intangible Cultural Heritage (ICH, 非物质文化遗产).

CAAPH IVD Collection & Assignments

The purpose of the CAAPH IVD collection is to register ideographic variation sequences (IVSes) for a number of ideograph variants that are useful for supporting the digitization of Chinese culture and ICH.

This submission covers a total of 95 base characters that correspond to code points in the CJK Unified Ideographs blocks.

The proposed sequences of the CAAPH IVD collection are listed in a linked data file, which contains exactly 198 lines (excluding the header lines and EOF marker).
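For reviewers who want to check the counts given above, the following is a minimal sketch of a parser for the data file. It assumes the file follows the sequence data file layout described in UTS #37: one sequence per line with three semicolon-separated fields (the base character and variation selector as hexadecimal code points, the collection identifier, and the sequence identifier), plus comment lines beginning with `#`. The type and function names are hypothetical, and the sample lines are invented for illustration; they are not taken from the submission.

```python
from typing import NamedTuple


class IvdEntry(NamedTuple):
    base: int          # code point of the base ideograph
    selector: int      # code point of the variation selector
    collection: str    # collection identifier, e.g. "CAAPH"
    identifier: str    # sequence identifier within the collection


def parse_ivd_sequences(text: str) -> list[IvdEntry]:
    """Parse data lines, skipping blank lines and '#' comments (header, EOF)."""
    entries = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        sequence, collection, identifier = (f.strip() for f in line.split(";"))
        base_hex, selector_hex = sequence.split()
        entries.append(
            IvdEntry(int(base_hex, 16), int(selector_hex, 16), collection, identifier)
        )
    return entries


def summarize(entries: list[IvdEntry]) -> tuple[int, int]:
    """Return (number of sequences, number of distinct base characters)."""
    return len(entries), len({entry.base for entry in entries})


# Hypothetical sample data, NOT taken from the submission.
SAMPLE = """\
# header line
4E00 E0100; CAAPH; M0001
4E00 E0101; CAAPH; J0001
4E8C E0100; CAAPH; M0002
# EOF
"""

if __name__ == "__main__":
    print(summarize(parse_ivd_sequences(SAMPLE)))  # (3, 2)
```

If the layout assumption holds, running `summarize` on the actual data file should give 198 sequences and 95 distinct base characters, matching the figures stated in this section.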

Sequence Identifier Prefixes

The following table describes the uppercase Latin characters that serve as the first character of the sequence identifiers, the first four of which are based on the Chinese Library Classification (CLC, 中国图书馆分类法/中图法):

Source  Prefix  Description
CLC     B       Philosophy and religions (哲学、宗教)
CLC     G       Culture, science, education and sports (文化、科学、教育、体育)
CLC     J       Art (艺术)
CLC     K       History and geography (历史、地理)
CAAPH   M       Necessary basic glyphs whose style matches Tōngyòng Guīfàn Hànzìbiǎo (通用规范汉字表) and the GB/T 22321.1-2025 standard

NOTE: Four uppercase Latin characters (L, M, W, and Y) are not defined by CLC. CAAPH uses them for other purposes, such as M, which is described in the table above.
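The prefix table above can be expressed as a small lookup, which may help when grouping sequence identifiers by category during review. This is a sketch only: the dictionary and function names are hypothetical, the descriptions are abbreviated from the table, and the sample identifiers are invented.

```python
from typing import Optional

# Prefix -> (source, description), transcribed from the table above.
PREFIXES = {
    "B": ("CLC", "Philosophy and religions"),
    "G": ("CLC", "Culture, science, education and sports"),
    "J": ("CLC", "Art"),
    "K": ("CLC", "History and geography"),
    "M": ("CAAPH", "Necessary basic glyphs"),
}


def describe_prefix(identifier: str) -> Optional[tuple[str, str]]:
    """Return (source, description) for an identifier's first character,
    or None if that prefix is not one of the five described in the table."""
    return PREFIXES.get(identifier[:1])


if __name__ == "__main__":
    # Hypothetical identifiers, not taken from the submission.
    print(describe_prefix("J0001"))  # ('CLC', 'Art')
    print(describe_prefix("L0001"))  # None
```

A `None` result only means the prefix is not described in the table; it does not by itself mean the identifier violates the collection's pattern.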

Representative Glyph Charts

Representative glyphs for the submitted sequences are available in Appendix 5 on pp. 16 through 25 of document IRG N2796R3, which shows the sequence identifiers indexed by their base characters in code point order.

Updates & Comments Received

This page may be updated from time to time to inform reviewers of updates and some of the comments received.

2025-04-09: A table that describes the five uppercase Latin characters that serve as the first character of the sequence identifiers was added to a new section named Sequence Identifier Prefixes.