- Character Classification Tool (iiCore)
Web-based tool to categorize CJK characters and find which are not included in the International Ideographs Core (IICORE).
Tools and Resources
- Chinese Big5/HKSCS cut-n-paste input method
- Chinese Tools (Erik Peterson)
Big5/GB/Hz to UTF-8 Converter and Chinese Character Dictionary
- Data on Languages
A useful web interface to a database of Unicode characters and codepage conversions.
- Decode Unicode
Allows entering Unicode characters into a text field by simply pressing buttons labeled with corresponding glyphs. There are button sets for some European languages as well as others covering a few Unicode blocks.
- Google Translation Tools
Translation of text and web pages, dictionaries, and other tools
- Hindi Unicode Conversion Tool
- International Phonetic Alphabet (linguiste.org)
- KreativeKorp Unicode tools
Includes various text transformation and encoding utilities, including KreativeRecode for several platforms.
- Mark Davis' Utility
- Nepali Conversion Tools
- Quick Key 5.1
The ideal solution for people who need to input foreign characters or mathematical symbols quickly,
but do not want to spend time learning a new keyboard layout. Inserts any Unicode character with a single click.
- Richard Cook's Perl-based Nameslist tool
The tool takes a string, searches the Unicode names list, and outputs the code points, names, and NamesList annotations in tabular or plain HTML.
Has regex support and does NFC/NFD conversion.
- Richard Ishida's Character Usage Lookup
Finds characters used by a particular language, or languages that use a given non-ASCII character.
- Richard Ishida's Encoding Converter
Allows you to see what bytes are used by legacy encodings to represent a particular character, or to convert a sequence of bytes into characters for a range of encodings. You can customize the encodings you want to experiment with by clicking on "change encodings shown".
- Richard Ishida's List Characters
Analyzes the characters in text that you paste into the large box at the top, and produces a list of what characters were used, grouped by Unicode block.
- Richard Ishida's Language Subtag Lookup Tool
BCP 47 language tags are built from subtags in the IANA Subtag Registry. This tool helps you find
or look up subtags and check for errors in language tags.
- Richard Ishida's Unicode Character Pickers
HTML-based. Quickly create phrases in a script by clicking on Unicode characters arranged in a way that aids their identification.
Covers Arabic, Assyrian Neo-Aramaic, Armenian, Balinese, Buginese, Bengali, Burmese, Cherokee, Cyrillic, Devanagari, Egyptian hieroglyphs, Ethiopic, Gujarati, Gurmukhi, Hebrew, IPA, Javanese, Khmer, Lao, Latin & diacritics, Lisu (Fraser script), Makasar, Mandaic, Malayalam, Mongolian, N'Ko, Old English, Old Norse, Persian, Runic, Russian, Shan, Sinhala, Sundanese, Syriac, Tamil, Thaana, Thai, Tibetan, Tifinagh, Tlicho (Dogrib), Uighur, Urdu, and Vietnamese
- Richard Ishida's Unicode Code Converter
This dynamic HTML app helps you convert between Unicode character numbers, characters, UTF-8 and UTF-16 code units in hex, percent escapes, and Numeric Character References (hex and decimal).
- Richard Ishida's UniView
HTML-based. Look up characters and character blocks, paste in and discover unknown characters, store your own info about characters, search on character names, do hex/dec/NCR conversions, highlight character types, and more. Supports the latest version of Unicode.
- Search for characters in Unicode by Tomas Schild
Code points can be found by searching for part of the Unicode character name or of the obsolete Unicode 1.0 name, by the character itself, by the character's associated numerical value (if any), or by the hexadecimal Unicode value.
- Static Unicode Lookup
A static tool to perform quick client-side searches of Unicode characters.
Allows simple or complex searches, with the power to filter by Bidi Class,
Unicode Properties, Blocks, Planes, codepoint range, and more. It is an
open-source tool with the repository at
A small utility for offline viewing of the character charts
and character properties for The Unicode Standard
- Unicode Character Map
Free and fast online method to select Unicode characters to paste into forms or other apps
- Unicode Character Table by Sergei Asanov and Oleg Grigoriev
Unicode Character Table has online reference tools, including
selection of Unicode characters by clicking on a chart, and
converting to and from HTML formats.
- Database (in German)
Characters in HTML/XML ordered by block, category, bidi class, and additional properties. The version of each code point is shown.
- Glyph Lookup by hexadecimal code
- Input Tool/Converter Firefox Extension
View Unicode characters, values, and character descriptions in a chart and optionally output them to a textbox. Also converts among character references (hex or decimal), HTML entities, and Unicode. Several preferences allow a great degree of customization, including adding one's own DTD for use in entity conversions.
- Unicode Lookup by Jonathan Hedley
Unicode Lookup is an online reference tool to look up Unicode and HTML characters by name and number, and to convert between their decimal, hexadecimal, and octal representations.
- Unicodinator
A visual Unicode navigator
- W3C Internationalization Checker
Performs various tests on a web page to determine its level of internationalization-friendliness. It also summarizes key internationalization information about the page, such as character encoding and language declarations.
- Web-based Chinese Input Service (WebIME)
A web page that enables users to input Chinese characters, including the Hong Kong Supplementary Character Set, on the Internet. It provides five conventional Chinese input methods (Cangjie, simplified Cangjie, Hanyu pinyin, Cantonese pronunciation, and a Chinese dictionary-like method based on radical and number of strokes), augmented with two special features: homophone and candidate character filtering. WebIME aims to make it easier and more convenient for users to input Chinese characters when surfing the Internet.
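
Several of the tools listed above convert a character between its code point number, UTF-8 and UTF-16 code units in hex, percent escapes, and numeric character references, and some also handle NFC/NFD normalization. For readers who want to check a result offline, the following is a minimal sketch of those conversions using only the Python standard library. The function name `describe` and the dictionary keys are hypothetical and chosen for illustration; this is not how any of the listed tools is implemented.

```python
import unicodedata
from urllib.parse import quote


def describe(ch: str) -> dict:
    """Return several common representations of a single character.

    Hypothetical helper for illustration; the key names are assumptions.
    """
    cp = ord(ch)
    utf16 = ch.encode("utf-16-be")  # big-endian, no byte order mark
    return {
        "code_point": f"U+{cp:04X}",
        "name": unicodedata.name(ch, "<unnamed>"),
        # UTF-8 bytes as space-separated uppercase hex
        "utf8_hex": ch.encode("utf-8").hex(" ").upper(),
        # UTF-16 code units (two bytes each) as uppercase hex
        "utf16_hex": " ".join(
            f"{int.from_bytes(utf16[i:i + 2], 'big'):04X}"
            for i in range(0, len(utf16), 2)
        ),
        # Percent escape of the UTF-8 bytes, as used in URLs
        "percent_escape": quote(ch),
        # Numeric character references, hexadecimal and decimal
        "ncr_hex": f"&#x{cp:X};",
        "ncr_dec": f"&#{cp};",
    }


info = describe("中")
print(info["code_point"])      # U+4E2D
print(info["utf8_hex"])        # E4 B8 AD
print(info["percent_escape"])  # %E4%B8%AD
print(info["ncr_dec"])         # &#20013;

# NFD splits a precomposed letter into base letter plus combining mark;
# NFC recomposes it.
decomposed = unicodedata.normalize("NFD", "\u00e9")
print(len(decomposed))                                       # 2
print(unicodedata.normalize("NFC", decomposed) == "\u00e9")  # True
```

Characters outside the Basic Multilingual Plane produce two UTF-16 code units (a surrogate pair), which the `utf16_hex` entry shows separated by a space.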
These materials are provided solely for public information and do not imply any endorsement by the Consortium. Nor do they constitute a certification of any level of compliance with standards published by the Unicode Consortium. Links to products are provided by third parties, and the Consortium bears no responsibility for their content or correctness. There is no claim that this material is in any way complete, accurate, or up-to-date.