[Unicode]  The Standard Home | Site Map | Search
 

Supported Scripts

The Unicode Standard encodes scripts rather than languages per se. When writing systems for more than one language share sets of graphical symbols that have historically related derivations, the union of all of those graphical symbols is treated as a single collection of characters for encoding and is identified as a single script. Each script then serves as as an inventory of graphical symbols, which are drawn upon for the writing systems of particular languages. In many cases, a single script, such as the Latin script, may be used to write tens or even hundreds of languages. In other cases, only one language employs a particular script—for example, Hangul, which is typically used only to write the Korean language. The writing systems for some languages may also use more than one script; for example, Japanese traditionally makes use of the Han (Kanji), Hiragana, and Katakana scripts, and modern Japanese usage commonly mixes in the Latin script as well.

The scripts currently supported by Unicode 6.3.0 are listed below. Tool tips for each entry specify the version of the Unicode Standard which first supported the script.

  • Arabic
  • Aramaic, Imperial
  • Armenian
  • Avestan
  • Balinese
  • Bamum
  • Batak
  • Bengali
  • Bopomofo
  • Brahmi
  • Buginese
  • Buhid
  • Canadian Syllabics
  • Carian
  • Chakma
  • Cham
  • Cherokee
  • Coptic
  • Cypriot
  • Cyrillic
  • Deseret
  • Devanagari
  • Egyptian Hieroglyphs
  • Ethiopic
  • Georgian
  • Glagolitic
  • Gothic
  • Greek
  • Gujarati
  • Gurmukhi
  • Han
  • Hangul
  • Hanunóo
  • Hebrew
  • Hiragana
  • Javanese
  • Kaithi
  • Kannada
  • Katakana
  • Kayah Li
  • Kharoshthi
  • Khmer
  • Lao
  • Latin
  • Lepcha (Rong)
  • Limbu
  • Linear B
  • Lisu
  • Lycian
  • Lydian
  • Malayalam
  • Mandaic
  • Meetei Mayek
  • Meroitic Cursive
  • Meroitic Hieroglyphs
  • Miao
  • Mongolian
  • Myanmar
  • New Tai Lue
  • N'Ko
  • Ogham
  • Ol Chiki
  • Old Italic (Etruscan)
  • Old Persian Cuneiform
  • Old South Arabian
  • Old Turkic
  • Osmanya
  • Oriya
  • Pahlavi, Inscriptional
  • Parthian, Inscriptional
  • Phags-pa
  • Phoenician
  • Rejang
  • Runic
  • Saurashtra
  • Samaritan
  • Sharada
  • Shavian
  • Sinhala
  • Sora Sompeng
  • Sumero-Akkadian Cuneiform
  • Sundanese
  • Syloti Nagri
  • Syriac
  • Tagalog
  • Tagbanwa
  • Tai Le
  • Tai Tham
  • Tai Viet
  • Takri
  • Tamil
  • Telugu
  • Thaana
  • Thai
  • Tibetan
  • Tifinagh (Berber)
  • Ugaritic
  • Vai
  • Yi

In addition to the above scripts, a number of other collections of symbols are also encoded by Unicode. These collections consist of the following:

  • Numbers
  • General Diacritics
  • General Punctuation
  • General Symbols
  • Mathematical Symbols (Western and Arabic)
  • Musical Symbols (Western, Byzantine, and Ancient Greek)
  • Technical Symbols
  • Emoji
  • Dingbats
  • Arrows, Blocks, Box Drawing Forms, and Geometric Shapes
  • Game Symbols
  • Miscellaneous Symbols
  • Presentation Forms
  • Braille Patterns
  • Kangxi Radicals