Unicode Utilities: Unicode Language Identifers and BCP47

Warning: Testing version with both ICU and Unicode 10.0β properties!

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | idna | languageid

Input
  Localization:

Status

Source: sl-Cyrl-YU-rozaj-solba-1994-b-1234-a-Foobar-x-b-1234-a-Foobar

Canonical Form: sl-Cyrl-RS-1994-rozaj-solba-a-foobar-b-1234-x-b-1234-a-foobar

TypeCodeNameReplacement
LanguageslSlovenian
ScriptCyrlCyrillic
RegionYUSerbiaRS ME
Variant1994standardized resian orthography
Variantrozajresian
Variantsolbastolvizza/solbica dialect
Extensiona-foobar
Extensionb-1234
Private-Usex-b-1234-a-foobar

Samples

Notes


Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Noto Fonts site, Unicode Fonts for Ancient Scripts, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.8; ICU version: 59.1.0.0; Unicode version: 9.0.0.0