Unicode Utilities: Unicode Language Identifers and BCP47

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | idna | languageid

Input
  Localization:

Status

Source: fr-CA

TypeCodeNameReplacement
LanguagefrFrench
RegionCACanada

Source: gsw-Arab-AQ

TypeCodeNameReplacement
LanguagegswSwiss German
ScriptArabArabic
RegionAQAntarctica

Source: eng-Latn-840

Canonical Form: en-Latn-US

Minimal Form: en

TypeCodeNameReplacement
Languageenginvalid codeen
ScriptLatnLatin
Region840invalid CodeUS

Samples

Notes


Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Unicode Fonts for Ancient Scripts, Noto Fonts site, Large, multi-script Unicode fonts. See also: Unicode Display Problems.

Version 3.7; ICU version: 57.0.1.0; Unicode version: 8.0.0.0