Unicode Characters and Blocks

Unicode is a standard first released in 1991 which covers most characters from all of the world's written systems.

It covers current and historical scripts, alphabets, symbols, emojis and non-printable codes for controlling and formatting.

At the time of writing it contains almost 150,000 characters and is continually being updated by the Unicode Consortium.

The characters are arranged in planes and blocks.
The current blocks and links to the defined characters on the Unicode website are provided below.

 

Block Range Block Name (+Link to Unicode PDF)
U0000 - U007FBasic Latin
U0080 - U00FFLatin-1 Supplement
U0100 - U017FLatin Extended-A
U0180 - U024FLatin Extended-B
U0250 - U02AFIPA Extensions
U02B0 - U02FFSpacing Modifier Letters
U0300 - U036FCombining Diacritical Marks
U0370 - U03FFGreek and Coptic
U0400 - U04FFCyrillic
U0500 - U052FCyrillic Supplement
U0530 - U058FArmenian
U0590 - U05FFHebrew
U0600 - U06FFArabic
U0700 - U074FSyriac
U0750 - U077FArabic Supplement
U0780 - U07BFThaana
U07C0 - U07FFNKo
U0800 - U083FSamaritan
U0840 - U085FMandaic
U0860 - U086FSyriac Supplement
U08A0 - U08FFArabic Extended-A
U0900 - U097FDevanagari
U0980 - U09FFBengali
U0A00 - U0A7FGurmukhi
U0A80 - U0AFFGujarati
U0B00 - U0B7FOriya
U0B80 - U0BFFTamil
U0C00 - U0C7FTelugu
U0C80 - U0CFFKannada
U0D00 - U0D7FMalayalam
U0D80 - U0DFFSinhala
U0E00 - U0E7FThai
U0E80 - U0EFFLao
U0F00 - U0FFFTibetan
U1000 - U109FMyanmar
U10A0 - U10FFGeorgian
U1100 - U11FFHangul Jamo
U1200 - U137FEthiopic
U1380 - U139FEthiopic Supplement
U13A0 - U13FFCherokee
U1400 - U167FUnified Canadian Aboriginal Syllabics
U1680 - U169FOgham
U16A0 - U16FFRunic
U1700 - U171FTagalog
U1720 - U173FHanunoo
U1740 - U175FBuhid
U1760 - U177FTagbanwa
U1780 - U17FFKhmer
U1800 - U18AFMongolian
U18B0 - U18FFUnified Canadian Aboriginal Syllabics Extended
U1900 - U194FLimbu
U1950 - U197FTai Le
U1980 - U19DFNew Tai Lue
U19E0 - U19FFKhmer Symbols
U1A00 - U1A1FBuginese
U1A20 - U1AAFTai Tham
U1AB0 - U1AFFCombining Diacritical Marks Extended
U1B00 - U1B7FBalinese
U1B80 - U1BBFSundanese
U1BC0 - U1BFFBatak
U1C00 - U1C4FLepcha
U1C50 - U1C7FOl Chiki
U1C80 - U1C8FCyrillic Extended-C
U1C90 - U1CBFGeorgian Extended
U1CC0 - U1CCFSundanese Supplement
U1CD0 - U1CFFVedic Extensions
U1D00 - U1D7FPhonetic Extensions
U1D80 - U1DBFPhonetic Extensions Supplement
U1DC0 - U1DFFCombining Diacritical Marks Supplement
U1E00 - U1EFFLatin Extended Additional
U1F00 - U1FFFGreek Extended
U2000 - U206FGeneral Punctuation
U2070 - U209FSuperscripts and Subscripts
U20A0 - U20CFCurrency Symbols
U20D0 - U20FFCombining Diacritical Marks for Symbols
U2100 - U214FLetterlike Symbols
U2150 - U218FNumber Forms
U2190 - U21FFArrows
U2200 - U22FFMathematical Operators
U2300 - U23FFMiscellaneous Technical
U2400 - U243FControl Pictures
U2440 - U245FOptical Character Recognition
U2460 - U24FFEnclosed Alphanumerics
U2500 - U257FBox Drawing
U2580 - U259FBlock Elements
U25A0 - U25FFGeometric Shapes
U2600 - U26FFMiscellaneous Symbols
U2700 - U27BFDingbats
U27C0 - U27EFMiscellaneous Mathematical Symbols-A
U27F0 - U27FFSupplemental Arrows-A
U2800 - U28FFBraille Patterns
U2900 - U297FSupplemental Arrows-B
U2980 - U29FFMiscellaneous Mathematical Symbols-B
U2A00 - U2AFFSupplemental Mathematical Operators
U2B00 - U2BFFMiscellaneous Symbols and Arrows
U2C00 - U2C5FGlagolitic
U2C60 - U2C7FLatin Extended-C
U2C80 - U2CFFCoptic
U2D00 - U2D2FGeorgian Supplement
U2D30 - U2D7FTifinagh
U2D80 - U2DDFEthiopic Extended
U2DE0 - U2DFFCyrillic Extended-A
U2E00 - U2E7FSupplemental Punctuation
U2E80 - U2EFFCJK Radicals Supplement
U2F00 - U2FDFKangxi Radicals
U2FF0 - U2FFFIdeographic Description Characters
U3000 - U303FCJK Symbols and Punctuation
U3040 - U309FHiragana
U30A0 - U30FFKatakana
U3100 - U312FBopomofo
U3130 - U318FHangul Compatibility Jamo
U3190 - U319FKanbun
U31A0 - U31BFBopomofo Extended
U31C0 - U31EFCJK Strokes
U31F0 - U31FFKatakana Phonetic Extensions
U3200 - U32FFEnclosed CJK Letters and Months
U3300 - U33FFCJK Compatibility
U3400 - U4DBFCJK Unified Ideographs Extension A
U4DC0 - U4DFFYijing Hexagram Symbols
U4E00 - U9FFFCJK Unified Ideographs
UA000 - UA48FYi Syllables
UA490 - UA4CFYi Radicals
UA4D0 - UA4FFLisu
UA500 - UA63FVai
UA640 - UA69FCyrillic Extended-B
UA6A0 - UA6FFBamum
UA700 - UA71FModifier Tone Letters
UA720 - UA7FFLatin Extended-D
UA800 - UA82FSyloti Nagri
UA830 - UA83FCommon Indic Number Forms
UA840 - UA87FPhags-pa
UA880 - UA8DFSaurashtra
UA8E0 - UA8FFDevanagari Extended
UA900 - UA92FKayah Li
UA930 - UA95FRejang
UA960 - UA97FHangul Jamo Extended-A
UA980 - UA9DFJavanese
UA9E0 - UA9FFMyanmar Extended-B
UAA00 - UAA5FCham
UAA60 - UAA7FMyanmar Extended-A
UAA80 - UAADFTai Viet
UAAE0 - UAAFFMeetei Mayek Extensions
UAB00 - UAB2FEthiopic Extended-A
UAB30 - UAB6FLatin Extended-E
UAB70 - UABBFCherokee Supplement
UABC0 - UABFFMeetei Mayek
UAC00 - UD7AFHangul Syllables
UD7B0 - UD7FFHangul Jamo Extended-B
UD800 - UDB7FHigh Surrogates
UDB80 - UDBFFHigh Private Use Surrogates
UDC00 - UDFFFLow Surrogates
UE000 - UF8FFPrivate Use Area
UF900 - UFAFFCJK Compatibility Ideographs
UFB00 - UFB4FAlphabetic Presentation Forms
UFB50 - UFDFFArabic Presentation Forms-A
UFE00 - UFE0FVariation Selectors
UFE10 - UFE1FVertical Forms
UFE20 - UFE2FCombining Half Marks
UFE30 - UFE4FCJK Compatibility Forms
UFE50 - UFE6FSmall Form Variants
UFE70 - UFEFFArabic Presentation Forms-B
UFF00 - UFFEFHalfwidth and Fullwidth Forms
UFFF0 - UFFFFSpecials
U10000 - U1007FLinear B Syllabary
U10080 - U100FFLinear B Ideograms
U10100 - U1013FAegean Numbers
U10140 - U1018FAncient Greek Numbers
U10190 - U101CFAncient Symbols
U101D0 - U101FFPhaistos Disc
U10280 - U1029FLycian
U102A0 - U102DFCarian
U102E0 - U102FFCoptic Epact Numbers
U10300 - U1032FOld Italic
U10330 - U1034FGothic
U10350 - U1037FOld Permic
U10380 - U1039FUgaritic
U103A0 - U103DFOld Persian
U10400 - U1044FDeseret
U10450 - U1047FShavian
U10480 - U104AFOsmanya
U104B0 - U104FFOsage
U10500 - U1052FElbasan
U10530 - U1056FCaucasian Albanian
U10600 - U1077FLinear A
U10800 - U1083FCypriot Syllabary
U10840 - U1085FImperial Aramaic
U10860 - U1087FPalmyrene
U10880 - U108AFNabataean
U108E0 - U108FFHatran
U10900 - U1091FPhoenician
U10920 - U1093FLydian
U10980 - U1099FMeroitic Hieroglyphs
U109A0 - U109FFMeroitic Cursive
U10A00 - U10A5FKharoshthi
U10A60 - U10A7FOld South Arabian
U10A80 - U10A9FOld North Arabian
U10AC0 - U10AFFManichaean
U10B00 - U10B3FAvestan
U10B40 - U10B5FInscriptional Parthian
U10B60 - U10B7FInscriptional Pahlavi
U10B80 - U10BAFPsalter Pahlavi
U10C00 - U10C4FOld Turkic
U10C80 - U10CFFOld Hungarian
U10D00 - U10D3FHanifi Rohingya
U10E60 - U10E7FRumi Numeral Symbols
U10E80 - U10EBFYezidi
U10F00 - U10F2FOld Sogdian
U10F30 - U10F6FSogdian
U10FB0 - U10FDFChorasmian
U10FE0 - U10FFFElymaic
U11000 - U1107FBrahmi
U11080 - U110CFKaithi
U110D0 - U110FFSora Sompeng
U11100 - U1114FChakma
U11150 - U1117FMahajani
U11180 - U111DFSharada
U111E0 - U111FFSinhala Archaic Numbers
U11200 - U1124FKhojki
U11280 - U112AFMultani
U112B0 - U112FFKhudawadi
U11300 - U1137FGrantha
U11400 - U1147FNewa
U11480 - U114DFTirhuta
U11580 - U115FFSiddham
U11600 - U1165FModi
U11660 - U1167FMongolian Supplement
U11680 - U116CFTakri
U11700 - U1173FAhom
U11800 - U1184FDogra
U118A0 - U118FFWarang Citi
U11900 - U1195FDives Akuru
U119A0 - U119FFNandinagari
U11A00 - U11A4FZanabazar Square
U11A50 - U11AAFSoyombo
U11AC0 - U11AFFPau Cin Hau
U11C00 - U11C6FBhaiksuki
U11C70 - U11CBFMarchen
U11D00 - U11D5FMasaram Gondi
U11D60 - U11DAFGunjala Gondi
U11EE0 - U11EFFMakasar
U11FB0 - U11FBFLisu Supplement
U11FC0 - U11FFFTamil Supplement
U12000 - U123FFCuneiform
U12400 - U1247FCuneiform Numbers and Punctuation
U12480 - U1254FEarly Dynastic Cuneiform
U13000 - U1342FEgyptian Hieroglyphs
U13430 - U1343FEgyptian Hieroglyph Format Controls
U14400 - U1467FAnatolian Hieroglyphs
U16800 - U16A3FBamum Supplement
U16A40 - U16A6FMro
U16AD0 - U16AFFBassa Vah
U16B00 - U16B8FPahawh Hmong
U16E40 - U16E9FMedefaidrin
U16F00 - U16F9FMiao
U16FE0 - U16FFFIdeographic Symbols and Punctuation
U17000 - U187FFTangut
U18800 - U18AFFTangut Components
U18B00 - U18CFFKhitan Small Script
U18D00 - U18D8FTangut Supplement
U1B000 - U1B0FFKana Supplement
U1B100 - U1B12FKana Extended-A
U1B130 - U1B16FSmall Kana Extension
U1B170 - U1B2FFNushu
U1BC00 - U1BC9FDuployan
U1BCA0 - U1BCAFShorthand Format Controls
U1D000 - U1D0FFByzantine Musical Symbols
U1D100 - U1D1FFMusical Symbols
U1D200 - U1D24FAncient Greek Musical Notation
U1D2E0 - U1D2FFMayan Numerals
U1D300 - U1D35FTai Xuan Jing Symbols
U1D360 - U1D37FCounting Rod Numerals
U1D400 - U1D7FFMathematical Alphanumeric Symbols
U1D800 - U1DAAFSutton SignWriting
U1E000 - U1E02FGlagolitic Supplement
U1E100 - U1E14FNyiakeng Puachue Hmong
U1E2C0 - U1E2FFWancho
U1E800 - U1E8DFMende Kikakui
U1E900 - U1E95FAdlam
U1EC70 - U1ECBFIndic Siyaq Numbers
U1ED00 - U1ED4FOttoman Siyaq Numbers
U1EE00 - U1EEFFArabic Mathematical Alphabetic Symbols
U1F000 - U1F02FMahjong Tiles
U1F030 - U1F09FDomino Tiles
U1F0A0 - U1F0FFPlaying Cards
U1F100 - U1F1FFEnclosed Alphanumeric Supplement
U1F200 - U1F2FFEnclosed Ideographic Supplement
U1F300 - U1F5FFMiscellaneous Symbols and Pictographs
U1F600 - U1F64FEmoticons
U1F650 - U1F67FOrnamental Dingbats
U1F680 - U1F6FFTransport and Map Symbols
U1F700 - U1F77FAlchemical Symbols
U1F780 - U1F7FFGeometric Shapes Extended
U1F800 - U1F8FFSupplemental Arrows-C
U1F900 - U1F9FFSupplemental Symbols and Pictographs
U1FA00 - U1FA6FChess Symbols
U1FA70 - U1FAFFSymbols and Pictographs Extended-A
U1FB00 - U1FBFFSymbols for Legacy Computing
U20000 - U2A6DFCJK Unified Ideographs Extension B
U2A700 - U2B73FCJK Unified Ideographs Extension C
U2B740 - U2B81FCJK Unified Ideographs Extension D
U2B820 - U2CEAFCJK Unified Ideographs Extension E
U2CEB0 - U2EBEFCJK Unified Ideographs Extension F
U2F800 - U2FA1FCJK Compatibility Ideographs Supplement
U30000 - U3134FCJK Unified Ideographs Extension G
UE0000 - UE007FTags
UE0100 - UE01EFVariation Selectors Supplement
UF0000 - UFFFFFSupplementary Private Use Area-A
U100000 - U10FFFFSupplementary Private Use Area-B

Related Pages