UTF-8 is capable of encoding all 1,112,064 valid Unicode code points
using up to four code bytes.
- starting bytes are
11xx xxxx
- continuation bytes are
10xx xxxx
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
| 0x |
NUL |
SOH |
STX |
ETX |
EOT |
ENQ |
ACK |
BEL |
BS |
HT |
LF |
VT |
FF |
CR |
SO |
SI |
| 1x |
DLE |
DC1 |
DC2 |
DC3 |
DC4 |
NAK |
SYN |
ETB |
CAN |
CAN |
SUB |
ESC |
FS |
GS |
RS |
US |
| 2x |
SP |
! |
" |
# |
$ |
% |
& |
' |
( |
) |
* |
+ |
, |
- |
. |
/ |
| 3x |
0 |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
9 |
: |
; |
< |
= |
> |
? |
| 4x |
@ |
A |
B |
C |
D |
E |
F |
G |
H |
I |
J |
K |
L |
M |
N |
O |
| 5x |
P |
Q |
R |
S |
T |
U |
V |
W |
X |
Y |
Z |
[ |
\ |
] |
^ |
_ |
| 6x |
` |
a |
b |
c |
d |
e |
f |
g |
h |
i |
j |
k |
l |
m |
n |
o |
| 7x |
p |
q |
r |
s |
t |
u |
v |
w |
x |
y |
z |
{ |
| |
} |
~ |
DEL |
| 8x |
+0 |
+1 |
+2 |
+3 |
+4 |
+5 |
+6 |
+7 |
+8 |
+9 |
+A |
+B |
+C |
+D |
+E |
+F |
| 9x |
+10 |
+11 |
+12 |
+13 |
+14 |
+15 |
+16 |
+17 |
+18 |
+19 |
+1A |
+1B |
+1C |
+1D |
+1E |
+1F |
| Ax |
+20 |
+21 |
+22 |
+23 |
+24 |
+25 |
+26 |
+27 |
+28 |
+29 |
+2A |
+2B |
+2C |
+2D |
+2E |
+2F |
| Bx |
+30 |
+31 |
+32 |
+33 |
+34 |
+35 |
+36 |
+37 |
+38 |
+39 |
+3A |
+3B |
+3C |
+3D |
+3E |
+3F |
| Cx |
[2] |
[2] |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
| Dx |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
| Ex |
3 |
3 |
3 |
3 |
3 |
3 |
3 |
3 |
3 |
3 |
3 |
3 |
3 |
3 |
3 |
3 |
| Fx |
4 |
4 |
4 |
4 |
4 |
[4] |
[4] |
[4] |
[5] |
[5] |
[5] |
[5] |
[6] |
[6] |
|
|
This block ranges from U+0080 to U+00FF, contains 128 characters and includes the C1 controls, Latin-1 punctuation and symbols, 30 pairs of majuscule and minuscule accented Latin characters and 2 mathematical operators.
¢ = c2 a2
| 0xC2 Controls and Latin-1 Supplement |
| |
0 |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
9 |
A |
B |
C |
D |
E |
F |
| U+008x |
XXX |
XXX |
BPH |
NBH |
IND |
NEL |
SSA |
ESA |
HTS |
HTJ |
VTS |
PLD |
PLU |
RI |
SS2 |
SS3 |
| U+009x |
DCS |
PU1 |
PU2 |
STS |
CCH |
MW |
SPA |
EPA |
SOS |
XXX |
SCI |
CSI |
ST |
OSC |
PM |
APC |
| U+00Ax |
NBSP |
¡ |
¢ |
£ |
¤ |
¥ |
¦ |
§ |
¨ |
© |
ª |
« |
¬ |
SHY |
® |
¯ |
| U+00Bx |
° |
± |
² |
³ |
´ |
µ |
¶ |
· |
¸ |
¹ |
º |
» |
¼ |
½ |
¾ |
¿ |
| U+00Cx |
À |
Á |
 |
à |
Ä |
Å |
Æ |
Ç |
È |
É |
Ê |
Ë |
Ì |
Í |
Î |
Ï |
| U+00Dx |
Ð |
Ñ |
Ò |
Ó |
Ô |
Õ |
Ö |
× |
Ø |
Ù |
Ú |
Û |
Ü |
Ý |
Þ |
ß |
| U+00Ex |
à |
á |
â |
ã |
ä |
å |
æ |
ç |
è |
é |
ê |
ë |
ì |
í |
î |
ï |
| U+00Fx |
ð |
ñ |
ò |
ó |
ô |
õ |
ö |
÷ |
ø |
ù |
ú |
û |
ü |
ý |
þ |
ÿ |
λ : ce bb
| Greek and Coptic |
|
0 |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
9 |
A |
B |
C |
D |
E |
F |
| U+037x |
Ͱ |
ͱ |
Ͳ |
ͳ |
ʹ |
͵ |
|
|
Ͷ |
ͷ |
ͺ |
ͻ |
ͼ |
ͽ |
; |
Ϳ |
| U+038x |
|
|
|
|
΄ |
΅ |
Ά |
· |
Έ |
Ή |
Ί |
|
Ό |
|
Ύ |
Ώ |
| U+039x |
ΐ |
Α |
Β |
Γ |
Δ |
Ε |
Ζ |
Η |
Θ |
Ι |
Κ |
Λ |
Μ |
Ν |
Ξ |
Ο |
| U+03Ax |
Π |
Ρ |
|
Σ |
Τ |
Υ |
Φ |
Χ |
Ψ |
Ω |
Ϊ |
Ϋ |
ά |
έ |
ή |
ί |
| U+03Bx |
ΰ |
α |
β |
γ |
δ |
ε |
ζ |
η |
θ |
ι |
κ |
λ |
μ |
ν |
ξ |
ο |
| U+03Cx |
π |
ρ |
ς |
σ |
τ |
υ |
φ |
χ |
ψ |
ω |
ϊ |
ϋ |
ό |
ύ |
ώ |
Ϗ |
| U+03Dx |
ϐ |
ϑ |
ϒ |
ϓ |
ϔ |
ϕ |
ϖ |
ϗ |
Ϙ |
ϙ |
Ϛ |
ϛ |
Ϝ |
ϝ |
Ϟ |
ϟ |
| U+03Ex |
Ϡ |
ϡ |
Ϣ |
ϣ |
Ϥ |
ϥ |
Ϧ |
ϧ |
Ϩ |
ϩ |
Ϫ |
ϫ |
Ϭ |
ϭ |
Ϯ |
ϯ |
| U+03Fx |
ϰ |
ϱ |
ϲ |
ϳ |
ϴ |
ϵ |
϶ |
Ϸ |
ϸ |
Ϲ |
Ϻ |
ϻ |
ϼ |
Ͻ |
Ͼ |
Ͽ |
三個和尚沒水å–
(Chinese Proverb)
incoming: left 2024