Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	夷??藥	100010001100111000111111001111111110010101011010	88ce3f3fe55a
EUC-JP	夷??藥	101100001101000000111111001111111110100110111011	b0d03f3fe9bb
UTF-8	夷덉갹藥	111001011010010010110111111010111000110110001001111010101011000010111001111010001001011110100101	e5a4b7eb8d89eab0b9e897a5
UHC	夷덉갹藥	1110110010101000100010001110110010110000101111011110010110110111	eca888ecb0bde5b7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)