Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	歪??罌??	1001100001100011001111110011111111100011101000000011111100111111	98633f3fe3a03f3f
EUC-JP	歪??罌??	1100111111000100001111110011111111100110101000100011111100111111	cfc43f3fe6a23f3f
UTF-8	歪뉔쨹罌녕튊	111001101010110110101010111010111000100110010100111011001010100010111001111001111011110110001100111010111000010110010101111011011000101010001010	e6adaaeb8994eca8b9e7bd8ceb8595ed8a8a
UHC	歪뉔쨹罌녕튊	111010001110000010000111111010011010010010010011111001011010001010110011111001111011100110011110	e8e087e9a493e5a2b3e7b99e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)