Character and Charcode - Check how a computer recognizes characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????E??	00111111001111110011111100111111010001010011111100111111	3f3f3f3f453f3f
SJIS-WIN	鞜擾ｽｭE隰千	1110100011011111100011111110111110111101101011010100010111101000101011001001000011100111	e8df8fefbdad45e8ac90e7
EUC-JP	鞜擾ｽｭE隰千	11110000111000011011111011110001100011101011110110001110101011010100010111110000101011101100000011101001	f0e1bef18ebd8ead45f0aec0e9
UTF-8	鞜擾ｽｭE隰千	11101001100111101001110011100110100100111011111011101111101111011011110111101111101111011010110101000101111010011001101010110000111001011000110110000011	e99e9ce693beefbdbdefbdad45e99ab0e58d83
UHC	?擾??E?千	001111111110100011110110001111110011111101000101001111111111010010110110	3fe8f63f3f453ff4b6

SJIS-Win, EUC-JP: Legacy character encodings used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
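
The conversion shown in the table can be approximated with a short Python sketch. This is only an illustration, not the page's actual implementation: the function name `describe` is hypothetical, and the Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumed stand-ins for the charset labels ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8 and UHC, so edge cases may differ from the tool. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches the runs of 3f in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text, codec):
    """Return (bit string, hex string) for text encoded with the given codec.

    Characters the codec cannot encode are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "E\u5343"  # "E" followed by U+5343, the last character in the table
    for label, codec in CHARSETS.items():
        bits, hexa = describe(sample, codec)
        print(f"{label}\t{bits}\t{hexa}")
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.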