Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????????B	00111111001111110011111100111111001111110011111100111111001111110011111101000010	3f3f3f3f3f3f3f3f3f42
SJIS-WIN	額??屍??藥??B	10001010011110100011111100111111100011100111001000111111001111111110010101011010001111110011111101000010	8a7a3f3f8e723f3fe55a3f3f42
EUC-JP	額??屍??藥??B	10110011110110110011111100111111101110111101001100111111001111111110100110111011001111110011111101000010	b3db3f3fbbd33f3fe9bb3f3f42
UTF-8	額뗨뛝屍뜈괮藥럤뤇B	11101001101000011000110111101011100101111010100011101011100110111001110111100101101100011000110111101011100111001000100011101010101101001010111011101000100101111010010111101011100111111010010011101011101001001000011101000010	e9a18deb97a8eb9b9de5b18deb9c88eab4aee897a5eb9fa4eba48742
UHC	額뗨뛝屍뜈괮藥럤뤇B	11100100111111101000101111101000100011010101101011100011101110111000110110001011100000100101010111100101101101111000111010000111100011111011011101000010	e4fe8be88d5ae3bb8d8b8255e5b78e878fb742

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win, also known as CP932) and UNIX (EUC-JP).
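
The rows in the table above can be approximated with a few lines of Python. This is only a sketch, not the converter's actual implementation: the function name encode_row and the sample string are made up for illustration, and it assumes Python's codec names "cp932" and "cp949" correspond to SJIS-WIN and UHC. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is the pattern visible in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
# Sketch: encode a short string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.
# Assumption: Python codec names stand in for the charsets in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (bit string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


if __name__ == "__main__":
    sample = "額뗨B"  # hypothetical input: one kanji, one hangul syllable, one ASCII letter
    for label, codec in CHARSETS.items():
        bits, hexed = encode_row(sample, codec)
        print(f"{label}\t{bits}\t{hexed}")
```

Each byte is printed as eight bits, so the binary column is always four times as long as the hexadecimal column.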