Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????B	001111110011111100111111001111110011111101000010	3f3f3f3f3f42
SJIS-WIN	讌泌ｱ題犒B	11100110101001011001010011100101101100011001000111101000111000001011010101000010	e6a594e5b191e8e0b542
EUC-JP	讌泌ｱ題犒B	1110110010100111110010001110011110001110101100011100001011101010111000001011011101000010	eca7c8e78eb1c2eae0b742
UTF-8	讌泌ｱ題犒B	11101000101011101000110011100110101100111000110011101111101111011011000111101001101000011000110011100111100010101001001001000010	e8ae8ce6b38cefbdb1e9a18ce78a9242
UHC	?泌?題?B	0011111111111001101100100011111111110000101110010011111101000010	3ff9b23ff0b93f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)