Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????B	00111111001111110011111100111111001111110011111101000010	3f3f3f3f3f3f42
SJIS-WIN	醇袒善醇袒善B	10001111100001101110010111010110100100010101000010001111100001101110010111010110100100010101000001000010	8f86e5d691508f86e5d6915042
EUC-JP	醇袒善醇袒善B	10111101111001101110101011011000110000011011000110111101111001101110101011011000110000011011000101000010	bde6ead8c1b1bde6ead8c1b142
UTF-8	醇袒善醇袒善B	11101001100001101000011111101000101000101001001011100101100101101000010011101001100001101000011111101000101000101001001011100101100101101000010001000010	e98687e8a292e59684e98687e8a292e5968442
UHC	醇袒善醇袒善B	11100010111101011101001110110010111000001011110011100010111101011101001110110010111000001011110001000010	e2f5d3b2e0bce2f5d3b2e0bc42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)