Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	坦樽損辰	1001001001010010100100100100110110010001101110011001001001000011	9252924d91b99243
EUC-JP	坦樽損辰	1100001110110011110000111010111011000010101110111100001110100100	c3b3c3aec2bbc3a4
UTF-8	坦樽損辰	111001011001110110100110111001101010100010111101111001101001000010001101111010001011111010110000	e59da6e6a8bde6908de8beb0
UHC	坦樽損辰	1111011110100100111100011101110011100001110111111111001011100011	f7a4f1dce1dff2e3

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
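
The conversion in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names (for example, SJIS-WIN to cp932 and UHC to cp949) and the function name charcode_rows are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?", which is assumed to be how the ISO-8859-1 row arrives at 3f3f3f3f.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def charcode_rows(text):
    """Return (charset, character, binary bits, hex) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # Unencodable characters become "?" instead of raising an error.
        raw = text.encode(codec, errors="replace")
        # Decode again to show what the charset actually stored.
        shown = raw.decode(codec)
        # Eight binary digits per byte, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, shown, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for row in charcode_rows("坦樽損辰"):
        print("\t".join(row))
```

Each byte is printed as eight binary digits, so a two-byte character in SJIS-WIN or EUC-JP contributes 16 bits and a three-byte character in UTF-8 contributes 24.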