Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	萬駈什莨件従	111001001101110110001011111011011000111101011001111001001011110010001100100011111000111101011101	e4dd8bed8f59e4bc8c8f8f5d
EUC-JP	萬駈什莨件従	111010001101111110110110111011111011110110111010111010001011111010110111111011111011110110111110	e8dfb6efbdbae8beb7efbdbe
UTF-8	萬駈什莨件従	111010001001000010101100111010011010011110001000111001001011101110000000111010001000111010101000111001001011101110110110111001011011111010010011	e890ace9a788e4bb80e88ea8e4bbb6e5be93
UHC	萬?什?件?	110110001011111100111111111001001010011100111111110010111110110000111111	d8bf3fe4a73fcbec3f

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win, i.e. CP932) and on UNIX (EUC-JP).
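
The table above can be approximated offline with a few lines of Python. This is a minimal sketch, not the converter's actual implementation: the helper names (`to_bits`, `encode_row`, `build_table`) are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters a charset cannot represent are replaced with "?" (0x3f), as in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Render each byte as eight binary digits, concatenated."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_row(text: str, codec: str) -> tuple[str, str, str]:
    """Return (displayed text, bit string, hex string) for one codec.

    Characters the codec cannot represent are replaced with '?'.
    """
    data = text.encode(codec, errors="replace")
    # Decode again so the first column shows what actually survived encoding.
    return data.decode(codec), to_bits(data), data.hex()


def build_table(text: str) -> list[tuple[str, str, str, str]]:
    """Return one (charset, text, bits, hex) row per charset in CHARSETS."""
    return [(label, *encode_row(text, codec)) for label, codec in CHARSETS.items()]
```

Calling `build_table("萬駈什莨件従")` and printing each row joined by tabs gives a table of the same shape as the one above. Byte values for the legacy charsets may differ between implementations for characters outside the core character sets.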