Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	壌緖ｸ丈ｦ而	10001111111010111111101110001110101110001000111111100100101001101000111010100111	8febfb8eb88fe4a68ea7
EUC-JP	壌?ｸ丈ｦ而	1011111011101101001111111000111010111000101111101110011010001110101001101011110010101001	beed3f8eb8bee68ea6bca9
UTF-8	壌緖ｸ丈ｦ而	111001011010001110001100111001111011011110010110111011111011110110111000111001001011100010001000111011111011110110100110111010001000000010001100	e5a38ce7b796efbdb8e4b888efbda6e8808c
UHC	?緖?丈?而	001111111101111111111101001111111110110111011011001111111110110010111011	3fdffd3feddb3fecbb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)