Character and Charcode - Check how a computer recognizes characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	H{@sGJH{	10010010010010000111101110010010010000000111001110010010010001111001001001001010100100100100100001111011	92487b9240739247924a92487b
SJIS-WIN	?H{?@s?G?J?H{	00111111010010000111101100111111010000000111001100111111010001110011111101001010001111110100100001111011	3f487b3f40733f473f4a3f487b
EUC-JP	?H{?@s?G?J?H{	00111111010010000111101100111111010000000111001100111111010001110011111101001010001111110100100001111011	3f487b3f40733f473f4a3f487b
UTF-8	H{@sGJH{	110000101001001001001000011110111100001010010010010000000111001111000010100100100100011111000010100100100100101011000010100100100100100001111011	c292487bc2924073c29247c2924ac292487b
UHC	?H{?@s?G?J?H{	00111111010010000111101100111111010000000111001100111111010001110011111101001010001111110100100001111011	3f487b3f40733f473f4a3f487b

SJIS-WIN, EUC-JP: Classic charsets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
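
The conversion in the table can be approximated with a short Python sketch. It is not the code behind this page; the function name `to_bits_and_hex` and the `CHARSETS` mapping are hypothetical, and the Python codec names chosen for each row (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions. Characters a charset cannot represent are replaced with `?` (byte 3f) through `errors="replace"`, which appears to match the 3f bytes in the rows above, although a different converter may substitute differently.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def to_bits_and_hex(text, codec):
    """Encode text with the given codec and return (bit string, hex string)."""
    # Unencodable characters become '?' instead of raising an error.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "\u0092H{"  # U+0092 followed by two ASCII characters
    for label, codec in CHARSETS.items():
        bits, hexa = to_bits_and_hex(sample, codec)
        print(label, bits, hexa, sep="\t")
```

Each byte is printed as eight binary digits, so the bit string is always four times as long as the hexadecimal string. In UTF-8, U+0092 takes two bytes (c2 92), which is why the UTF-8 row is longer than the ISO-8859-1 row for the same input.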