Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???????	00111111001111110011111100111111001111110011111100111111	3f3f3f3f3f3f3f
SJIS-WIN	?????緬?	0011111100111111001111110011111100111111100101101100100100111111	3f3f3f3f3f96c93f
EUC-JP	?????緬?	0011111100111111001111110011111100111111110011001100101100111111	3f3f3f3f3fcccb3f
UTF-8	야롎섞뤶쫷緬렱	111011001001010110111100111010111010000110001110111011001000010010011110111010111010010010110110111011001010101110110111111001111011011110101100111010111010000010110001	ec95bceba18eec849eeba4b6ecabb7e7b7aceba0b1
UHC	야롎섞뤶쫷緬렱	1011111011011111100011101101010010111100101011111000111111100100101001101000111011011000111110111000111010111110	bedf8ed4bcaf8fe4a68ed8fb8ebe

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
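
A table like the one above can be approximated with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation: the helper name charcode_rows is hypothetical, and it assumes Python's cp932 and cp949 codecs are close enough stand-ins for SJIS-WIN and UHC. Characters that a charset cannot represent are replaced with "?" (3f), which appears to be what the ISO-8859-1, SJIS-WIN and EUC-JP rows show.

```python
# Hypothetical helper: row labels follow the table, codec names are Python's.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def charcode_rows(text):
    """Return (charset, character, binary, hexadecimal) tuples for text."""
    rows = []
    for label, codec in CHARSETS:
        # Unencodable characters become "?" (byte 0x3f) instead of raising.
        data = text.encode(codec, errors="replace")
        # Decode again so the Character column shows what actually survived.
        shown = data.decode(codec, errors="replace")
        # Eight bits per byte, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for row in charcode_rows("緬"):
        print("\t".join(row))
```

Each printed line is tab-separated in the same column order as the table, so the output for a short string can be compared with the converter's result directly.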