Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	?る?爰?ぜ	001111111000001011101001001111111110000010100111001111111000001010111010	3f82e93fe0a73f82ba
EUC-JP	?る?爰?ぜ	001111111010010011101011001111111110000010101001001111111010010010111100	3fa4eb3fe0a93fa4bc
UTF-8	閭る틳爰귟ぜ	111011111010011010000110111000111000001010001011111011011000101110110011111001111000100010110000111010101011011110011111111000111000000110011100	efa686e3828bed8bb3e788b0eab79fe3819c
UHC	閭る틳爰귟ぜ	111001101010110110101010111010111011101010011011111010101011101010000010111010001010101010111100	e6adaaebba9beaba82e8aabc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)