Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	謳野さ縲よｲ	1110011010010000100101101110110010000010101100111110001110000000100000101110011010110010	e69096ec82b3e38082e6b2
EUC-JP	謳野さ縲よｲ	111010111111000011001100111011101010010010110101111001011110000010100100111010001000111010110010	ebf0cceea4b5e5e0a4e88eb2
UTF-8	謳野さ縲よｲ	111010001010110010110011111010011000011110001110111000111000000110010101111001111011100010110010111000111000001010001000111011111011110110110010	e8acb3e9878ee38195e7b8b2e38288efbdb2
UHC	謳野さ?よ?	11001111110001001110010110101111101010101011010100111111101010101110100000111111	cfc4e5afaab53faae83f

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
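
The table above can be approximated offline with a short Python sketch. This is not the converter's own code: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration, and Python's codecs may differ from the converter for a few vendor-specific characters. Characters a charset cannot represent are replaced with `?` (0x3f), as in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, round-tripped text, binary string, hex string) per charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        shown = raw.decode(codec)                   # what survives the round trip
        bits = "".join(f"{b:08b}" for b in raw)     # 8 bits per byte
        rows.append((label, shown, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexstr in encode_table("さ"):
        print(label, bits, hexstr, sep="\t")
```

For the single character `さ`, the UTF-8 row of this sketch yields the hex string `e38195`, which matches the third character's bytes in the UTF-8 row of the table.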