Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	嘲勃ぐ麝た	10011010011111011001011001110101100000101010111011101010011011001000001010111101	9a7d967582aeea6c82bd
EUC-JP	嘲勃ぐ麝た	11010011110111101100101111010110101001001011000011110011110011011010010010111111	d3decbd6a4b0f3cda4bf
UTF-8	嘲勃ぐ麝た	111001011001100010110010111001011000101110000011111000111000000110010000111010011011101010011101111000111000000110011111	e598b2e58b83e38190e9ba9de3819f
UHC	嘲勃ぐ麝た	11110000101111111101101011111010101010101011000011011110111110101010101010111111	f0bfdafaaab0defaaabf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)