Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??G}??G{^	001111110011111101000111011111010011111100111111010001110111101101011110	3f3f477d3f3f477b5e
SJIS-WIN	搖?G}搖?G{^	1001110110001010001111110100011101111101100111011000101000111111010001110111101101011110	9d8a3f477d9d8a3f477b5e
EUC-JP	搖?G}搖?G{^	1101100111101010001111110100011101111101110110011110101000111111010001110111101101011110	d9ea3f477dd9ea3f477b5e
UTF-8	搖렪G}搖렪G{^	1110011010010000100101101110101110100000101010100100011101111101111001101001000010010110111010111010000010101010010001110111101101011110	e69096eba0aa477de69096eba0aa477b5e
UHC	搖렪G}搖렪G{^	11101000111101001000111010111000010001110111110111101000111101001000111010111000010001110111101101011110	e8f48eb8477de8f48eb8477b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)