Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??CD??CD^	001111110011111101000011010001000011111100111111010000110100010001011110	3f3f43443f3f43445e
SJIS-WIN	唏唸CD唏唸CD^	10011010010010001001101001011000010000110100010010011010010010001001101001011000010000110100010001011110	9a489a5843449a489a5843445e
EUC-JP	唏唸CD唏唸CD^	11010011101010011101001110111001010000110100010011010011101010011101001110111001010000110100010001011110	d3a9d3b94344d3a9d3b943445e
UTF-8	唏唸CD唏唸CD^	1110010110010100100011111110010110010100101110000100001101000100111001011001010010001111111001011001010010111000010000110100010001011110	e5948fe594b84344e5948fe594b843445e
UHC	??CD??CD^	001111110011111101000011010001000011111100111111010000110100010001011110	3f3f43443f3f43445e

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
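
The table above can be approximated offline with a short Python sketch. This is not the page's own implementation. The mapping from the page's charset labels to Python codec names (CODECS) and the helper names to_bits and charcode_rows are assumptions made for illustration. The sketch also assumes that characters a charset cannot represent are replaced with "?" (byte 3f), which is what the ISO-8859-1 and UHC rows appear to show.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Render each byte as eight binary digits, concatenated."""
    return "".join(f"{byte:08b}" for byte in data)


def charcode_rows(text: str):
    """Return (charset, character, binary, hexadecimal) tuples for text."""
    rows = []
    for label, codec in CODECS.items():
        # Characters the charset cannot represent become "?".
        encoded = text.encode(codec, errors="replace")
        # Decode again to show what survives the round trip.
        shown = encoded.decode(codec)
        rows.append((label, shown, to_bits(encoded), encoded.hex()))
    return rows


if __name__ == "__main__":
    for row in charcode_rows("唏唸CD唏唸CD^"):
        print("\t".join(row))
```

Each kanji takes two bytes in SJIS-WIN and EUC-JP and three bytes in UTF-8, while "C", "D", and "^" take one byte in every charset listed, which accounts for the different bit string lengths in the table.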