Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????????^	001111110011111100111111001111110011111100111111001111110011111101011110	3f3f3f3f3f3f3f3f5e
SJIS-WIN	??擁衙??擁衙^	00111111001111111001011101101001111001011100100100111111001111111001011101101001111001011100100101011110	3f3f9769e5c93f3f9769e5c95e
EUC-JP	??擁衙??擁衙^	00111111001111111100110111001010111010101100101100111111001111111100110111001010111010101100101101011110	3f3fcdcaeacb3f3fcdcaeacb5e
UTF-8	솜셈擁衙솜셈擁衙^	11101100100001101001110011101100100001011000100011100110100100111000000111101000101000011001100111101100100001101001110011101100100001011000100011100110100100111000000111101000101000011001100101011110	ec869cec8588e69381e8a199ec869cec8588e69381e8a1995e
UHC	솜셈擁衙솜셈擁衙^	1011110011011000101111001100000011101000101101101110010010110111101111001101100010111100110000001110100010110110111001001011011101011110	bcd8bcc0e8b6e4b7bcd8bcc0e8b6e4b75e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)