Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????????	001111110011111100111111001111110011111100111111001111110011111100111111	3f3f3f3f3f3f3f3f3f
SJIS-WIN	辱??乳??醫??	100100000100101000111111001111111001001111111011001111110011111111100111110011100011111100111111	904a3f3f93fb3f3fe7ce3f3f
EUC-JP	辱??乳??醫??	101111111010101100111111001111111100011011111101001111110011111111101110110100000011111100111111	bfab3f3fc6fd3f3feed03f3f
UTF-8	辱됰씭乳득벚醫귙닀	111010001011111010110001111010111001000010110000111011001001010010101101111001001011100110110011111010111001001110011101111010111011001010011010111010011000011010101011111010101011011110011001111010111000101110000000	e8beb1eb90b0ec94ade4b9b3eb939debb29ae986abeab799eb8b80
UHC	辱됰씭乳득벚醫귙닀	111010011011010010001001111010111001110110111110111010101110000110110101111001101011101010100010111011001010001010000010111000111000100010001001	e9b489eb9dbeeae1b5e6baa2eca282e38889

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)