To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 辱??乳??醫?? 100100000100101000111111001111111001001111111011001111110011111111100111110011100011111100111111 904a3f3f93fb3f3fe7ce3f3f
EUC-JP 辱??乳??醫?? 101111111010101100111111001111111100011011111101001111110011111111101110110100000011111100111111 bfab3f3fc6fd3f3feed03f3f
UTF-8 辱됰씭乳득벚醫귙닀 111010001011111010110001111010111001000010110000111011001001010010101101111001001011100110110011111010111001001110011101111010111011001010011010111010011000011010101011111010101011011110011001111010111000101110000000 e8beb1eb90b0ec94ade4b9b3eb939debb29ae986abeab799eb8b80
UHC 辱됰씭乳득벚醫귙닀 111010011011010010001001111010111001110110111110111010101110000110110101111001101011101010100010111011001010001010000010111000111000100010001001 e9b489eb9dbeeae1b5e6baa2eca282e38889

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)