What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??惟?????魏??魏??閻????┿ 111000011001111100111111001111111000100011010010001111110011111100111111001111110011111111101001101100000011111100111111111010011011000000111111001111111110100010000101001111110011111100111111001111111000010010111001 e19f3f3f88d23f3f3f3f3fe9b03f3fe9b03f3fe8853f3f3f3f84b9
EUC-JP 癲??惟?????魏??魏??閻????┿ 111000101010000100111111001111111011000011010100001111110011111100111111001111110011111111110010101100100011111100111111111100101011001000111111001111111110111111100101001111110011111100111111001111111010100010111011 e2a13f3fb0d43f3f3f3f3ff2b23f3ff2b23f3fefe53f3f3f3fa8bb
UTF-8 癲ㅺ슝惟뀀꽚烈ㅻ뿪魏쒏궇魏곹돦閻롫갭柳껓┿ 111001111001100110110010111000111000010110111010111011001000101010011101111001101000001110011111111010111000000010000000111010101011110110011010111011111010011010011111111000111000010110111011111010111011111110101010111010011010110110001111111011001001001010001111111010101011011010000111111010011010110110001111111010101011001110111001111010111000111110100110111010011001011010111011111010111010000110101011111010101011000010101101111011111010011110001001111010101011101110010011111000101001010010111111 e799b2e385baec8a9de6839feb8080eabd9aefa69fe385bbebbfaae9ad8fec928feab687e9ad8feab3b9eb8fa6e996bbeba1abeab0adefa789eabb93e294bf
UHC 癲ㅺ슝惟뀀꽚烈ㅻ뿪魏쒏궇魏곹돦閻롫갭柳껓┿ 111011111010011010100100111010101011110110111001111010101110111010110010111010111000010010101001111001101110111110100100111010111001011110101010111010101110000010011100111001101000001010100000111010101110000010000001111011011000100110101010111001111010001010001110111010111011000010111000111010101111011110000011111011111010011010111011 efa6a4eabdb9eaeeb2eb84a9e6efa4eb97aaeae09ce682a0eae081ed89aae7a28eebb0b8eaf783efa6bb

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
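
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names are assumptions: `cp932` stands in for SJIS-Win and `cp949` for UHC, and `encode_table` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (byte 3f), which appears to be what the table above does.

```python
# Map the table's charset labels to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = CP949
}


def encode_table(text):
    """Return {label: (binary_string, hex_string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexed) in encode_table("あ").items():
        print(label, bits, hexed)
```

For the input "あ", the UTF-8 row comes out as e38182, the SJIS-WIN row as 82a0, the EUC-JP row as a4a2, and the ISO-8859-1 row as 3f, because Latin-1 has no code point for that character.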