To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??椅??音?????宥?$源??曖 1110000110011111001111110011111110001000110101100011111100111111100010011011100100111111001111110011111100111111001111111001011101000111001111111000000110010000100011001011100100111111001111111001111001000010 e19f3f3f88d63f3f89b93f3f3f3f3f97473f81908cb93f3f9e42
EUC-JP 癲??椅??音??饔??宥?$源??曖 11100010101000010011111100111111101100001101100000111111001111111011001010111011001111110011111110001111111010001110111100111111001111111100110110101000001111111010000111110000101110001011101100111111001111111101101110100011 e2a13f3fb0d83f3fb2bb3f3f8fe8ef3f3fcda83fa1f0b8bb3f3fdba3
UTF-8 癲ㅻ슡椅썹빊音섏냸饔낆뮉宥귛$源놁쪕曖 111001111001100110110010111000111000010110111011111011001000101010100001111001101010010010000101111011001000110110111001111010111011100110001010111010011001111110110011111011001000010010001111111010111000001110111000111010011010010110010100111010111000001010000110111010111010111010001001111001011010111010100101111010101011011110011011111011111011110010000100111001101011101010010000111010111000011010000001111011001010101010010101111001101001101110010110 e799b2e385bbec8aa1e6a485ec8db9ebb98ae99fb3ec848feb83b8e9a594eb8286ebae89e5aea5eab79befbc84e6ba90eb8681ecaa95e69b96
UHC 癲ㅻ슡椅썹빊音섏냸饔낆뮉宥귛$源놁쪕曖 1110111110100110101001001110101110011010101011011110101111110101101111011110011110010101101100001110101111100101100110001110110010000110100010001110100010111101100001011110110010010010100101111110101011101001100000101110010110100011101001001110101010111001100001101110110010100101100011111110010011110010 efa6a4eb9aadebf5bde795b0ebe598ec8688e8bd85ec9297eae982e5a3a4eab986eca58fe4f2

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
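
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page: the function name to_bit_strings and the CODECS mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches the runs of 3f visible in the table rows.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These pairings are an assumption, not taken from the page's implementation.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, charset):
    """Return (binary, hexadecimal) strings for text encoded in charset."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(CODECS[charset], errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


# Print one row per charset, in the same column order as the table.
for name in CODECS:
    bits, hexa = to_bit_strings("あ", name)
    print(name, "あ", bits, hexa)
```

The same character produces different byte sequences depending on the charset: the hiragana "あ" is three bytes in UTF-8, two bytes in the Japanese legacy charsets, and a single "?" in ISO-8859-1, which has no code point for it.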