To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル????蟻?????循??寃??筌??釗 1110000110011111100000111000101100111111001111110011111100111111100010110110000100111111001111110011111100111111001111111000111101111010001111110011111110011011100000110011111100111111111000101010001100111111001111111111101110111011 e19f838b3f3f3f3f8b613f3f3f3f3f8f7a3f3f9b833f3fe2a33f3ffbbb
EUC-JP 癲ル?佾??蟻?????循??寃??筌??釗 1110001010100001101001011110101100111111100011111011000011111011001111110011111110110101110000100011111100111111001111110011111100111111101111011101101100111111001111111101010111100011001111110011111111100100101001010011111100111111100011111110001110100110 e2a1a5eb3f8fb0fb3f3fb5c23f3f3f3f3fbddb3f3fd5e33f3fe4a53f3f8fe3a6
UTF-8 癲ル슡佾쒏벚蟻띿쒜烈쒕굞循꿰깗寃몃쳛筌덈벝釗 111001111001100110110010111000111000001110101011111011001000101010100001111001001011110110111110111011001001001010001111111010111011001010011010111010001001111110111011111010111001110110111111111011001001001010011100111011111010011010011111111011001001001010010101111010101011010110011110111001011011111010101010111010101011111110110000111010101011100110010111111001011010111110000011111010111010101010000011111011001011001110011011111001111010110110001100111010111000110110001000111010111011001010011101111010011000011110010111 e799b2e383abec8aa1e4bdbeec928febb29ae89fbbeb9dbfec929cefa69fec9295eab59ee5beaaeabfb0eab997e5af83ebaa83ecb39be7ad8ceb8d88ebb29de98797
UHC 癲ル슡佾쒏벚蟻띿쒜烈쒕굞循꿰깗寃몃쳛筌덈벝釗 1110111110100110101010111110101110011010101011011110110011101011100111001110011010111010101000101110101111111100100011011110110010111110101011101110011011101111100111001110101110000010100001101110001011100000101100101110011110000011100011111110101010110010101110001110101110101011100000011110111110100111100010001110101110010011101110001110000111110010 efa6abeb9aadeceb9ce6baa2ebfc8decbeaee6ef9ceb8286e2e0b2e7838feab2b8ebab81efa788eb93b8e1f2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)