To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?溢??怨??鴉?????閻??萸? 111000011001111110000011100010110011111110001000111011000011111100111111100010011000010100111111001111111110100111101011001111110011111100111111001111110011111111101000100001010011111100111111111001001100111000111111 e19f838b3f88ec3f3f89853f3fe9eb3f3f3f3f3fe8853f3fe4ce3f
EUC-JP 癲ル?溢??怨??鴉?????閻??萸? 111000101010000110100101111010110011111110110000111011100011111100111111101100011110010100111111001111111111001011101101001111110011111100111111001111110011111111101111111001010011111100111111111010001101000000111111 e2a1a5eb3fb0ee3f3fb1e53f3ff2ed3f3f3f3f3fefe53f3fe8d03f
UTF-8 癲ル슪溢섊땟怨⑺맪鴉롧춱類욌쾳閻롫갭萸팤 111001111001100110110010111000111000001110101011111011001000101010101010111001101011101010100010111011001000010010001010111010111001010110011111111001101000000010101000111000101001000110111010111010111010011110101010111010011011010010001001111010111010000110100111111011001011011010110001111011111010011110010000111011001001101010001100111011001011111010110011111010011001011010111011111010111010000110101011111010101011000010101101111010001001000010111000111011011000110010100100 e799b2e383abec8aaae6baa2ec848aeb959fe680a8e291baeba7aae9b489eba1a7ecb6b1efa790ec9a8cecbeb3e996bbeba1abeab0ade890b8ed8ca4
UHC 癲ル슪溢섊땟怨⑺맪鴉롧춱類욌쾳閻롫갭萸팤 11101111101001101010101111101011100110101011001111101100111011101001100011100111101101101010110111101010101100111010100111101101100100001011001011100100101111001000111011100111101011011000110111101011101110101001111011101011101100101000100111100111101000101000111011101011101100001011100011101011101011011011101101100001 efa6abeb9ab3ecee98e7b6adeab3a9ed90b2e4bc8ee7ad8debba9eebb289e7a28eebb0b8ebadbb61

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)