To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?揖??惟??馭??循??野??? 1110000110011111100000111000101100111111100101110100101100111111001111111000100011010010001111110011111111101001011001100011111100111111100011110111101000111111001111111001011011101100001111110011111100111111 e19f838b3f974b3f3f88d23f3fe9663f3f8f7a3f3f96ec3f3f3f
EUC-JP 癲ル?揖??惟??馭??循??野??薏 11100010101000011010010111101011001111111100110110101100001111110011111110110000110101000011111100111111111100011100011100111111001111111011110111011011001111110011111111001100111011100011111100111111100011111101100111011110 e2a1a5eb3fcdac3f3fb0d43f3ff1c73f3fbddb3f3fccee3f3f8fd9de
UTF-8 癲ル슣揖㏝윀惟㏉맪馭앭퐲循륁췀野껊갭薏 111001111001100110110010111000111000001110101011111011001000101010100011111001101000111110010110111000111000111110011101111011001001110010000000111001101000001110011111111000111000111110001001111010111010011110101010111010011010011010101101111011001001010110101101111011011001000010110010111001011011111010101010111010111010010110000001111011001011011110000000111010011000011110001110111010101011101110001010111010101011000010101101111010001001011010001111 e799b2e383abec8aa3e68f96e38f9dec9c80e6839fe38f89eba7aae9a6adec95aded90b2e5beaaeba581ecb780e9878eeabb8aeab0ade8968f
UHC 癲ル슣揖㏝윀惟㏉맪馭앭퐲循륁췀野껊갭薏 1110111110100110101010111110101110011010101011111110101111100111101001111110100110011111100010111110101011101110101001111110110110010000101100101110010111011111100111011110010110111101100110111110001011100000100011111110110010101101100111001110010110101111100000111110101110110000101110001110101111111011 efa6abeb9aafebe7a7e99f8beaeea7ed90b2e5df9de5bd9be2e08fecad9ce5af83ebb0b8ebfb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)