To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????F 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f46
SJIS-WIN 将脯宵ツ娼テ宵ゥ。゙繒ェォ繽ォタ。ッF 10001111101010111110001111111011100011111010101011000010100011111010100111000011100011111010101010101001101000011101111011111011100011111010101011110001111011011010101111100011100011111010101111000000101000011010111101000110 8fabe3fb8faac28fa9c38faaa9a1defb8faaf1edabe38fabc0a1af46
EUC-JP 将脯宵ツ娼テ宵ゥ。゙繒ェ?ォ繽ォタ。ッF 101111101010110111100110111111011011111010101100100011101100001010111110101010111000111011000011101111101010110010001110101010011000111010100001100011101101111010001111110101001101010010001110101010100011111110001110101010111110010111101111100011101010101110001110110000001000111010100001100011101010111101000110 beade6fdbeac8ec2beab8ec3beac8ea98ea18ede8fd4d48eaa3f8eabe5ef8eab8ec08ea18eaf46
UTF-8 将脯宵ツ娼テ宵ゥ。゙繒ェォ繽ォタ。ッF 11100101101100001000011011101000100001001010111111100101101011101011010111101111101111101000001011100101101010001011110011101111101111101000001111100101101011101011010111101111101111011010100111101111101111011010000111101111101111101001111011100111101110011001001011101111101111011010101011101110100001011010100011101111101111011010101111100111101110011011110111101111101111011010101111101111101111101000000011101111101111011010000111101111101111011010111101000110 e5b086e884afe5aeb5efbe82e5a8bcefbe83e5aeb5efbda9efbda1efbe9ee7b992efbdaaee85a8efbdabe7b9bdefbdabefbe80efbda1efbdaf46
UHC ?脯宵?娼?宵???繒????????F 00111111111110001110000111100001101100100011111111110011110111100011111111100001101100100011111100111111001111111111000111111001001111110011111100111111001111110011111100111111001111110011111101000110 3ff8e1e1b23ff3de3fe1b23f3f3ff1f93f3f3f3f3f3f3f3f46

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)