To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 顫趣スソ顋蛾宦褊エ髴趣スソ髮芽謠エB 11101000111110101000111011101111101111011011111111101000111110011000100111101001100110111000000111100101111011011011010011101001100111001000111011101111101111011011111111101001100110111000100111101000111110011000000111100110100011111011010001000010 e8fa8eefbdbfe8f989e99b81e5edb4e99c8eefbdbfe99b89e8f981e68fb442
EUC-JP 顫趣スソ顋蛾宦褊エ髴趣スソ髮芽?謠エB 111100001111110010111100111100011000111010111101100011101011111111110000111110111011001011101011110101011110000111101010111011111000111010110100111100011111110010111100111100011000111010111101100011101011111111110001111110111011001011101010001111111110101111101111100011101011010001000010 f0fcbcf18ebd8ebff0fbb2ebd5e1eaef8eb4f1fcbcf18ebd8ebff1fbb2ea3febef8eb442
UTF-8 顫趣スソ顋蛾宦褊エ髴趣スソ髮芽謠エB 11101001101000011010101111101000101101101010001111101111101111011011110111101111101111011011111111101001101000011000101111101000100110111011111011100101101011101010011011101000101001001000101011101111101111011011010011101001101010111011010011101000101101101010001111101111101111011011110111101111101111011011111111101001101010111010111011101000100010101011110111101110100110111001110011101000101011001010000011101111101111011011010001000010 e9a1abe8b6a3efbdbdefbdbfe9a18be89bbee5aea6e8a48aefbdb4e9abb4e8b6a3efbdbdefbdbfe9abaee88abdee9b9ce8aca0efbdb442
UHC 顫趣???蛾宦???趣??髮芽?謠?B 111011111011010111110110101011000011111100111111001111111110010010110110111111001011001000111111001111110011111111110110101011000011111100111111110110111010010111100100101101000011111111101001101010100011111101000010 efb5f6ac3f3f3fe4b6fcb23f3f3ff6ac3f3fdba5e4b43fe9aa3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)