To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 髴趣スソ髮蛾宦謠エ髴趣スソ髮蛾宦謠エB 11101001100111001000111011101111101111011011111111101001100110111000100111101001100110111000000111100110100011111011010011101001100111001000111011101111101111011011111111101001100110111000100111101001100110111000000111100110100011111011010001000010 e99c8eefbdbfe99b89e99b81e68fb4e99c8eefbdbfe99b89e99b81e68fb442
EUC-JP 髴趣スソ髮蛾宦謠エ髴趣スソ髮蛾宦謠エB 11110001111111001011110011110001100011101011110110001110101111111111000111111011101100101110101111010101111000011110101111101111100011101011010011110001111111001011110011110001100011101011110110001110101111111111000111111011101100101110101111010101111000011110101111101111100011101011010001000010 f1fcbcf18ebd8ebff1fbb2ebd5e1ebef8eb4f1fcbcf18ebd8ebff1fbb2ebd5e1ebef8eb442
UTF-8 髴趣スソ髮蛾宦謠エ髴趣スソ髮蛾宦謠エB 11101001101010111011010011101000101101101010001111101111101111011011110111101111101111011011111111101001101010111010111011101000100110111011111011100101101011101010011011101000101011001010000011101111101111011011010011101001101010111011010011101000101101101010001111101111101111011011110111101111101111011011111111101001101010111010111011101000100110111011111011100101101011101010011011101000101011001010000011101111101111011011010001000010 e9abb4e8b6a3efbdbdefbdbfe9abaee89bbee5aea6e8aca0efbdb4e9abb4e8b6a3efbdbdefbdbfe9abaee89bbee5aea6e8aca0efbdb442
UHC ?趣??髮蛾宦謠??趣??髮蛾宦謠?B 0011111111110110101011000011111100111111110110111010010111100100101101101111110010110010111010011010101000111111001111111111011010101100001111110011111111011011101001011110010010110110111111001011001011101001101010100011111101000010 3ff6ac3f3fdba5e4b6fcb2e9aa3f3ff6ac3f3fdba5e4b6fcb2e9aa3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)