To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 顫趣スソ顋崎ご鋺ゥ髴趣スソ髮崎瘁竏ョB 11101000111110101000111011101111101111011011111111101000111110011000110111101000100000101011001011100111111110101010100111101001100111001000111011101111101111011011111111101001100110111000110111101000111000011000000111100010100010001010111001000010 e8fa8eefbdbfe8f98de882b2e7faa9e99c8eefbdbfe99b8de8e181e288ae42
EUC-JP 顫趣スソ顋崎ご鋺ゥ髴趣スソ髮崎瘁竏ョB 11110000111111001011110011110001100011101011110110001110101111111111000011111011101110101110101010100100101101001110111011111100100011101010100111110001111111001011110011110001100011101011110110001110101111111111000111111011101110101110101011100001111000011110001111101000100011101010111001000010 f0fcbcf18ebd8ebff0fbbaeaa4b4eefc8ea9f1fcbcf18ebd8ebff1fbbaeae1e1e3e88eae42
UTF-8 顫趣スソ顋崎ご鋺ゥ髴趣スソ髮崎瘁竏ョB 11101001101000011010101111101000101101101010001111101111101111011011110111101111101111011011111111101001101000011000101111100101101101001000111011100011100000011001010011101001100010111011101011101111101111011010100111101001101010111011010011101000101101101010001111101111101111011011110111101111101111011011111111101001101010111010111011100101101101001000111011100111100110001000000111100111101010111000111111101111101111011010111001000010 e9a1abe8b6a3efbdbdefbdbfe9a18be5b48ee38194e98bbaefbda9e9abb4e8b6a3efbdbdefbdbfe9abaee5b48ee79881e7ab8fefbdae42
UHC 顫趣???崎ご???趣??髮崎???B 1110111110110101111101101010110000111111001111110011111111010000111110001010101010110100001111110011111100111111111101101010110000111111001111111101101110100101110100001111100000111111001111110011111101000010 efb5f6ac3f3f3fd0f8aab43f3f3ff6ac3f3fdba5d0f83f3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)