What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 顋主叙蓚櫁ア趣スェ髮惹嵂蜍櫁ア趣スェ^ 11101000111110011000111011100101100011111001011011100100111010011001111011101000101100011000111011101111101111011010101011101001100110111000111011100100111110101011001011100101100010111001111011101000101100011000111011101111101111011010101001011110 e8f98ee58f96e4e99ee8b18eefbdaae99b8ee4fab2e58b9ee8b18eefbdaa5e
EUC-JP 顋主叙蓚櫁ア趣スェ髮惹嵂蜍櫁ア趣スェ^ 1111000011111011101111001110011110111101111101101110100011101011110111001110101010001110101100011011110011110001100011101011110110001110101010101111000111111011101111001110011010001111101110111101000011101001111010111101110011101010100011101011000110111100111100011000111010111101100011101010101001011110 f0fbbce7bdf6e8ebdcea8eb1bcf18ebd8eaaf1fbbce68fbbd0e9ebdcea8eb1bcf18ebd8eaa5e
UTF-8 顋主叙蓚櫁ア趣スェ髮惹嵂蜍櫁ア趣スェ^ 11101001101000011000101111100100101110001011101111100101100011111001100111101000100100111001101011100110101010111000000111101111101111011011000111101000101101101010001111101111101111011011110111101111101111011010101011101001101010111010111011100110100000111011100111100101101101011000001011101000100111001000110111100110101010111000000111101111101111011011000111101000101101101010001111101111101111011011110111101111101111011010101001011110 e9a18be4b8bbe58f99e8939ae6ab81efbdb1e8b6a3efbdbdefbdaae9abaee683b9e5b582e89c8de6ab81efbdb1e8b6a3efbdbdefbdaa5e
UHC ?主?蓚??趣??髮惹????趣??^ 00111111111100011010101100111111111000101011111000111111001111111111011010101100001111110011111111011011101001011110010110101001001111110011111100111111001111111111011010101100001111110011111101011110 3ff1ab3fe2be3f3ff6ac3f3fdba5e5a93f3f3f3ff6ac3f3f5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
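
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the page's actual implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are Python's own and their mapping to the labels above is an assumption, as are the helper names `encode_row` and `encode_table`. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one line per charset: label, input, binary, hexadecimal.
    sample = "主^"
    for label, (bits, hexa) in encode_table(sample).items():
        print(label, sample, bits, hexa)
```

Passing `errors="replace"` keeps the sketch from raising `UnicodeEncodeError` on characters outside a charset's repertoire. Use `errors="strict"` instead if a failure should be reported rather than masked.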