To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲???????????蟻??檍??伊?? 11100001100111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111100010110110000100111111001111111001111011111000001111110011111110001000110010010011111100111111 e19f3f3f3f3f3f3f3f3f3f3f3f8b613f3f9ef83f3f88c93f3f
EUC-JP 癲??堉??洹?????蟻??檍??伊?? 1110001010100001001111110011111110001111101101111111110100111111001111111000111111000111101110100011111100111111001111110011111100111111101101011100001000111111001111111101110011111010001111110011111110110000110010110011111100111111 e2a13f3f8fb7fd3f3f8fc7ba3f3f3f3f3fb5c23f3fdcfa3f3fb0cb3f3f
UTF-8 癲삳끃堉붼윀洹욌퉲吏경쐣蟻볤껀檍용벡伊됪쐣 111001111001100110110010111011001000001010110011111010111000000110000011111001011010000010001001111010111011011010111100111011001001110010000000111001101011010010111001111011001001101010001100111011011000100110110010111011111010011110011110111010101011001010111101111011001001000010100011111010001001111110111011111010111011001110100100111010101011101110000000111001101010101010001101111011001001101010101001111010111011001010100001111001001011110010001010111010111001000010101010111011001001000010100011 e799b2ec82b3eb8183e5a089ebb6bcec9c80e6b4b9ec9a8ced89b2efa79eeab2bdec90a3e89fbbebb3a4eabb80e6aa8dec9aa9ebb2a1e4bc8aeb90aaec90a3
UHC 癲삳끃堉붼윀洹욌퉲吏경쐣蟻볤껀檍용벡伊됪쐣 111011111010011010111011111010111000010110111001111010111011110010010100111010011001111110001011111010101011011110011110111010111011100110001010111011001010011110110000111001101001110010001001111010111111110010010011111010101011001010101011111001011110010110111111111010111011101010100100111011001010010110001001111001101001110010001001 efa6bbeb85b9ebbc94e99f8beab79eebb98aeca7b0e69c89ebfc93eab2abe5e5bfebbaa4eca589e69c89

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
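
The same kind of conversion can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions, and Python's codecs may differ from this tool in a few edge-case characters. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which matches the runs of `3f` visible in the table.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is the closest Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is the closest Python codec
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


for label, bits, hex_string in encode_table("あ"):
    print(label, bits, hex_string)
```

For the input `あ`, for example, the UTF-8 row comes out as `e38182`, while ISO-8859-1 cannot represent the character and yields the single byte `3f`.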