To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 潁〓?燁????????擁????????語 100111111111000110000001101011000011111111111011010110010011111100111111001111110011111100111111001111110011111100111111100101110110100100111111001111110011111100111111001111110011111100111111001111111000110011101010 9ff181ac3ffb593f3f3f3f3f3f3f3f97693f3f3f3f3f3f3f3f8cea
EUC-JP 潁〓?燁??倻?????擁????????語 110111101111001110100010101011100011111110001111110010101011001100111111001111111000111110110001111101100011111100111111001111110011111100111111110011011100101000111111001111110011111100111111001111110011111100111111001111111011100011101100 def3a2ae3f8fcab33f3f8fb1f63f3f3f3f3fcdca3f3f3f3f3f3f3f3fb8ec
UTF-8 潁〓젙燁삳젇倻뽰쑬溜뺣젧擁숃쾴溜졾날溜뤿젙語 111001101011110110000001111000111000000010010011111011001010000010011001111001111000011110000001111011001000001010110011111011001010000010000111111001011000000010111011111010111011110110110000111011001001000110101100111011111010011110001011111010111011101010100011111011001010000010100111111001101001001110000001111011001000100010000011111011001011111010110100111011111010011110001011111011001010000110111110111010111000001010100000111011111010011110001011111010111010010010111111111011001010000010011001111010001010101010011110 e6bd81e38093eca099e78781ec82b3eca087e580bbebbdb0ec91acefa78bebbaa3eca0a7e69381ec8883ecbeb4efa78beca1beeb82a0efa78beba4bfeca099e8aa9e
UHC 潁〓젙燁삳젇倻뽰쑬溜뺣젧擁숃쾴溜졾날溜뤿젙語 1110011110111000101000011110101110100000100101011110011110100111101110111110101110100000100010101110010110100110100101101110110010111110101010001110101011111110100101011110101110100000100111111110100010110110100110011110100010110010100010101110101011111110101000001110010110110011101011111110101011111110100011111110101110100000100101011110010111011110 e7b8a1eba095e7a7bbeba08ae5a696ecbea8eafe95eba09fe8b699e8b28aeafea0e5b3afeafe8feba095e5de

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)