To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?溢h?喩??齬??猷??淫??? 111000011001111110000011100010110011111110001000111011001000001010001000001111111001101001100111001111110011111111101010100101110011111100111111100101110101000100111111001111111000100011111010001111110011111100111111 e19f838b3f88ec82883f9a673f3fea973f3f97513f3f88fa3f3f3f
EUC-JP 癲ル?溢h?喩??齬??猷??淫??? 111000101010000110100101111010110011111110110000111011101010001111101000001111111101001111001000001111110011111111110011111101110011111100111111110011011011001000111111001111111011000011111100001111110011111100111111 e2a1a5eb3fb0eea3e83fd3c83f3ff3f73f3fcdb23f3fb0fc3f3f3f
UTF-8 癲ル슪溢h짆喩쎼럶齬잙뱪猷뗦쾬淫낃탿歷 111001111001100110110010111000111000001110101011111011001000101010101010111001101011101010100010111011111011110110001000111011001010011110000110111001011001011010101001111011001000111010111100111010111001111110110110111010011011110110101100111011001001111010011001111010111011000110101010111001111000110010110111111010111001011110100110111011001011111010101100111001101011011110101011111010111000001010000011111011011000001110111111111011111010011010001100 e799b2e383abec8aaae6baa2efbd88eca786e596a9ec8ebceb9fb6e9bdacec9e99ebb1aae78cb7eb97a6ecbeace6b7abeb8283ed83bfefa68c
UHC 癲ル슪溢h짆喩쎼럶齬잙뱪猷뗦쾬淫낃탿歷 1110111110100110101010111110101110011010101100111110110011101110101000111110100010100011100101011110101011100111100110111110001110001110100101011110010111100001100111111110101110010011100100001110101110100011100010111110011010110010100000111110101111100010100001011110101010110101100110111110011010111000 efa6abeb9ab3eceea3e8a395eae79be38e95e5e19feb9390eba38be6b283ebe285eab59be6b8

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
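
The two bit-string columns in the table can be reproduced with a few lines of Python. This is only a minimal sketch and not the converter's own implementation, which is not shown on this page. The function name `encode_bits` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which appears to match the `3f` bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "あ"  # hypothetical input; any short string works
    for label, codec in CHARSETS.items():
        binary, hexa = encode_bits(sample, codec)
        print(label, sample, binary, hexa)
```

Each output line follows the same column order as the table: charset, character, binary bit string, hexadecimal bit string.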