What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 癲??爰??循??瑤?????肯???喩??B 1110000110011111001111110011111111100000101001110011111100111111100011110111101000111111001111111110101010100010001111110011111100111111001111110011111110001101011011010011111100111111001111111001101001100111001111110011111101000010 e19f3f3fe0a73f3f8f7a3f3feaa23f3f3f3f3f8d6d3f3f3f9a673f3f42
EUC-JP 癲??爰??循??瑤?????肯???喩??B 1110001010100001001111110011111111100000101010010011111100111111101111011101101100111111001111111111010010100100001111110011111100111111001111110011111110111001110011100011111100111111001111111101001111001000001111110011111101000010 e2a13f3fe0a93f3fbddb3f3ff4a43f3f3f3f3fb9ce3f3f3fd3c83f3f42
UTF-8 癲쒕짅爰껅벚循낇뫛瑤뗭늹柳닸걗肯留싧춢喩뽮굉B 11100111100110011011001011101100100100101001010111101100101001111000010111100111100010001011000011101010101110111000010111101011101100101001101011100101101111101010101011101011100000101000011111101011101010111001101111100111100100011010010011101011100101111010110111101011100010101011100111101111101001111000100111101011100010111011100011101010101100011001011111101000100000101010111111101111101001111000110111101100100010111010011111101100101101101010001011100101100101101010100111101011101111011010111011101010101101011000100101000010 e799b2ec9295eca785e788b0eabb85ebb29ae5beaaeb8287ebab9be791a4eb97adeb8ab9efa789eb8bb8eab197e882afefa78dec8ba7ecb6a2e596a9ebbdaeeab58942
UHC 癲쒕짅爰껅벚循낇뫛瑤뗭늹柳닸걗肯留싧춢喩뽮굉B 111011111010011010011100111010111010001110010100111010101011101010000011111001101011101010100010111000101110000010000101111011011001000110111011111010001111110110001011111011001000100010000010111010101111011110110100111001101000000110000010110100001110100111101011101001111001101011100101101011011000001111101010111001111001011011101010101100011011001001000010 efa69ceba394eaba83e6baa2e2e085ed91bbe8fd8bec8882eaf7b4e68182d0e9eba79ae5ad83eae796eab1b242

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
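
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code this page runs. The function name `encode_bits` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions, and Python's codecs may differ from the page's converter in edge-case mappings. Characters that a charset cannot represent are replaced with "?" (0x3f), which is the likely source of the runs of "3f" in the table.

```python
def encode_bits(text, codec):
    """Encode text with the given codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return bits, data.hex()


# Assumed mapping from the page's charset labels to Python codec names
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def convert(text):
    """Return one (label, bits, hex) row per charset for the given text."""
    return [(label, *encode_bits(text, codec)) for label, codec in CHARSETS]


if __name__ == "__main__":
    for label, bits, hexstr in convert("B"):
        print(label, bits, hexstr)  # every row ends in: 01000010 42
```

For the ASCII letter "B", all five charsets yield the same single byte, 0x42, which matches the final byte of every row in the table above.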