To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??諭??筍ル?域??異??碎??魚 111000011001111100111111001111111001011101000000001111110011111111100010101000011000001110001011001111111000100011100110001111110011111110001000110110010011111100111111111000011110101000111111001111111000101110011011 e19f3f3f97403f3fe2a1838b3f88e63f3f88d93f3fe1ea3f3f8b9b
EUC-JP 癲??諭??筍ル?域??異??碎??魚 111000101010000100111111001111111100110110100001001111110011111111100100101000111010010111101011001111111011000011101000001111110011111110110000110110110011111100111111111000101110110000111111001111111011010111111011 e2a13f3fcda13f3fe4a3a5eb3fb0e83f3fb0db3f3fe2ec3f3fb5fb
UTF-8 癲ㅺ퓥諭욤짆筍ル뙀域밟뫁異루쥗碎몄탪魚 111001111001100110110010111000111000010110111010111011011001001110100101111010001010101110101101111011001001101010100100111011001010011110000110111001111010110110001101111000111000001110101011111010111001100110000000111001011001111110011111111010111011000010011111111010111010101110000001111001111001010110110000111010111010001110101000111011001010010110010111111001111010001010001110111010111010101010000100111011011000001110101010111010011010110110011010 e799b2e385baed93a5e8abadec9aa4eca786e7ad8de383abeb9980e59f9febb09febab81e795b0eba3a8eca597e7a28eebaa84ed83aae9ad9a
UHC 癲ㅺ퓥諭욤짆筍ル뙀域밟뫁異루쥗碎몄탪魚 1110111110100110101001001110101010111111100011101110101110110001101111111110100010100011100101011110001011101100101010111110101110001100100001101110011010110100101110011110001010010001101001011110110010110110101101111110011110100010100011011110000111101111101110001110110010110101100011001110010111100000 efa6a4eabf8eebb1bfe8a395e2ecabeb8c86e6b4b9e291a5ecb6b7e7a28de1efb8ecb58ce5e0

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
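
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `encode_to_bits` and `convert` are hypothetical, and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption that may differ from this page in edge cases. Characters a charset cannot represent are replaced with `?` (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Charset labels from the table mapped to Python codec names.
# The mapping is an assumption; the page's own converter may differ in edge cases.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_to_bits(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset for a sample character.
    for label, (binary, hexadecimal) in convert("魚").items():
        print(label, binary, hexadecimal)
```

Running the script with `魚` should give the same final byte pairs as the SJIS-WIN, EUC-JP, and UTF-8 rows above (`8b9b`, `b5fb`, `e9ad9a`), since that character ends the sample input.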