To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?????罐徇??碎??茹????ぜ揄?? 1110000110011111001111110011111100111111001111110011111111100011101000111001110001101101001111110011111111100001111010100011111100111111111001001010010100111111001111110011111100111111100000101011101010011101100010010011111100111111 e19f3f3f3f3f3fe3a39c6d3f3fe1ea3f3fe4a53f3f3f3f82ba9d893f3f
EUC-JP 癲?????罐徇??碎??茹????ぜ揄?? 1110001010100001001111110011111100111111001111110011111111100110101001011101011111001110001111110011111111100010111011000011111100111111111010001010011100111111001111110011111100111111101001001011110011011001111010010011111100111111 e2a13f3f3f3f3fe6a5d7ce3f3fe2ec3f3fe8a73f3f3f3fa4bcd9e93f3f
UTF-8 癲ㅺ옇流ⓩ만罐徇쒐뙴碎ㅼ맚茹띿슜梨룩ぜ揄몄쵄 111001111001100110110010111000111000010110111010111011001001100010000111111011111010011110001010111000101001001110101001111010111010011110001100111001111011110110010000111001011011111010000111111011001001001010010000111010111001100110110100111001111010001010001110111000111000010110111100111010111010011110011010111010001000110010111001111010111001110110111111111011001000101010011100111011111010011110100010111010111010001110101001111000111000000110011100111001101000111110000100111010111010101010000100111011001011010110000100 e799b2e385baec9887efa78ae293a9eba78ce7bd90e5be87ec9290eb99b4e7a28ee385bceba79ae88cb9eb9dbfec8a9cefa7a2eba3a9e3819ce68f84ebaa84ecb584
UHC 癲ㅺ옇流ⓩ만罐徇쒐뙴碎ㅼ맚茹띿슜梨룩ぜ揄몄쵄 1110111110100110101001001110101010111111101110001110101011111100101010001110011010111000101110001100111010111000111000101101111110011100111001111000110010110111111000011110111110100100111011001001000010101010111001101010101010001101111011001001101010101001111011001011000110110111111010001010101010111100111010101111000110111000111011001010110010000110 efa6a4eabfb8eafca8e6b8b8ceb8e2df9ce78cb7e1efa4ec90aae6aa8dec9aa9ecb1b7e8aabceaf1b8ecac86

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)