To which bit string is a character (or several characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?諭ょ????瑤??游??矜伊??? 11100001100111111000001110001011001111111001011101000000100000101110010100111111001111110011111100111111111010101010001000111111001111111001111111100000001111110011111111100001111000001000100011001001001111110011111100111111 e19f838b3f974082e53f3f3f3feaa23f3f9fe03f3fe1e088c93f3f3f
EUC-JP 癲ル?諭ょ?洹??瑤??游??矜伊??佾 1110001010100001101001011110101100111111110011011010000110100100111001110011111110001111110001111011101000111111001111111111010010100100001111110011111111011110111000100011111100111111111000101110001010110000110010110011111100111111100011111011000011111011 e2a1a5eb3fcda1a4e73f8fc7ba3f3ff4a43f3fdee23f3fe2e2b0cb3f3f8fb0fb
UTF-8 癲ル슢諭ょ뙴洹욌뙀瑤녹뜫游뜹슫矜伊믦퉪佾 111001111001100110110010111000111000001110101011111011001000101010100010111010001010101110101101111000111000001010000111111010111001100110110100111001101011010010111001111011001001101010001100111010111001100110000000111001111001000110100100111010111000010110111001111010111001110010101011111001101011100010111000111010111001110010111001111011001000101010101011111001111001111110011100111001001011110010001010111010111010111110100110111011011000100110101010111001001011110110111110 e799b2e383abec8aa2e8abade38287eb99b4e6b4b9ec9a8ceb9980e791a4eb85b9eb9cabe6b8b8eb9cb9ec8aabe79f9ce4bc8aebafa6ed89aae4bdbe
UHC 癲ル슢諭ょ뙴洹욌뙀瑤녹뜫游뜹슫矜伊믦퉪佾 11101111101001101010101111101011100110101010111011101011101100011010101011100111100011001011011111101010101101111001111011101011100011001000011011101000111111011011001111101100100011011010110011101010111111011011011011100101100110101011010011010000111010001110110010100101100100101110100010111001100000101110110011101011 efa6abeb9aaeebb1aae78cb7eab79eeb8c86e8fdb3ec8daceafdb6e59ab4d0e8eca592e8b982eceb

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
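
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation: the codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed mappings to Python's standard codecs, and `encode_row` and `convert` are hypothetical helper names. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which would explain the runs of `3f` in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and collect the results by label."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in convert("あ").items():
        print(name, bits, hex_string)
```

For the hiragana "あ", for example, this sketch yields `e38182` in UTF-8, `82a0` in SJIS-WIN, and `a4a2` in EUC-JP, while ISO-8859-1 cannot represent the character and falls back to `3f`.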