What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?泣g?幽??癰??肄??碎l?? 11100001100111111000001110001011001111111000101110000011100000101000011100111111100101110100100000111111001111111110000110011110001111110011111111100011111001010011111100111111111000011110101010000010100011000011111100111111 e19f838b3f8b8382873f97483f3fe19e3f3fe3e53f3fe1ea828c3f3f
EUC-JP 癲ル?泣g?幽??癰??肄??碎l?獒 111000101010000110100101111010110011111110110101111000111010001111100111001111111100110110101001001111110011111111100001111111100011111100111111111001101110011100111111001111111110001011101100101000111110110000111111100011111100101110111011 e2a1a5eb3fb5e3a3e73fcda93f3fe1fe3f3fe6e73f3fe2eca3ec3f8fcbbb
UTF-8 癲ル슢泣g뙴幽뚯춹癰귥쥒肄잒첑碎l뫊獒 111001111001100110110010111000111000001110101011111011001000101010100010111001101011001110100011111011111011110110000111111010111001100110110100111001011011100110111101111010111001101010101111111011001011011010111001111001111001100110110000111010101011011110100101111011001010010110010010111010001000001010000100111011001001111010010010111011001011001010010001111001111010001010001110111011111011110110001100111010111010101110001010111001111000110110010010 e799b2e383abec8aa2e6b3a3efbd87eb99b4e5b9bdeb9aafecb6b9e799b0eab7a5eca592e88284ec9e92ecb291e7a28eefbd8cebab8ae78d92
UHC 癲ル슢泣g뙴幽뚯춹癰귥쥒肄잒첑碎l뫊獒 1110111110100110101010111110101110011010101011101110101111101000101000111110011110001100101101111110101011101011100011001110110010101101100101011110100010111001100000101110110010100010100010011110110010111101100111111110100010101010100111101110000111101111101000111110110010010001101011001110100010100011 efa6abeb9aaeebe8a3e78cb7eaeb8cecad95e8b982eca289ecbd9fe8aa9ee1efa3ec91ace8a3

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
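
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and it assumes that Python's `cp932`, `euc_jp`, and `cp949` codecs correspond to SJIS-WIN, EUC-JP, and UHC. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the table above.

```python
# Hypothetical mapping from the labels used on this page to Python codec names.
# Assumption: cp932 ~ SJIS-WIN, euc_jp ~ EUC-JP, cp949 ~ UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Unencodable characters are replaced with '?' instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "あ"  # any single character or short string
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Each output row follows the same column order as the table: charset, character, binary bit string, hexadecimal bit string. Exact byte values may differ from this page for characters where the Python codecs and the page's converter disagree.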