To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?溢d?飮??齬??游??錦維?? 11100001100111111000001110001011001111111000100011101100100000101000010000111111100111110101101000111111001111111110101010010111001111110011111110011111111000000011111100111111100010111101000110001000110110110011111100111111 e19f838b3f88ec82843f9f5a3f3fea973f3f9fe03f3f8bd188db3f3f
EUC-JP 癲ル?溢d?飮??齬??游??錦維?? 11100010101000011010010111101011001111111011000011101110101000111110010000111111110111011011101100111111001111111111001111110111001111110011111111011110111000100011111100111111101101101101001110110000110111010011111100111111 e2a1a5eb3fb0eea3e43fddbb3f3ff3f73f3fdee23f3fb6d3b0dd3f3f
UTF-8 癲ル슪溢d펺飮뗣럶齬잙뱪游뜹슫錦維쀦꼮 111001111001100110110010111000111000001110101011111011001000101010101010111001101011101010100010111011111011110110000100111011011000111010111010111010011010001110101110111010111001011110100011111010111001111110110110111010011011110110101100111011001001111010011001111010111011000110101010111001101011100010111000111010111001110010111001111011001000101010101011111010011000110010100110111001111011011010101101111011001000000010100110111010101011110010101110 e799b2e383abec8aaae6baa2efbd84ed8ebae9a3aeeb97a3eb9fb6e9bdacec9e99ebb1aae6b8b8eb9cb9ec8aabe98ca6e7b6adec80a6eabcae
UHC 癲ル슪溢d펺飮뗣럶齬잙뱪游뜹슫錦維쀦꼮 1110111110100110101010111110101110011010101100111110110011101110101000111110010010111100100010101110101111100110100010111110001110001110100101011110010111100001100111111110101110010011100100001110101011111101101101101110010110011010101101001101000011011110111010111010101110010111111001101000010010001001 efa6abeb9ab3eceea3e4bc8aebe68be38e95e5e19feb9390eafdb6e59ab4d0deebab97e68489

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)