To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 癲?8柚??幼??z癲?8柚??幼??zB 1110000110011111001111111000001001010111100101110100110100111111001111111001011101100011001111110011111101111010111000011001111100111111100000100101011110010111010011010011111100111111100101110110001100111111001111110111101001000010 e19f3f8257974d3f3f97633f3f7ae19f3f8257974d3f3f97633f3f7a42
EUC-JP 癲?8柚??幼??z癲?8柚??幼??zB 1110001010100001001111111010001110111000110011011010111000111111001111111100110111000100001111110011111101111010111000101010000100111111101000111011100011001101101011100011111100111111110011011100010000111111001111110111101001000010 e2a13fa3b8cdae3f3fcdc43f3f7ae2a13fa3b8cdae3f3fcdc43f3f7a42
UTF-8 癲쒕8柚삯뜮幼꾩뒠z癲쒕8柚삯뜮幼꾩뒠zB 111001111001100110110010111011001001001010010101111011111011110010011000111001101001111110011010111011001000001010101111111010111001110010101110111001011011100110111100111010101011111010101001111010111001001010100000011110101110011110011001101100101110110010010010100101011110111110111100100110001110011010011111100110101110110010000010101011111110101110011100101011101110010110111001101111001110101010111110101010011110101110010010101000000111101001000010 e799b2ec9295efbc98e69f9aec82afeb9caee5b9bceabea9eb92a07ae799b2ec9295efbc98e69f9aec82afeb9caee5b9bceabea9eb92a07a42
UHC 癲쒕8柚삯뜮幼꾩뒠z癲쒕8柚삯뜮幼꾩뒠zB 111011111010011010011100111010111010001110111000111010101111011010111011111010011000110110101110111010101110101010000100111011001000101010011100011110101110111110100110100111001110101110100011101110001110101011110110101110111110100110001101101011101110101011101010100001001110110010001010100111000111101001000010 efa69ceba3b8eaf6bbe98daeeaea84ec8a9c7aefa69ceba3b8eaf6bbe98daeeaea84ec8a9c7a42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
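
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the helper name `encode_table` and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions, and Python's codecs may differ from the converter's in a few edge-case mappings. Characters that a charset cannot represent are replaced with `?` (hex 3f), which matches the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Hypothetical helper for illustration only.
    """
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Render each byte as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


for label, bits, hex_string in encode_table("zB"):
    print(label, bits, hex_string)
```

Because "z" and "B" are plain ASCII, every charset in the list encodes them to the same two bytes, which is why each row of the table ends in 7a42.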