To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 癲?8柚??宥?????日??唯??認?B 11100001100111110011111110000010010101111001011101001101001111110011111110010111010001110011111100111111001111110011111100111111100100111111101000111111001111111001011101000010001111110011111110010100010001100011111101000010 e19f3f8257974d3f3f97473f3f3f3f3f93fa3f3f97423f3f94463f42
EUC-JP 癲?8柚??宥?????日??唯??認?B 11100010101000010011111110100011101110001100110110101110001111110011111111001101101010000011111100111111001111110011111100111111110001101111110000111111001111111100110110100011001111110011111111000111101001110011111101000010 e2a13fa3b8cdae3f3fcda83f3f3f3f3fc6fc3f3fcda33f3fc7a73f42
UTF-8 癲쒕8柚삯뜮宥멸콟銳얜㉡日듿보唯몃뀪認얥B 11100111100110011011001011101100100100101001010111101111101111001001100011100110100111111001101011101100100000101010111111101011100111001010111011100101101011101010010111101011101010011011100011101100101111011001111111101001100010101011001111101100100101101001110011100011100010011010000111100110100101111010010111101011100100111011111111101011101100111011010011100101100101001010111111101011101010101000001111101011100000001010101011101000101010101000110111101100100101101010010101000010 e799b2ec9295efbc98e69f9aec82afeb9caee5aea5eba9b8ecbd9fe98ab3ec969ce389a1e697a5eb93bfebb3b4e594afebaa83eb80aae8aa8dec96a542
UHC 癲쒕8柚삯뜮宥멸콟銳얜㉡日듿보唯몃뀪認얥B 1110111110100110100111001110101110100011101110001110101011110110101110111110100110001101101011101110101011101001101110001110101010110001100101111110011111100101101111101110101110101000101100101110110011101101100010101110010110111010101110001110101011100110101110001110101110000101101000001110110011100011100111100100110001000010 efa69ceba3b8eaf6bbe98daeeae9b8eab197e7e5beeba8b2eced8ae5bab8eae6b8eb85a0ece39e4c42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)