To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????\F 0011111100111111001111110011111100111111001111110011111100111111001111110101110001000110 3f3f3f3f3f3f3f3f3f5c46
SJIS-WIN 掌??壯х?艶n?\F 10001111101101100011111100111111100110101110000110000100100001110011111110001001100100001000001010001110001111110101110001000110 8fb63f3f9ae184873f8990828e3f5c46
EUC-JP 掌??壯х?艶n?\F 10111110101110000011111100111111110101001110001110100111111001110011111110110001111100001010001111101110001111110101110001000110 beb83f3fd4e3a7e73fb1f0a3ee3f5c46
UTF-8 掌싩컩壯х컮艶n벘\F 11100110100011101000110011101100100010111010100111101100101110111010100111100101101000111010111111010001100001011110110010111011101011101110100010001001101101101110111110111101100011101110101110110010100110000101110001000110 e68e8cec8ba9ecbba9e5a3afd185ecbbaee889b6efbd8eebb2985c46
UHC 掌싩컩壯х컮艶n벘\F 1110110111100110100110101110011110110000100100011110110111100000101011001110011110110000100101001110011011111101101000111110111010010011101101010101110001000110 ede69ae7b091ede0ace7b094e6fda3ee93b55c46

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
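
The conversion in the table can be approximated offline with a short Python sketch. The codec names below are assumptions about which Python codecs correspond to the charsets listed here (`cp932` for SJIS-WIN, `cp949` for UHC); the function name `encode_rows` is hypothetical, and the converter's own implementation may differ. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the table. The sketch produces only the binary and hexadecimal columns, not the "Character" column.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def encode_rows(text):
    """Return (charset, binary bit string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_rows("掌"):
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the bit string is always exactly four times as long as the hexadecimal string.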