To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 醤ヤヤ竟ンヤヤ顳 100011111101110111010100110101001111000011000111111010001110110111011101110101001101010011110000110001111110100101000010 8fddd4d4f0c7e8edddd4d4f0c7e942
EUC-JP 醤ヤヤ?竟ンヤヤ?顳 101111101101111110001110110101001000111011010100001111111111000011101111100011101101110110001110110101001000111011010100001111111111000110100011 bedf8ed48ed43ff0ef8edd8ed48ed43ff1a3
UTF-8 醤ヤヤ竟ンヤヤ顳 111010011000011010100100111011111011111010010100111011111011111010010100111011101000001010000110111001111010101110011111111011111011111010011101111011111011111010010100111011111011111010010100111011101000001010000110111010011010000110110011 e986a4efbe94efbe94ee8286e7ab9fefbe9defbe94efbe94ee8286e9a1b3
UHC ????竟????? 0011111100111111001111110011111111001100111001010011111100111111001111110011111100111111 3f3f3f3fcce53f3f3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)