To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??????嫩?莖?釜?莎瀕???????^ 001111110011111100111111001111110011111100111111100110110110001100111111111001001011000100111111100010101001100000111111111001001011001110010101011011010011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f9b633fe4b13f8a983fe4b3956d3f3f3f3f3f3f3f5e
EUC-JP ??????嫩?莖?釜?莎瀕???????^ 001111110011111100111111001111110011111100111111110101011100010000111111111010001011001100111111101100111111100000111111111010001011010111001001110011100011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3fd5c43fe8b33fb3f83fe8b5c9ce3f3f3f3f3f3f3f5e
UTF-8 셈롛뤰탮탮펿嫩펿莖펿釜핉莎瀕렦렯롍렯롇렯롗^ 11101100100001011000100011101011101000011001101111101011101001001011000011101101100000111010111011101101100000111010111011101101100011101011111111100101101010111010100111101101100011101011111111101000100011101001011011101101100011101011111111101001100001111001110011101101100101011000100111101000100011101000111011100111100000001001010111101011101000001010011011101011101000001010111111101011101000011000110111101011101000001010111111101011101000011000011111101011101000001010111111101011101000011001011101011110 ec8588eba19beba4b0ed83aeed83aeed8ebfe5aba9ed8ebfe88e96ed8ebfe9879ced9589e88e8ee78095eba0a6eba0afeba18deba0afeba187eba0afeba1975e
UHC 셈롛뤰탮탮펿嫩펿莖펿釜핉莎瀕렦렯롍렯롇렯롗^ 10111100110000001000111011011111100011111101111010110101100011101011010110001110101111001000111011010010111011001011110010001110110011001110110010111100100011101101110110111100110000001000111011011110111011011101111010110101100011101011010110001110101111001000111011010011100011101011110010001110110011011000111010111100100011101101101101011110 bcc08edf8fdeb58eb58ebc8ed2ecbc8eccecbc8eddbcc08edeeddeb58eb58ebc8ed38ebc8ecd8ebc8edb5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f) in that row.
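
The conversion shown in the table can be approximated with a short Python sketch. It is not this page's implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-Win to cp932 and UHC to cp949) is an assumption, and the function names are hypothetical. Unencodable characters are replaced with "?", which matches the 3f bytes seen in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' (0x3f) for characters the codec lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_bits(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("^").items():
        print(label, binary, hexadecimal)
```

For the ASCII character "^", every charset listed yields the single byte 5e, which agrees with the last byte of each row in the table.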