To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????D??????D^ 001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f443f3f3f3f3f3f445e
SJIS-WIN 宣舌褻宣舌暹D宣舌褻宣舌暹D^ 100100001110100110010000111000111110010111110110100100001110100110010000111000111001110111111001010001001001000011101001100100001110001111100101111101101001000011101001100100001110001110011101111110010100010001011110 90e990e3e5f690e990e39df94490e990e3e5f690e990e39df9445e
EUC-JP 宣舌褻宣舌暹D宣舌褻宣舌暹D^ 110000001110101111000000111001011110101011111000110000001110101111000000111001011101101011111011010001001100000011101011110000001110010111101010111110001100000011101011110000001110010111011010111110110100010001011110 c0ebc0e5eaf8c0ebc0e5dafb44c0ebc0e5eaf8c0ebc0e5dafb445e
UTF-8 宣舌褻宣舌暹D宣舌褻宣舌暹D^ 111001011010111010100011111010001000100010001100111010001010010010111011111001011010111010100011111010001000100010001100111001101001101010111001010001001110010110101110101000111110100010001000100011001110100010100100101110111110010110101110101000111110100010001000100011001110011010011010101110010100010001011110 e5aea3e8888ce8a4bbe5aea3e8888ce69ab944e5aea3e8888ce8a4bbe5aea3e8888ce69ab9445e
UHC 宣舌褻宣舌暹D宣舌褻宣舌暹D^ 111000001011111011100000110111111110000011100001111000001011111011100000110111111110000011100111010001001110000010111110111000001101111111100000111000011110000010111110111000001101111111100000111001110100010001011110 e0bee0dfe0e1e0bee0dfe0e744e0bee0dfe0e1e0bee0dfe0e7445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)