To which bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 炎??瑤?????D炎??瑤?????D^ 10001001100010100011111100111111111010101010001000111111001111110011111100111111001111110100010010001001100010100011111100111111111010101010001000111111001111110011111100111111001111110100010001011110 898a3f3feaa23f3f3f3f3f44898a3f3feaa23f3f3f3f3f445e
EUC-JP 炎??瑤?????D炎??瑤?????D^ 10110001111010100011111100111111111101001010010000111111001111110011111100111111001111110100010010110001111010100011111100111111111101001010010000111111001111110011111100111111001111110100010001011110 b1ea3f3ff4a43f3f3f3f3f44b1ea3f3ff4a43f3f3f3f3f445e
UTF-8 炎⑴쁺瑤꿰쑊念곩뱢D炎⑴쁺瑤꿰쑊念곩뱢D^ 111001111000001010001110111000101001000110110100111011001000000110111010111001111001000110100100111010101011111110110000111011001001000110001010111011111010011010100011111010101011001110101001111010111011000110100010010001001110011110000010100011101110001010010001101101001110110010000001101110101110011110010001101001001110101010111111101100001110110010010001100010101110111110100110101000111110101010110011101010011110101110110001101000100100010001011110 e7828ee291b4ec81bae791a4eabfb0ec918aefa6a3eab3a9ebb1a244e7828ee291b4ec81bae791a4eabfb0ec918aefa6a3eab3a9ebb1a2445e
UHC 炎⑴쁺瑤꿰쑊念곩뱢D炎⑴쁺瑤꿰쑊念곩뱢D^ 111001101111101010101001111001111001100010000001111010001111110110110010111001111001110010101001111001101111011010000001111001011001001110001000010001001110011011111010101010011110011110011000100000011110100011111101101100101110011110011100101010011110011011110110100000011110010110010011100010000100010001011110 e6faa9e79881e8fdb2e79ca9e6f681e5938844e6faa9e79881e8fdb2e79ca9e6f681e59388445e

SJIS-WIN, EUC-JP: Classic character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
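
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The function name to_bit_strings and the mapping from the table's charset labels to Python codec names are assumptions: SJIS-WIN is mapped to cp932 and UHC to cp949, and Python's codecs may differ from the converter's in a few edge-case mappings. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def to_bit_strings(text):
    """Return {label: (binary_string, hex_string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (binary, data.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexa) in to_bit_strings("炎D^").items():
        print(label, binary, hexa)
```

For the input "炎D^", this sketch yields e7828e445e for UTF-8, 898a445e for SJIS-WIN, and b1ea445e for EUC-JP, consistent with the leading and trailing bytes of the corresponding rows in the table.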