To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??鬱陌??畯?????鬱陌??畯???^ 001111110011111110011111010101001110100010011001001111110011111111111011011011110011111100111111001111110011111100111111100111110101010011101000100110010011111100111111111110110110111100111111001111110011111101011110 3f3f9f54e8993f3ffb6f3f3f3f3f3f9f54e8993f3ffb6f3f3f3f5e
EUC-JP 焌?鬱陌??畯???焌?鬱陌??畯???^ 100011111100100111101000001111111101110110110101111011111111100100111111001111111000111111001101101110110011111100111111001111111000111111001001111010000011111111011101101101011110111111111001001111110011111110001111110011011011101100111111001111110011111101011110 8fc9e83fddb5eff93f3f8fcdbb3f3f3f8fc9e83fddb5eff93f3f8fcdbb3f3f3f5e
UTF-8 焌렧鬱陌렡렜畯렧梨렢焌렧鬱陌렡렜畯렧梨렗^ 11100111100001001000110011101011101000001010011111101001101011001011000111101001100110011000110011101011101000001010000111101011101000001001110011100111100101011010111111101011101000001010011111101111101001111010001011101011101000001010001011100111100001001000110011101011101000001010011111101001101011001011000111101001100110011000110011101011101000001010000111101011101000001001110011100111100101011010111111101011101000001010011111101111101001111010001011101011101000001001011101011110 e7848ceba0a7e9acb1e9998ceba0a1eba09ce795afeba0a7efa7a2eba0a2e7848ceba0a7e9acb1e9998ceba0a1eba09ce795afeba0a7efa7a2eba0975e
UHC 焌렧鬱陌렡렜畯렧梨렢焌렧鬱陌렡렜畯렧梨렗^ 1111000111100000100011101011011011101010101001101101100011101000100011101011001010001110101011101111000111100001100011101011011011101100101100011000111010110011111100011110000010001110101101101110101010100110110110001110100010001110101100101000111010101110111100011110000110001110101101101110110010110001100011101010110001011110 f1e08eb6eaa6d8e88eb28eaef1e18eb6ecb18eb3f1e08eb6eaa6d8e88eb28eaef1e18eb6ecb18eac5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)