To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?\?????????Hr[?\?????????Hr[^ 0011111101011100001111110011111100111111001111110011111100111111001111110011111100111111010010000111001001011011001111110101110000111111001111110011111100111111001111110011111100111111001111110011111101001000011100100101101101011110 3f5c3f3f3f3f3f3f3f3f3f48725b3f5c3f3f3f3f3f3f3f3f3f48725b5e
SJIS-WIN 巍\俎????????Hr[巍\俎????????Hr[^ 100110111101100101011100100110001101011100111111001111110011111100111111001111110011111100111111001111110100100001110010010110111001101111011001010111001001100011010111001111110011111100111111001111110011111100111111001111110011111101001000011100100101101101011110 9bd95c98d73f3f3f3f3f3f3f3f48725b9bd95c98d73f3f3f3f3f3f3f3f48725b5e
EUC-JP 巍\俎????????Hr[巍\俎????????Hr[^ 110101101101101101011100110100001101100100111111001111110011111100111111001111110011111100111111001111110100100001110010010110111101011011011011010111001101000011011001001111110011111100111111001111110011111100111111001111110011111101001000011100100101101101011110 d6db5cd0d93f3f3f3f3f3f3f3f48725bd6db5cd0d93f3f3f3f3f3f3f3f48725b5e
UTF-8 巍\俎렯롏렯렶렯롏렯렏Hr[巍\俎렯롏렯렶렯롏렯렏Hr[^ 111001011011011110001101010111001110010010111111100011101110101110100000101011111110101110100001100011111110101110100000101011111110101110100000101101101110101110100000101011111110101110100001100011111110101110100000101011111110101110100000100011110100100001110010010110111110010110110111100011010101110011100100101111111000111011101011101000001010111111101011101000011000111111101011101000001010111111101011101000001011011011101011101000001010111111101011101000011000111111101011101000001010111111101011101000001000111101001000011100100101101101011110 e5b78d5ce4bf8eeba0afeba18feba0afeba0b6eba0afeba18feba0afeba08f48725be5b78d5ce4bf8eeba0afeba18feba0afeba0b6eba0afeba18feba0afeba08f48725b5e
UHC 巍\俎렯롏렯렶렯롏렯렏Hr[巍\俎렯롏렯렶렯롏렯렏Hr[^ 11101000111001000101110011110000101110111000111010111100100011101101010110001110101111001000111011000001100011101011110010001110110101011000111010111100100011101010010101001000011100100101101111101000111001000101110011110000101110111000111010111100100011101101010110001110101111001000111011000001100011101011110010001110110101011000111010111100100011101010010101001000011100100101101101011110 e8e45cf0bb8ebc8ed58ebc8ec18ebc8ed58ebc8ea548725be8e45cf0bb8ebc8ed58ebc8ec18ebc8ed58ebc8ea548725b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)