To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????W}?????????W{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101011101111101001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 鉛?????嶸??W}鉛?????嶸??W{^ 100010011001010000111111001111110011111100111111001111111111101010110100001111110011111101010111011111011000100110010100001111110011111100111111001111110011111111111010101101000011111100111111010101110111101101011110 89943f3f3f3f3ffab43f3f577d89943f3f3f3f3ffab43f3f577b5e
EUC-JP 鉛?????嶸??W}鉛?????嶸??W{^ 1011000111110100001111110011111100111111001111110011111110001111101110111111010000111111001111110101011101111101101100011111010000111111001111110011111100111111001111111000111110111011111101000011111100111111010101110111101101011110 b1f43f3f3f3f3f8fbbf43f3f577db1f43f3f3f3f3f8fbbf43f3f577b5e
UTF-8 鉛녿젶黎좊젻嶸뤿젪W}鉛녿젶黎좊젻嶸뤿젪W{^ 1110100110001001100110111110101110000101101111111110110010100000101101101110111110100110100010011110110010100010100010101110110010100000101110111110010110110110101110001110101110100100101111111110110010100000101010100101011101111101111010011000100110011011111010111000010110111111111011001010000010110110111011111010011010001001111011001010001010001010111011001010000010111011111001011011011010111000111010111010010010111111111011001010000010101010010101110111101101011110 e9899beb85bfeca0b6efa689eca28aeca0bbe5b6b8eba4bfeca0aa577de9899beb85bfeca0b6efa689eca28aeca0bbe5b6b8eba4bfeca0aa577b5e
UHC 鉛녿젶黎좊젻嶸뤿젪W}鉛녿젶黎좊젻嶸뤿젪W{^ 1110011011100111100001101110101110100000101010101110011010110001101000001110101110100000101011101110011110101110100011111110101110100000101000100101011101111101111001101110011110000110111010111010000010101010111001101011000110100000111010111010000010101110111001111010111010001111111010111010000010100010010101110111101101011110 e6e786eba0aae6b1a0eba0aee7ae8feba0a2577de6e786eba0aae6b1a0eba0aee7ae8feba0a2577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)