To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????W}?????????W{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101011101111101001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 壕蛟????濠?砒W}壕蛟????濠?砒W{^ 10001101100010001110010110000000001111110011111100111111001111111000110110001010001111111110000111100101010101110111110110001101100010001110010110000000001111110011111100111111001111111000110110001010001111111110000111100101010101110111101101011110 8d88e5803f3f3f3f8d8a3fe1e5577d8d88e5803f3f3f3f8d8a3fe1e5577b5e
EUC-JP 壕蛟????濠?砒W}壕蛟????濠?砒W{^ 10111001111010001110100111100000001111110011111100111111001111111011100111101010001111111110001011100111010101110111110110111001111010001110100111100000001111110011111100111111001111111011100111101010001111111110001011100111010101110111101101011110 b9e8e9e03f3f3f3fb9ea3fe2e7577db9e8e9e03f3f3f3fb9ea3fe2e7577b5e
UTF-8 壕蛟렱폈렰렍濠얗砒W}壕蛟렱폈렰렍濠얗砒W{^ 1110010110100011100101011110100010011011100111111110101110100000101100011110110110001111100010001110101110100000101100001110101110100000100011011110011010111111101000001110110010010110100101111110011110100000100100100101011101111101111001011010001110010101111010001001101110011111111010111010000010110001111011011000111110001000111010111010000010110000111010111010000010001101111001101011111110100000111011001001011010010111111001111010000010010010010101110111101101011110 e5a395e89b9feba0b1ed8f88eba0b0eba08de6bfa0ec9697e7a092577de5a395e89b9feba0b1ed8f88eba0b0eba08de6bfa0ec9697e7a092577b5e
UHC 壕蛟렱폈렰렍濠얗砒W}壕蛟렱폈렰렍濠얗砒W{^ 1111101110111101110011101111000110001110101111101100011011110001100011101011110110001110101000111111101111001100101111101110100111011101111101110101011101111101111110111011110111001110111100011000111010111110110001101111000110001110101111011000111010100011111110111100110010111110111010011101110111110111010101110111101101011110 fbbdcef18ebec6f18ebd8ea3fbccbee9ddf7577dfbbdcef18ebec6f18ebd8ea3fbccbee9ddf7577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)