To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 垓??塢垓??塢n}垓??塢垓??塢n{^ 1001101010110100001111110011111110011010110001111001101010110100001111110011111110011010110001110110111001111101100110101011010000111111001111111001101011000111100110101011010000111111001111111001101011000111011011100111101101011110 9ab43f3f9ac79ab43f3f9ac76e7d9ab43f3f9ac79ab43f3f9ac76e7b5e
EUC-JP 垓??塢垓??塢n}垓??塢垓??塢n{^ 1101010010110110001111110011111111010100110010011101010010110110001111110011111111010100110010010110111001111101110101001011011000111111001111111101010011001001110101001011011000111111001111111101010011001001011011100111101101011110 d4b63f3fd4c9d4b63f3fd4c96e7dd4b63f3fd4c9d4b63f3fd4c96e7b5e
UTF-8 垓렫戶塢垓렫戶塢n}垓렫戶塢垓렫戶塢n{^ 1110010110011110100100111110101110100000101010111110011010001000101101101110010110100001101000101110010110011110100100111110101110100000101010111110011010001000101101101110010110100001101000100110111001111101111001011001111010010011111010111010000010101011111001101000100010110110111001011010000110100010111001011001111010010011111010111010000010101011111001101000100010110110111001011010000110100010011011100111101101011110 e59e93eba0abe688b6e5a1a2e59e93eba0abe688b6e5a1a26e7de59e93eba0abe688b6e5a1a2e59e93eba0abe688b6e5a1a26e7b5e
UHC 垓렫戶塢垓렫戶塢n}垓렫戶塢垓렫戶塢n{^ 11111010101001111000111010111001111110111100001011100111111100011111101010100111100011101011100111111011110000101110011111110001011011100111110111111010101001111000111010111001111110111100001011100111111100011111101010100111100011101011100111111011110000101110011111110001011011100111101101011110 faa78eb9fbc2e7f1faa78eb9fbc2e7f16e7dfaa78eb9fbc2e7f1faa78eb9fbc2e7f16e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)