To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????z????z[????z????z[^ 0011111100111111001111110011111101111010001111110011111100111111001111110111101001011011001111110011111100111111001111110111101000111111001111110011111100111111011110100101101101011110 3f3f3f3f7a3f3f3f3f7a5b3f3f3f3f7a3f3f3f3f7a5b5e
SJIS-WIN ?頃??z?頃??z[?頃??z?頃??z[^ 001111111000110110100000001111110011111101111010001111111000110110100000001111110011111101111010010110110011111110001101101000000011111100111111011110100011111110001101101000000011111100111111011110100101101101011110 3f8da03f3f7a3f8da03f3f7a5b3f8da03f3f7a3f8da03f3f7a5b5e
EUC-JP 塡頃??z塡頃??z[塡頃??z塡頃??z[^ 1000111110111000101101001011101010100010001111110011111101111010100011111011100010110100101110101010001000111111001111110111101001011011100011111011100010110100101110101010001000111111001111110111101010001111101110001011010010111010101000100011111100111111011110100101101101011110 8fb8b4baa23f3f7a8fb8b4baa23f3f7a5b8fb8b4baa23f3f7a8fb8b4baa23f3f7a5b5e
UTF-8 塡頃렰렔z塡頃렰렔z[塡頃렰렔z塡頃렰렔z[^ 11100101101000011010000111101001101000001000001111101011101000001011000011101011101000001001010001111010111001011010000110100001111010011010000010000011111010111010000010110000111010111010000010010100011110100101101111100101101000011010000111101001101000001000001111101011101000001011000011101011101000001001010001111010111001011010000110100001111010011010000010000011111010111010000010110000111010111010000010010100011110100101101101011110 e5a1a1e9a083eba0b0eba0947ae5a1a1e9a083eba0b0eba0947a5be5a1a1e9a083eba0b0eba0947ae5a1a1e9a083eba0b0eba0947a5b5e
UHC 塡頃렰렔z塡頃렰렔z[塡頃렰렔z塡頃렰렔z[^ 111011101111001111001100111100011000111010111101100011101010100101111010111011101111001111001100111100011000111010111101100011101010100101111010010110111110111011110011110011001111000110001110101111011000111010101001011110101110111011110011110011001111000110001110101111011000111010101001011110100101101101011110 eef3ccf18ebd8ea97aeef3ccf18ebd8ea97a5beef3ccf18ebd8ea97aeef3ccf18ebd8ea97a5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)