To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????GB?????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100011101000010001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f47423f3f3f3f3f3f
SJIS-WIN ???臾????????恂⑤?GB???臾?? 001111110011111100111111111001000110101100111111001111110011111100111111001111110011111100111111001111111001110010010110100001110100010000111111010001110100001000111111001111110011111111100100011010110011111100111111 3f3f3fe46b3f3f3f3f3f3f3f3f9c9687443f47423f3f3fe46b3f3f
EUC-JP ???臾?????沅??恂??GB???臾?? 00111111001111110011111111100111110011000011111100111111001111110011111100111111100011111100011011101001001111110011111111010111111101100011111100111111010001110100001000111111001111110011111111100111110011000011111100111111 3f3f3fe7cc3f3f3f3f3f8fc6e93f3fd7f63f3f47423f3f3fe7cc3f3f
UTF-8 列룸뿊臾덈콢閱곕뀍沅잏뙴恂⑤쐧GB列룸뿊臾덈콢 1110111110100110100111001110101110100011101110001110101110111111100010101110100010000111101111101110101110001101100010001110110010111101101000101110100110010110101100011110101010110011100101011110101110000000100011011110011010110010100001011110110010011110100011111110101110011001101101001110011010000001100000101110001010010001101001001110110010010000101001110100011101000010111011111010011010011100111010111010001110111000111010111011111110001010111010001000011110111110111010111000110110001000111011001011110110100010 efa69ceba3b8ebbf8ae887beeb8d88ecbda2e996b1eab395eb808de6b285ec9e8feb99b4e68182e291a4ec90a74742efa69ceba3b8ebbf8ae887beeb8d88ecbda2
UHC 列룸뿊臾덈콢閱곕뀍沅잏뙴恂⑤쐧GB列룸뿊臾덈콢 1110011011101010101101111110101110010111100100011110101110101100100010001110101110110001100110101110011011110011101100001110101110000101100010001110101010110110100111111110011110001100101101111110001011100001101010001110101110011100100011000100011101000010111001101110101010110111111010111001011110010001111010111010110010001000111010111011000110011010 e6eab7eb9791ebac88ebb19ae6f3b0eb8588eab69fe78cb7e2e1a8eb9c8c4742e6eab7eb9791ebac88ebb19a

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)