To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????I?????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001001001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f493f3f3f3f3f3f
SJIS-WIN ???臾????????恂⑤?I???臾?? 0011111100111111001111111110010001101011001111110011111100111111001111110011111100111111001111110011111110011100100101101000011101000100001111110100100100111111001111110011111111100100011010110011111100111111 3f3f3fe46b3f3f3f3f3f3f3f3f9c9687443f493f3f3fe46b3f3f
EUC-JP ???臾?????沅??恂??I???臾?? 001111110011111100111111111001111100110000111111001111110011111100111111001111111000111111000110111010010011111100111111110101111111011000111111001111110100100100111111001111110011111111100111110011000011111100111111 3f3f3fe7cc3f3f3f3f3f8fc6e93f3fd7f63f3f493f3f3fe7cc3f3f
UTF-8 列룸뿊臾덈콢閱곕뀍沅잏뙴恂⑤쐧I列룸뿊臾덈콢 11101111101001101001110011101011101000111011100011101011101111111000101011101000100001111011111011101011100011011000100011101100101111011010001011101001100101101011000111101010101100111001010111101011100000001000110111100110101100101000010111101100100111101000111111101011100110011011010011100110100000011000001011100010100100011010010011101100100100001010011101001001111011111010011010011100111010111010001110111000111010111011111110001010111010001000011110111110111010111000110110001000111011001011110110100010 efa69ceba3b8ebbf8ae887beeb8d88ecbda2e996b1eab395eb808de6b285ec9e8feb99b4e68182e291a4ec90a749efa69ceba3b8ebbf8ae887beeb8d88ecbda2
UHC 列룸뿊臾덈콢閱곕뀍沅잏뙴恂⑤쐧I列룸뿊臾덈콢 11100110111010101011011111101011100101111001000111101011101011001000100011101011101100011001101011100110111100111011000011101011100001011000100011101010101101101001111111100111100011001011011111100010111000011010100011101011100111001000110001001001111001101110101010110111111010111001011110010001111010111010110010001000111010111011000110011010 e6eab7eb9791ebac88ebb19ae6f3b0eb8588eab69fe78cb7e2e1a8eb9c8c49e6eab7eb9791ebac88ebb19a

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)