What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????儒??裔?????恂⑤????誼 001111110011111100111111001111110011111100111111100011101111001000111111001111111110010111100001001111110011111100111111001111110011111110011100100101101000011101000100001111110011111100111111001111111000101101100010 3f3f3f3f3f3f8ef23f3fe5e13f3f3f3f3f9c9687443f3f3f3f8b62
EUC-JP ???堉??儒??裔??沅??恂?????誼 001111110011111100111111100011111011011111111101001111110011111110111100111101000011111100111111111010101110001100111111001111111000111111000110111010010011111100111111110101111111011000111111001111110011111100111111001111111011010111000011 3f3f3f8fb7fd3f3fbcf43f3feae33f3f8fc6e93f3fd7f63f3f3f3f3fb5c3
UTF-8 列룸똾堉녑쑵儒몌펻裔꾨퀩沅숂뙴恂⑤쐧列룸씈誼 111011111010011010011100111010111010001110111000111010111001100010111110111001011010000010001001111010111000010110010001111011001001000110110101111001011000010010010010111010111010101010001100111011011000111010111011111010001010001110010100111010101011111010101000111011011000000010101001111001101011001010000101111011001000100010000010111010111001100110110100111001101000000110000010111000101001000110100100111011001001000010100111111011111010011010011100111010111010001110111000111011001001010010001000111010001010101010111100 efa69ceba3b8eb98bee5a089eb8591ec91b5e58492ebaa8ced8ebbe8a394eabea8ed80a9e6b285ec8882eb99b4e68182e291a4ec90a7efa69ceba3b8ec9488e8aabc
UHC 列룸똾堉녑쑵儒몌펻裔꾨퀩沅숂뙴恂⑤쐧列룸씈誼 1110011011101010101101111110101110001100100001001110101110111100101100111110010110111110101010101110101011100011101110001110111110111100100010111110011111100000100001001110101110110011100111011110101010110110100110011110011110001100101101111110001011100001101010001110101110011100100011001110011011101010101101111110101110011101101000001110101111111110 e6eab7eb8c84ebbcb3e5beaaeae3b8efbc8be7e084ebb39deab699e78cb7e2e1a8eb9c8ce6eab7eb9da0ebfe

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
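
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names (latin-1, cp932, euc_jp, utf-8, cp949) are assumptions about which Python codecs correspond to the charsets listed above, and the function name encode_table is hypothetical. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("あ"):
        print(label, binary, hexadecimal)
```

For the input "あ", the UTF-8 row is e38182 and the ISO-8859-1 row is 3f, because ISO-8859-1 has no code for that character.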