To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 軟??誼?????D軟??誼?????D^ 10010011111011100011111100111111100010110110001000111111001111110011111100111111001111110100010010010011111011100011111100111111100010110110001000111111001111110011111100111111001111110100010001011110 93ee3f3f8b623f3f3f3f3f4493ee3f3f8b623f3f3f3f3f445e
EUC-JP 軟??誼??洧??D軟??誼??洧??D^ 1100011011110000001111110011111110110101110000110011111100111111100011111100011110110100001111110011111101000100110001101111000000111111001111111011010111000011001111110011111110001111110001111011010000111111001111110100010001011110 c6f03f3fb5c33f3f8fc7b43f3f44c6f03f3fb5c33f3f8fc7b43f3f445e
UTF-8 軟먮챶誼썼뇛洧뺤졒D軟먮챶誼썼뇛洧뺤졒D^ 111010001011101110011111111010111010100010101110111011001011000110110110111010001010101010111100111011001000110110111100111010111000011110011011111001101011010010100111111010111011101010100100111011001010000110010010010001001110100010111011100111111110101110101000101011101110110010110001101101101110100010101010101111001110110010001101101111001110101110000111100110111110011010110100101001111110101110111010101001001110110010100001100100100100010001011110 e8bb9feba8aeecb1b6e8aabcec8dbceb879be6b4a7ebbaa4eca19244e8bb9feba8aeecb1b6e8aabcec8dbceb879be6b4a7ebbaa4eca192445e
UHC 軟먮챶誼썼뇛洧뺤졒D軟먮챶誼썼뇛洧뺤졒D^ 111001101110001110010000111010111010101010000011111010111111111010111101111010001000011110000110111010101111101110010101111011001010000010111111010001001110011011100011100100001110101110101010100000111110101111111110101111011110100010000111100001101110101011111011100101011110110010100000101111110100010001011110 e6e390ebaa83ebfebde88786eafb95eca0bf44e6e390ebaa83ebfebde88786eafb95eca0bf445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)