To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 僥??悟?????[僥??悟?????[^ 10011001010001100011111100111111100011001110010100111111001111110011111100111111001111110101101110011001010001100011111100111111100011001110010100111111001111110011111100111111001111110101101101011110 99463f3f8ce53f3f3f3f3f5b99463f3f8ce53f3f3f3f3f5b5e
EUC-JP 僥??悟?????[僥??悟?????[^ 11010001101001110011111100111111101110001110011100111111001111110011111100111111001111110101101111010001101001110011111100111111101110001110011100111111001111110011111100111111001111110101101101011110 d1a73f3fb8e73f3f3f3f3f5bd1a73f3fb8e73f3f3f3f3f5b5e
UTF-8 僥뚮젡悟뽯젘溜븍젵[僥뚮젡悟뽯젘溜븍젵[^ 111001011000001110100101111010111001101010101110111011001010000010100001111001101000001010011111111010111011110110101111111011001010000010011000111011111010011110001011111010111011100010001101111011001010000010110101010110111110010110000011101001011110101110011010101011101110110010100000101000011110011010000010100111111110101110111101101011111110110010100000100110001110111110100111100010111110101110111000100011011110110010100000101101010101101101011110 e583a5eb9aaeeca0a1e6829febbdafeca098efa78bebb88deca0b55be583a5eb9aaeeca0a1e6829febbdafeca098efa78bebb88deca0b55b5e
UHC 僥뚮젡悟뽯젘溜븍젵[僥뚮젡悟뽯젘溜븍젵[^ 111010001110100110001100111010111010000010011010111001111111011010010110111010111010000010010100111010101111111010111010111010111010000010101001010110111110100011101001100011001110101110100000100110101110011111110110100101101110101110100000100101001110101011111110101110101110101110100000101010010101101101011110 e8e98ceba09ae7f696eba094eafebaeba0a95be8e98ceba09ae7f696eba094eafebaeba0a95b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)