To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 造??脹?漿????造??脹?漿????B 100100011010001000111111001111111001001010101111001111111001111111110111001111110011111100111111001111111001000110100010001111110011111110010010101011110011111110011111111101110011111100111111001111110011111101000010 91a23f3f92af3f9ff73f3f3f3f91a23f3f92af3f9ff73f3f3f3f42
EUC-JP 造??脹?漿鈒???造??脹?漿鈒???B 11000010101001000011111100111111110001001011000100111111110111101111100110001111111000111100001000111111001111110011111111000010101001000011111100111111110001001011000100111111110111101111100110001111111000111100001000111111001111110011111101000010 c2a43f3fc4b13fdef98fe3c23f3f3fc2a43f3fc4b13fdef98fe3c23f3f3f42
UTF-8 造섦뤚脹흙漿鈒뇜롉롃造섦뤚脹흙漿鈒뇜롉롃B 11101001100000001010000011101100100001001010011011101011101001001001101011101000100001001011100111101101100111011001100111100110101111001011111111101001100010001001001011101011100001111001110011101011101000011000100111101011101000011000001111101001100000001010000011101100100001001010011011101011101001001001101011101000100001001011100111101101100111011001100111100110101111001011111111101001100010001001001011101011100001111001110011101011101000011000100111101011101000011000001101000010 e980a0ec84a6eba49ae884b9ed9d99e6bcbfe98892eb879ceba189eba183e980a0ec84a6eba49ae884b9ed9d99e6bcbfe98892eb879ceba189eba18342
UHC 造섦뤚脹흙漿鈒뇜롉롃造섦뤚脹흙漿鈒뇜롉롃B 1111000011100011101111001011010010001111110010011111001111101100110010001110101111101101111011001101111110111100101100111111110110001110110011111000111011001010111100001110001110111100101101001000111111001001111100111110110011001000111010111110110111101100110111111011110010110011111111011000111011001111100011101100101001000010 f0e3bcb48fc9f3ecc8ebedecdfbcb3fd8ecf8ecaf0e3bcb48fc9f3ecc8ebedecdfbcb3fd8ecf8eca42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)