To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ??????掖骸?[??????掖骸?[^ 00111111001111110011111100111111001111110011111110011101011101001000101001011011001111110101101100111111001111110011111100111111001111110011111110011101011101001000101001011011001111110101101101011110 3f3f3f3f3f3f9d748a5b3f5b3f3f3f3f3f3f9d748a5b3f5b5e
EUC-JP ??????掖骸?[??????掖骸?[^ 00111111001111110011111100111111001111110011111111011001110101011011001110111100001111110101101100111111001111110011111100111111001111110011111111011001110101011011001110111100001111110101101101011110 3f3f3f3f3f3fd9d5b3bc3f5b3f3f3f3f3f3fd9d5b3bc3f5b5e
UTF-8 렱蓮뤰탮퐥얏掖骸렰[렱蓮뤰탮퐥얏掖骸렰[^ 111010111010000010110001111011111010011010011001111010111010010010110000111011011000001110101110111011011001000010100101111011001001011010001111111001101000111010010110111010011010101010111000111010111010000010110000010110111110101110100000101100011110111110100110100110011110101110100100101100001110110110000011101011101110110110010000101001011110110010010110100011111110011010001110100101101110100110101010101110001110101110100000101100000101101101011110 eba0b1efa699eba4b0ed83aeed90a5ec968fe68e96e9aab8eba0b05beba0b1efa699eba4b0ed83aeed90a5ec968fe68e96e9aab8eba0b05b5e
UHC 렱蓮뤰탮퐥얏掖骸렰[렱蓮뤰탮퐥얏掖骸렰[^ 100011101011111011100110111001011000111111011110101101011000111010111101100011101011111011100110111001001111101011111010101101011000111010111101010110111000111010111110111001101110010110001111110111101011010110001110101111011000111010111110111001101110010011111010111110101011010110001110101111010101101101011110 8ebee6e58fdeb58ebd8ebee6e4fafab58ebd5b8ebee6e58fdeb58ebd8ebee6e4fafab58ebd5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)