To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 載????齎??長魄載????齎??長白^ 1000110111011010001111110011111100111111001111111110011011011000001111110011111110010010101101111110100110101110100011011101101000111111001111110011111100111111111001101101100000111111001111111001001010110111100101001001001001011110 8dda3f3f3f3fe6d83f3f92b7e9ae8dda3f3f3f3fe6d83f3f92b794925e
EUC-JP 載????齎??長魄載????齎??長白^ 1011101011011100001111110011111100111111001111111110110011011010001111110011111111000100101110011111001010110000101110101101110000111111001111110011111100111111111011001101101000111111001111111100010010111001110001111111001001011110 badc3f3f3f3fecda3f3fc4b9f2b0badc3f3f3f3fecda3f3fc4b9c7f25e
UTF-8 載얜렜언윌齎흩윌長魄載얜렜언윌齎흩윌長白^ 11101000101111001000100111101100100101101001110011101011101000001001110011101100100101101011100011101100100111001000110011101001101111011000111011101101100111011010100111101100100111001000110011101001100101011011011111101001101011011000010011101000101111001000100111101100100101101001110011101011101000001001110011101100100101101011100011101100100111001000110011101001101111011000111011101101100111011010100111101100100111001000110011101001100101011011011111100111100110011011110101011110 e8bc89ec969ceba09cec96b8ec9c8ce9bd8eed9da9ec9c8ce995b7e9ad84e8bc89ec969ceba09cec96b8ec9c8ce9bd8eed9da9ec9c8ce995b7e799bd5e
UHC 載얜렜언윌齎흩윌長魄載얜렜언윌齎흩윌長白^ 1110111010110000101111101110101110001110101011101011111011110000110000001010101011101110101100101100100011110000110000001010101011101101111111101101101111011110111011101011000010111110111010111000111010101110101111101111000011000000101010101110111010110010110010001111000011000000101010101110110111111110110110111101110001011110 eeb0beeb8eaebef0c0aaeeb2c8f0c0aaedfedbdeeeb0beeb8eaebef0c0aaeeb2c8f0c0aaedfedbdc5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table above.
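
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be the closest Python equivalents of the charsets listed, and the helper name `encode_table` is hypothetical. Unencodable characters are replaced with "?" via `errors="replace"`, which mirrors the 0x3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, displayed text, binary string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Characters the charset cannot represent become "?" (0x3f).
        data = text.encode(codec, errors="replace")
        # Decode again to see what survives the round trip.
        shown = data.decode(codec)
        # Eight bits per byte, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hex_string in encode_table("白^"):
        print(label, shown, bits, hex_string)
```

Running the script on "白^" prints one row per charset in the same column order as the table. Codec coverage can differ slightly between implementations, so rare characters may not match this page byte for byte.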