To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ðÛðïzðÛðïzB 1111000011011011111100001110111101111010111100001101101111110000111011110111101001000010 f0dbf0ef7af0dbf0ef7a42
SJIS-WIN ????z????zB 0011111100111111001111110011111101111010001111110011111100111111001111110111101001000010 3f3f3f3f7a3f3f3f3f7a42
EUC-JP ðÛðïzðÛðïzB 100011111010100111000011100011111010101011100101100011111010100111000011100011111010101111000001011110101000111110101001110000111000111110101010111001011000111110101001110000111000111110101011110000010111101001000010 8fa9c38faae58fa9c38fabc17a8fa9c38faae58fa9c38fabc17a42
UTF-8 ðÛðïzðÛðïzB 11000011101100001100001110011011110000111011000011000011101011110111101011000011101100001100001110011011110000111011000011000011101011110111101001000010 c3b0c39bc3b0c3af7ac3b0c39bc3b0c3af7a42
UHC ð?ð?zð?ð?zB 101010011010001100111111101010011010001100111111011110101010100110100011001111111010100110100011001111110111101001000010 a9a33fa9a33f7aa9a33fa9a33f7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)