To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 竣???頂蝎????B 1000111101110110001111110011111100111111100100101011100011100101100110010011111100111111001111110011111101000010 8f763f3f3f92b8e5993f3f3f3f42
EUC-JP 竣???頂蝎????B 1011110111010111001111110011111100111111110001001011101011101001111110010011111100111111001111110011111101000010 bdd73f3f3fc4bae9f93f3f3f3f42
UTF-8 竣얹렰렜頂蝎렢댓렰렑B 11100111101010111010001111101100100101101011100111101011101000001011000011101011101000001001110011101001101000001000001011101000100111011000111011101011101000001010001011101011100011001001001111101011101000001011000011101011101000001001000101000010 e7aba3ec96b9eba0b0eba09ce9a082e89d8eeba0a2eb8c93eba0b0eba09142
UHC 竣얹렰렜頂蝎렢댓렰렑B 111100011110001010111110111100011000111010111101100011101010111011110000101000101100101011101001100011101011001110110100111100011000111010111101100011101010011001000010 f1e2bef18ebd8eaef0a2cae98eb3b4f18ebd8ea642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)