To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 綜??刷?忽???B 10010001100011100011111100111111100011011111110000111111100011011001101000111111001111110011111101000010 918e3f3f8dfc3f8d9a3f3f3f42
EUC-JP 綜??刷?忽?飡?B 110000011110111000111111001111111011101011111110001111111011100111111010001111111000111111101000110010000011111101000010 c1ee3f3fbafe3fb9fa3f8fe8c83f42
UTF-8 綜렢뤳刷디忽퀛飡렦B 11100111101101101001110011101011101000001010001011101011101001001011001111100101100010001011011111101011100101001001010011100101101111111011110111101101100000001001101111101001101000111010000111101011101000001010011001000010 e7b69ceba0a2eba4b3e588b7eb9494e5bfbded809be9a3a1eba0a642
UHC 綜렢뤳刷디忽퀛飡렦B 11110000111111001000111010110011100011111110000111100001111011001011010111110000111110111110110010110011100011111110000111100010100011101011010101000010 f0fc8eb38fe1e1ecb5f0fbecb38fe1e28eb542

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A "?" (hex 3f) in a row marks a character that the charset cannot represent.
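
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the CODECS mapping and the function names encode_row and convert are hypothetical, and mapping SJIS-WIN to Python's cp932 codec and UHC to cp949 is an assumption. Characters a charset cannot represent are replaced with "?" (3f), as in the table, but the exact bytes may differ from this page's output where the converters' mapping tables disagree.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# "cp932" for SJIS-WIN and "cp949" for UHC are assumptions of this sketch.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("綜B").items():
        print(label, bits, hexa)
```

Each row printed by the sketch has the same layout as the table: the charset label, the bit string in binary, then the same bytes in hexadecimal.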