To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????B 00111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f42
SJIS-WIN 盖韈痣盖韈痣B 11100001101100111110100011100111111000010111101111100001101100111110100011100111111000010111101101000010 e1b3e8e7e17be1b3e8e7e17b42
EUC-JP 盖韈痣盖韈痣B 11100010101101011111000011101001111000011101110011100010101101011111000011101001111000011101110001000010 e2b5f0e9e1dce2b5f0e9e1dc42
UTF-8 盖韈痣盖韈痣B 11100111100110111001011011101001100111111000100011100111100101111010001111100111100110111001011011101001100111111000100011100111100101111010001101000010 e79b96e99f88e797a3e79b96e99f88e797a342
UHC 盖??盖??B 110010111100110000111111001111111100101111001100001111110011111101000010 cbcc3f3fcbcc3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)