What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????^ | 00111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f5e
SJIS-WIN | 霆顔泓霆顔汞^ | 11101000101110111000101011100111100111111001011111101000101110111000101011100111100111111000011101011110 | e8bb8ae79f97e8bb8ae79f875e
EUC-JP | 霆顔泓霆顔汞^ | 11110000101111011011010011101001110111011111011111110000101111011011010011101001110111011110011101011110 | f0bdb4e9ddf7f0bdb4e9dde75e
UTF-8 | 霆顔泓霆顔汞^ | 11101001100111001000011011101001101000011001010011100110101100111001001111101001100111001000011011101001101000011001010011100110101100011001111001011110 | e99c86e9a194e6b393e99c86e9a194e6b19e5e
UHC | 霆顔泓霆顔汞^ | 11101111111111011110010011010100111110111111001011101111111111011110010011010100111110111111000101011110 | effde4d4fbf2effde4d4fbf15e

SJIS-WIN, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
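
A conversion like the one in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the function name `encode_table` is hypothetical, and the mapping from the charset labels to Python codec names (in particular SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is assumed to be what produces the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, with no separators.
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("霆顔泓霆顔汞^"):
        print(label, binary, hexadecimal)
```

Each byte contributes exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.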