To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??C???C?^ 001111110011111101000011001111110011111100111111010000110011111101011110 3f3f433f3f3f433f5e
SJIS-WIN ?男C箕?男C箕^ 00111111100100100110101001000011100101101010010100111111100100100110101001000011100101101010010101011110 3f926a4396a53f926a4396a55e
EUC-JP ?男C箕?男C箕^ 00111111110000111100101101000011110011001010011100111111110000111100101101000011110011001010011101011110 3fc3cb43cca73fc3cb43cca75e
UTF-8 뤶男C箕뤶男C箕^ 111010111010010010110110111001111001010010110111010000111110011110101110100101011110101110100100101101101110011110010100101101110100001111100111101011101001010101011110 eba4b6e794b743e7ae95eba4b6e794b743e7ae955e
UHC 뤶男C箕뤶男C箕^ 100011111110010011010001111110110100001111010001101110011000111111100100110100011111101101000011110100011011100101011110 8fe4d1fb43d1b98fe4d1fb43d1b95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)