To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f |
| SJIS-WIN | 厭??揄??? | 100010010111110100111111001111111001110110001001001111110011111100111111 | 897d3f3f9d893f3f3f |
| EUC-JP | 厭??揄??荑 | 1011000111011110001111110011111111011001111010010011111100111111100011111101011111111001 | b1de3f3fd9e93f3f8fd7f9 |
| UTF-8 | 厭묎퍐揄껈쉬荑 | 111001011000111010101101111010111010110010001110111011011000110110010000111001101000111110000100111010101011101110001000111011001000100110101100111010001000110110010001 | e58eadebac8eed8d90e68f84eabb88ec89ace88d91 |
| UHC | 厭묎퍐揄껈쉬荑 | 1110011011110100100100011110101010111011100001111110101011110001100000111110100110111101101011001110110010111111 | e6f491eabb87eaf183e9bdacecbf |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, SJIS-Win (= CP932) on Windows and EUC-JP on UNIX.
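
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation: the function names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (hex 3f), which matches the `?` entries in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("厭").items():
        print(label, bits, hex_string)
```

Using `errors="replace"` keeps one output row per charset even when the input contains characters outside that charset's repertoire; a stricter tool could use the default `errors="strict"` and report the failure instead.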