Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 鐔醐醜鐔種愁鐔種州 111010000101110010001100111011011000111101011000111010000101110010001110111011011000111101000100111010000101110010001110111011011000111101000010 e85c8ced8f58e85c8eed8f44e85c8eed8f42
EUC-JP 鐔醐醜鐔種愁鐔種州 111011111011110110111000111011111011110110111001111011111011110110111100111011111011110110100101111011111011110110111100111011111011110110100011 efbdb8efbdb9efbdbcefbda5efbdbcefbda3
UTF-8 鐔醐醜鐔種愁鐔種州 111010011001000010010100111010011000011010010000111010011000011010011100111010011001000010010100111001111010100010101110111001101000010010000001111010011001000010010100111001111010100010101110111001011011011110011110 e99094e98690e9869ce99094e7a8aee68481e99094e7a8aee5b79e
UHC ??醜?種愁?種州 0011111100111111111101011101110100111111111100001111101011100001111111100011111111110000111110101111000110110110 3f3ff5dd3ff0fae1fe3ff0faf1b6

SJIS-win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
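
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `bit_strings` and the codec names (`cp932` for SJIS-win, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the ISO-8859-1 and UHC rows above.

```python
# Hypothetical helper: encode text in a given charset and return
# its bytes as a binary bit string and a hexadecimal string.
def bit_strings(text, charset):
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CHARSETS.items():
    binary, hexadecimal = bit_strings("あ", codec)
    print(label, binary, hexadecimal)
```

Calling `bit_strings("あ", "utf-8")`, for example, returns the 24-bit binary string together with `e38182`, while the same character in ISO-8859-1 falls back to `3f`.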