What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 蔔守イ蝉邀型 11100100111101111000111011100111101100101001000011100100111101111000111011100111101100011000110001011110 e4f78ee7b290e4f78ee7b18c5e
EUC-JP 蔔守イ蝉?邀型 11101000111110011011110011101001100011101011001011000000111001100011111111101110101100111011011110111111 e8f9bce98eb2c0e63feeb3b7bf
UTF-8 蔔守イ蝉邀型 111010001001010010010100111001011010111010001000111011111011110110110010111010001001110110001001111011101001010110110001111010011000001010000000111001011001111010001011 e89494e5ae88efbdb2e89d89ee95b1e98280e59e8b
UHC 蔔守???邀型 1101110011011011111000011111101000111111001111110011111111101001101011011111101011111110 dcdbe1fa3f3f3fe9adfafe

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
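
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown here. The function names (encode_row, encode_table) are hypothetical, and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes seen in the ISO-8859-1 and UHC rows. Python's codecs may differ from the page's converter in edge cases.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (byte 0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one line per charset: label, bit string, hex string.
    for label, (binary, hexed) in encode_table("あ").items():
        print(label, binary, hexed)
```

Each byte is rendered as eight binary digits, so the bit string is always four times as long as the hex string.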