To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????v}B 001111110011111100111111001111110011111100111111011101100111110101000010 3f3f3f3f3f3f767d42
SJIS-WIN 蜈??邀??v}B 1110010110000101001111110011111111100111101100010011111100111111011101100111110101000010 e5853f3fe7b13f3f767d42
EUC-JP 蜈??邀??v}B 1110100111100101001111110011111111101110101100110011111100111111011101100111110101000010 e9e53f3feeb33f3f767d42
UTF-8 蜈랃쉐邀섓쉥v}B 111010001001110010001000111010111001111010000011111011001000100110010000111010011000001010000000111011001000010010010011111011001000100110100101011101100111110101000010 e89c88eb9e83ec8990e98280ec8493ec89a5767d42
UHC 蜈랃쉐邀섓쉥v}B 111010001010010110001101111011111011110110100110111010011010110110011000111011111011110110101011011101100111110101000010 e8a58defbda6e9ad98efbdab767d42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)