What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
--- | --- | --- | ---
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 譏比サ呎エゥ鬣 | 1110011010011000100101001110010010111011100110011110011010110100101010011110100110100101 | e69894e4bb99e6b4a9e9a5
EUC-JP | 譏比サ呎エゥ鬣 | 1110101111111000110010001110011010001110101110111101001011101000100011101011010010001110101010011111001010100111 | ebf8c8e68ebbd2e88eb48ea9f2a7
UTF-8 | 譏比サ呎エゥ鬣 | 111010001010110110001111111001101010111110010100111011111011110110111011111001011001000110001110111011111011110110110100111011111011110110101001111010011010110010100011 | e8ad8fe6af94efbdbbe5918eefbdb4efbda9e9aca3
UHC | 譏比????? | 110100011100000111011101111011110011111100111111001111110011111100111111 | d1c1ddef3f3f3f3f3f

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
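
A conversion like the one in the table can be reproduced offline. The following is a minimal Python sketch, not the code behind this page. The helper names `to_bits` and `encode_table` are hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed to be the closest equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Return one (label, binary, hex) row per charset for the given text."""
    return [(label, *to_bits(text, codec)) for label, codec in CHARSETS.items()]


if __name__ == "__main__":
    # Print one row per charset for a sample character.
    for label, binary, hexadecimal in encode_table("あ"):
        print(label, binary, hexadecimal)
```

Each byte is formatted as exactly eight bits, so the binary column is always eight times as long as the byte count and four times as long as the hexadecimal column.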