To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 厄β????酉?? 100101101110111110000011110000000011111100111111001111110011111110010011110100010011111100111111 96ef83c03f3f3f3f93d13f3f
EUC-JP 厄β????酉?? 110011001111000110100110110000100011111100111111001111110011111111000110110100110011111100111111 ccf1a6c23f3f3f3fc6d33f3f
UTF-8 厄β돦杻쒍틠酉몄뿉 1110010110001110100001001100111010110010111010111000111110100110111011111010011110001000111011001001001010001101111011011000101110100000111010011000010110001001111010111010101010000100111010111011111110001001 e58e84ceb2eb8fa6efa788ec928ded8ba0e98589ebaa84ebbf89
UHC 厄β돦杻쒍틠酉몄뿉 111001001111100010100101111000101000100110101010111010101111010010011100111001001011101010001100111010111011011110111000111011001001011110010000 e4f8a5e289aaeaf49ce4ba8cebb7b8ec9790

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)