What bit string does a character encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 èî”å±…èä 1110100011101110100101001110010110110001100001011110100011100100 e8ee94e5b185e8e4
SJIS-WIN ????±??? 001111110011111100111111001111111000000101111101001111110011111100111111 3f3f3f3f817d3f3f3f
EUC-JP èî?å±?èä 10001111101010111011001010001111101010111100001000111111100011111010101110101001101000011101111000111111100011111010101110110010100011111010101110100011 8fabb28fabc23f8faba9a1de3f8fabb28faba3
UTF-8 èî”å±…èä 11000011101010001100001110101110110000101001010011000011101001011100001010110001110000101000010111000011101010001100001110100100 c3a8c3aec294c3a5c2b1c285c3a8c3a4
UHC ????±??? 001111110011111100111111001111111010000110111110001111110011111100111111 3f3f3f3fa1be3f3f3f

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
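
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page: the function name `encode_table` is hypothetical, and it assumes that unrepresentable characters are replaced with "?" (0x3f), which is consistent with the SJIS-WIN, EUC-JP, and UHC rows above. Python's `cp932`, `euc_jp`, and `cp949` codecs are assumed to be the nearest equivalents of SJIS-WIN, EUC-JP, and UHC, and their output may differ from the table in some cases.

```python
def encode_table(text, charsets):
    """Return {charset: (bit_string, hex_string)} for `text`.

    Hypothetical helper for illustration only.
    """
    table = {}
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for any character
        # that the target charset cannot represent.
        raw = text.encode(name, errors="replace")
        # Render every byte as eight binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        table[name] = (bits, raw.hex())
    return table


if __name__ == "__main__":
    sample = "\u00e8\u00b1"  # the characters "è" and "±"
    charsets = ["iso-8859-1", "utf-8", "ascii"]
    for name, (bits, hexa) in encode_table(sample, charsets).items():
        print(name, bits, hexa)
```

Passing a different list of codec names, for example `["cp932", "euc_jp", "cp949"]`, extends the comparison to the other charsets in the table.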