To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ???艾??癌?? 0011111100111111001111111110010010001000001111110011111110001010111000000011111100111111 3f3f3fe4883f3f8ae03f3f
EUC-JP ???艾??癌?? 0011111100111111001111111110011111101000001111110011111110110100111000100011111100111111 3f3f3fe7e83f3fb4e23f3f
UTF-8 筽욅빓艾뉐컽癌듣쾫 111001111010110110111101111011001001101010000101111010111011100110010011111010001000100110111110111010111000100110010000111011001011101110111101111001111001100110001100111010111001001110100011111011001011111010101011 e7adbdec9a85ebb993e889beeb8990ecbbbde7998ceb93a3ecbeab
UHC 筽욅빓艾뉐컽癌듣쾫 111010001010010010011110111001111001010110110111111001001111010110000111111001011011000010011110111001001101111110110101111010001011001010000010 e8a49ee795b7e4f587e5b09ee4dfb5e8b282

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)