To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ç®´å‹ºé´ 1110011110101110101101001110010110001011101110101110100110110100 e7aeb4e58bbae9b4
SJIS-WIN ??´????´ 00111111001111111000000101001100001111110011111100111111001111111000000101001100 3f3f814c3f3f3f3f814c
EUC-JP ç®´å?ºé´ 1000111110101011101011101000111110100010111011101010000110101101100011111010101110101001001111111000111110100010111010111000111110101011101100011010000110101101 8fabae8fa2eea1ad8faba93f8fa2eb8fabb1a1ad
UTF-8 ç®´å‹ºé´ 11000011101001111100001010101110110000101011010011000011101001011100001010001011110000101011101011000011101010011100001010110100 c3a7c2aec2b4c3a5c28bc2bac3a9c2b4
UHC ?®´??º?´ 001111111010001011100111101000101010010100111111001111111010100010101100001111111010001010100101 3fa2e7a2a53f3fa8ac3fa2a5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)