To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 æ›ºæ£•ç² 1110011010011011101110101110011010100011100101011110011110110010 e69bbae6a395e7b2
SJIS-WIN ????£??? 001111110011111100111111001111111000000110010010001111110011111100111111 3f3f3f3f81923f3f3f
EUC-JP æ?ºæ£?ç? 1000111110101001110000010011111110001111101000101110101110001111101010011100000110100001111100100011111110001111101010111010111000111111 8fa9c13f8fa2eb8fa9c1a1f23f8fabae3f
UTF-8 æ›ºæ£•ç² 11000011101001101100001010011011110000101011101011000011101001101100001010100011110000101001010111000011101001111100001010110010 c3a6c29bc2bac3a6c2a3c295c3a7c2b2
UHC æ?ºæ???² 101010011010000100111111101010001010110010101001101000010011111100111111001111111010100111110111 a9a13fa8aca9a13f3f3fa9f7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)