To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ç™²ëˆì±¶å®œë£ìŠ­ç™’ê¿ 1110011110011001101100101110101110001101100010001110110010110001101101101110010110101110100111001110101110100011100111011110110010001010101011011110011110011001100100101110101010111111 e799b2eb8d88ecb1b6e5ae9ceba39dec8aade79992eabf
SJIS-WIN ???????±¶????£????????? 0011111100111111001111110011111100111111001111110011111110000001011111011000000111110111001111110011111100111111001111111000000110010010001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f817d81f73f3f3f3f81923f3f3f3f3f3f3f3f3f
EUC-JP ç??ë??챶å®?ë£?ì??ç??ê¿ 10001111101010111010111000111111001111111000111110101011101100110011111100111111100011111010101111000000101000011101111010100010111110011000111110101011101010011000111110100010111011100011111110001111101010111011001110100001111100100011111110001111101010111100000000111111001111111000111110101011101011100011111100111111100011111010101110110100100011111010001011000100 8fabae3f3f8fabb33f3f8fabc0a1dea2f98faba98fa2ee3f8fabb3a1f23f8fabc03f3f8fabae3f3f8fabb48fa2c4
UTF-8 ç™²ëˆì±¶å®œë£ìŠ­ç™’ê¿ 11000011101001111100001010011001110000101011001011000011101010111100001010001101110000101000100011000011101011001100001010110001110000101011011011000011101001011100001010101110110000101001110011000011101010111100001010100011110000101001110111000011101011001100001010001010110000101010110111000011101001111100001010011001110000101001001011000011101010101100001010111111 c3a7c299c2b2c3abc28dc288c3acc2b1c2b6c3a5c2aec29cc3abc2a3c29dc3acc28ac2adc3a7c299c292c3aac2bf
UHC ??²????±¶?®??????­????¿ 0011111100111111101010011111011100111111001111110011111100111111101000011011111010100010110100100011111110100010111001110011111100111111001111110011111100111111001111111010000110101001001111110011111100111111001111111010001010101111 3f3fa9f73f3f3f3fa1bea2d23fa2e73f3f3f3f3f3fa1a93f3f3f3fa2af

SJIS-WIN, EUC-JP: Classic charsets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
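
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation, which is not shown on this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as are the helper names `encode_row` and `encode_table`. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which mirrors the `3f` runs visible in the table above. Individual byte values may still differ from the tool's output where the two implementations use different mapping tables.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("é").items():
        print(label, bits, hex_string)
```

For example, `encode_row("é", "utf-8")` yields the two bytes `c3a9`, while `encode_row("é", "iso-8859-1")` yields the single byte `e9`.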