What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 å߁㏃âì£ï§žâã½ì‚¨ï§žÑ 11100101110111111000000111100011100011111000001111100010111011001010001111101111101001111001111011100010111000111011110111101100100000101010100011101111101001111001111011010001 e5df81e38f83e2eca3efa79ee2e3bdec82a8efa79ed1
SJIS-WIN ????????£?§??????¨?§?? 0011111100111111001111110011111100111111001111110011111100111111100000011001001000111111100000011001100000111111001111110011111100111111001111110011111110000001010011100011111110000001100110000011111100111111 3f3f3f3f3f3f3f3f81923f81983f3f3f3f3f3f814e3f81983f3f
EUC-JP åß?ã??âì£ï§?âã?ì?¨ï§?Ñ 100011111010101110101001100011111010100111001110001111111000111110101011101010100011111100111111100011111010101110100100100011111010101111000000101000011111001010001111101010111100000110100001111110000011111110001111101010111010010010001111101010111010101000111111100011111010101111000000001111111010000110101111100011111010101111000001101000011111100000111111100011111010101011010000 8faba98fa9ce3f8fabaa3f3f8faba48fabc0a1f28fabc1a1f83f8faba48fabaa3f8fabc03fa1af8fabc1a1f83f8faad0
UTF-8 å߁㏃âì£ï§žâã½ì‚¨ï§žÑ 1100001110100101110000111001111111000010100000011100001110100011110000101000111111000010100000111100001110100010110000111010110011000010101000111100001110101111110000101010011111000010100111101100001110100010110000111010001111000010101111011100001110101100110000101000001011000010101010001100001110101111110000101010011111000010100111101100001110010001 c3a5c39fc281c3a3c28fc283c3a2c3acc2a3c3afc2a7c29ec3a2c3a3c2bdc3acc282c2a8c3afc2a7c29ec391
UHC ?ß????????§???½??¨?§?? 001111111010100110101100001111110011111100111111001111110011111100111111001111110011111110100001110101110011111100111111001111111010100011110110001111110011111110100001101001110011111110100001110101110011111100111111 3fa9ac3f3f3f3f3f3f3f3fa1d73f3f3fa8f63f3fa1a73fa1d73f3f

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
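
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the helper name `encode_bits` and the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumptions. Characters a charset cannot represent are replaced with `?` (0x3f), as in the table, but Python's codec tables may differ from the converter's for some characters.

```python
def encode_bits(text, charset):
    """Encode text in the given charset; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CHARSETS.items():
    binary, hexadecimal = encode_bits("å", codec)
    print(label, binary, hexadecimal)
```

For the single character å, the ISO-8859-1 result is e5 and the UTF-8 result is c3a5, matching the leading bytes of the corresponding rows above.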