What bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ?????????? | 00111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f |
| SJIS-WIN | 蠍エ隶添蠍エ隶貞カク | 11100101101101101011010011101000101011101001001101011001111001011011011010110100111010001010111010010010111001011011011010111000 | e5b6b4e8ae9359e5b6b4e8ae92e5b6b8 |
| EUC-JP | 蠍エ隶添蠍エ隶貞カク | 1110101010111000100011101011010011110000101100001100010110111010111010101011100010001110101101001111000010110000110001001110011110001110101101101000111010111000 | eab88eb4f0b0c5baeab88eb4f0b0c4e78eb68eb8 |
| UTF-8 | 蠍エ隶添蠍エ隶貞カク | 111010001010000010001101111011111011110110110100111010011001101010110110111001101011011110111011111010001010000010001101111011111011110110110100111010011001101010110110111010001011001010011110111011111011110110110110111011111011110110111000 | e8a08defbdb4e99ab6e6b7bbe8a08defbdb4e99ab6e8b29eefbdb6efbdb8 |
| UHC | ???添???貞?? | 001111110011111100111111111101001101010100111111001111110011111111101111111101100011111100111111 | 3f3f3ff4d53f3f3feff63f3f |

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
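
The same kind of conversion can be sketched in Python. This is only an illustration, not the code behind this page: the function name `encode_table` and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) are assumptions, and Python's codec tables may differ from this tool's in edge cases. Characters that a charset cannot represent are replaced with `?` (0x3f), as in the results above.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes b"?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary string is always four times as long as the hexadecimal one.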