What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????[????[^ 0011111100111111001111110011111101011011001111110011111100111111001111110101101101011110 3f3f3f3f5b3f3f3f3f5b5e
SJIS-WIN ?惺?惺[?惺?惺[^ 001111111001110010110111001111111001110010110111010110110011111110011100101101110011111110011100101101110101101101011110 3f9cb73f9cb75b3f9cb73f9cb75b5e
EUC-JP ?惺?惺[?惺?惺[^ 001111111101100010111001001111111101100010111001010110110011111111011000101110010011111111011000101110010101101101011110 3fd8b93fd8b95b3fd8b93fd8b95b5e
UTF-8 蟬惺蟬惺[蟬惺蟬惺[^ 111010001001111110101100111001101000001110111010111010001001111110101100111001101000001110111010010110111110100010011111101011001110011010000011101110101110100010011111101011001110011010000011101110100101101101011110 e89face683bae89face683ba5be89face683bae89face683ba5b5e
UHC 蟬惺蟬惺[蟬惺蟬惺[^ 11100000110100011110000011110110111000001101000111100000111101100101101111100000110100011110000011110110111000001101000111100000111101100101101101011110 e0d1e0f6e0d1e0f65be0d1e0f6e0d1e0f65b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
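
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The helper name encode_row and the mapping of the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches what the ISO-8859-1 row shows. Python's mapping tables may differ from this page's converter for some characters, so individual rows are not guaranteed to match.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (characters as stored, binary bit string, hex string) for text in codec."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to see which characters survived the round trip.
    shown = raw.decode(codec, errors="replace")
    # Eight zero-padded bits per byte, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    sample = "\u87ec\u60fa[^"  # the characters 蟬惺 followed by "[^"
    for label, codec in CHARSETS.items():
        shown, bits, hex_string = encode_row(sample, codec)
        # Only the ASCII-safe columns are printed here.
        print(label, bits, hex_string)
```

Calling encode_row("蟬惺", "utf-8") yields the hex string e89face683ba, the same byte sequence that appears in the UTF-8 row above.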