To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????[????[^ 0011111100111111001111110011111101011011001111110011111100111111001111110101101101011110 3f3f3f3f5b3f3f3f3f5b5e
SJIS-WIN 褻短褻短[褻短褻短[^ 11100101111101101001001001011010111001011111011010010010010110100101101111100101111101101001001001011010111001011111011010010010010110100101101101011110 e5f6925ae5f6925a5be5f6925ae5f6925a5b5e
EUC-JP 褻短褻短[褻短褻短[^ 11101010111110001100001110111011111010101111100011000011101110110101101111101010111110001100001110111011111010101111100011000011101110110101101101011110 eaf8c3bbeaf8c3bb5beaf8c3bbeaf8c3bb5b5e
UTF-8 褻短褻短[褻短褻短[^ 111010001010010010111011111001111001111110101101111010001010010010111011111001111001111110101101010110111110100010100100101110111110011110011111101011011110100010100100101110111110011110011111101011010101101101011110 e8a4bbe79fade8a4bbe79fad5be8a4bbe79fade8a4bbe79fad5b5e
UHC 褻短褻短[褻短褻短[^ 11100000111000011101001110101101111000001110000111010011101011010101101111100000111000011101001110101101111000001110000111010011101011010101101101011110 e0e1d3ade0e1d3ad5be0e1d3ade0e1d3ad5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)