What bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 晤????晤????B 10011101111010110011111100111111001111110011111110011101111010110011111100111111001111110011111101000010 9deb3f3f3f3f9deb3f3f3f3f42
EUC-JP 晤????晤????B 11011010111011010011111100111111001111110011111111011010111011010011111100111111001111110011111101000010 daed3f3f3f3fdaed3f3f3f3f42
UTF-8 晤볠뎸曆쫒晤볠뎸曆쫒B 11100110100110011010010011101011101100111010000011101011100011101011100011101111101001101000101111101100101010111001001011100110100110011010010011101011101100111010000011101011100011101011100011101111101001101000101111101100101010111001001001000010 e699a4ebb3a0eb8eb8efa68becab92e699a4ebb3a0eb8eb8efa68becab9242
UHC 晤볠뎸曆쫒晤볠뎸曆쫒B 111001111111101110010011111001101000100110001011111001101011011110100110011010011110011111111011100100111110011010001001100010111110011010110111101001100110100101000010 e7fb93e6898be6b7a669e7fb93e6898be6b7a66942

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
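
The conversion shown in the table above can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_report` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how the table appears to handle them.

```python
# Hypothetical helper: encode a string in several charsets and
# report the resulting bytes as binary and hexadecimal strings.

# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return a list of (label, binary string, hex string) tuples."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_report("B"):
        print(label, bits, hexstr)
```

Because "B" is an ASCII letter, every charset listed encodes it as the single byte 0x42, which matches the trailing `42` in each row of the table.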