To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???M@???M@B 0011111100111111001111110100110101000000001111110011111100111111010011010100000001000010 3f3f3f4d403f3f3f4d4042
SJIS-WIN ?丞蛤M@?丞蛤M@B 001111111000111111100101100101001011100001001101010000000011111110001111111001011001010010111000010011010100000001000010 3f8fe594b84d403f8fe594b84d4042
EUC-JP 晛丞蛤M@晛丞蛤M@B 10001111110000101011101110111110111001111100100010111010010011010100000010001111110000101011101110111110111001111100100010111010010011010100000001000010 8fc2bbbee7c8ba4d408fc2bbbee7c8ba4d4042
UTF-8 晛丞蛤M@晛丞蛤M@B 1110011010011001100110111110010010111000100111101110100010011011101001000100110101000000111001101001100110011011111001001011100010011110111010001001101110100100010011010100000001000010 e6999be4b89ee89ba44d40e6999be4b89ee89ba44d4042
UHC 晛丞蛤M@晛丞蛤M@B 1111101011011001111000111010101011111001111011010100110101000000111110101101100111100011101010101111100111101101010011010100000001000010 fad9e3aaf9ed4d40fad9e3aaf9ed4d4042

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)