What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????????B | 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN | タマタアタマタアB | 11000000110011111100000010110001111100111111011011000000110011111100000010110001111100111111011001000010 | c0cfc0b1f3f6c0cfc0b1f3f642
EUC-JP | タマタア?タマタア?B | 10001110110000001000111011001111100011101100000010001110101100010011111110001110110000001000111011001111100011101100000010001110101100010011111101000010 | 8ec08ecf8ec08eb13f8ec08ecf8ec08eb13f42
UTF-8 | タマタアタマタアB | 11101111101111101000000011101111101111101000111111101111101111101000000011101111101111011011000111101110100010111010100111101111101111101000000011101111101111101000111111101111101111101000000011101111101111011011000111101110100010111010100101000010 | efbe80efbe8fefbe80efbdb1ee8ba9efbe80efbe8fefbe80efbdb1ee8ba942
UHC | ??????????B | 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f3f3f3f3f42

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
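
A similar conversion can be sketched in Python. This is a minimal illustration, not the code behind this page. The codec names in `CHARSETS` are assumptions about which Python codecs correspond to the charset labels above, and the helper names `encode_row` and `encode_table` are made up for this example. Characters a charset cannot represent are replaced with "?" (0x3f). Results for vendor-specific or private-use characters may differ from the page's output.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # "\uff71" is the half-width katakana letter A, followed by ASCII "B".
    for label, (binary, hexadecimal) in encode_table("\uff71B").items():
        print(label, binary, hexadecimal)
```

Half-width katakana takes one byte in CP932, two bytes in EUC-JP (prefixed with 0x8e), and three bytes in UTF-8, which explains the different bit-string lengths in the rows above.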