Which bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN タハシスタハシスB 11000000110010101011110011110101110000101011110111000000110010101011110011110101110000101011110101000010 c0cabcf5c2bdc0cabcf5c2bd42
EUC-JP タハシ?スタハシ?スB 10001110110000001000111011001010100011101011110000111111100011101011110110001110110000001000111011001010100011101011110000111111100011101011110101000010 8ec08eca8ebc3f8ebd8ec08eca8ebc3f8ebd42
UTF-8 タハシスタハシスB 11101111101111101000000011101111101111101000101011101111101111011011110011101110100100001010110111101111101111011011110111101111101111101000000011101111101111101000101011101111101111011011110011101110100100001010110111101111101111011011110101000010 efbe80efbe8aefbdbcee90adefbdbdefbe80efbe8aefbdbcee90adefbdbd42
UHC ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
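
A conversion like the one in the table can be approximated offline with a short Python sketch. The mapping from the page's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, as are the helper names `encode_row` and `convert`; the page's own implementation is not known. Python's `cp932` codec may also treat user-defined characters differently from the tool that produced the table, so the output is not guaranteed to match for every input.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (byte 0x3f) for characters the
    # charset cannot represent, like the 3f bytes in the table above.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # U+FF80 is a half-width katakana letter, followed by ASCII "B".
    for label, (bits, hexed) in convert("\uff80B").items():
        print(label, bits, hexed)
```

For the two-character input U+FF80 followed by "B", the UTF-8 row comes out as `efbe8042` and the EUC-JP row as `8ec042`, consistent with the first and last bytes of the corresponding rows in the table.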