To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN タ鈆」チ」タ鈆」チ」B 11000000111110111100000110100011110000011010001111000000111110111100000110100011110000011010001101000010 c0fbc1a3c1a3c0fbc1a3c1a342
EUC-JP タ鈆」チ」タ鈆」チ」B 1000111011000000100011111110001110111100100011101010001110001110110000011000111010100011100011101100000010001111111000111011110010001110101000111000111011000001100011101010001101000010 8ec08fe3bc8ea38ec18ea38ec08fe3bc8ea38ec18ea342
UTF-8 タ鈆」チ」タ鈆」チ」B 11101111101111101000000011101001100010001000011011101111101111011010001111101111101111101000000111101111101111011010001111101111101111101000000011101001100010001000011011101111101111011010001111101111101111101000000111101111101111011010001101000010 efbe80e98886efbda3efbe81efbda3efbe80e98886efbda3efbe81efbda342
UHC ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)