To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN タマチ」タ霏マチ」タ鐚タマチ」タ霏マチ」タ鐚^ 110000001100111111000001101000111100000011101000110000001100111111000001101000111100000011101000010110111100000011001111110000011010001111000000111010001100000011001111110000011010001111000000111010000101101101011110 c0cfc1a3c0e8c0cfc1a3c0e85bc0cfc1a3c0e8c0cfc1a3c0e85b5e
EUC-JP タマチ」タ霏マチ」タ鐚タマチ」タ霏マチ」タ鐚^ 100011101100000010001110110011111000111011000001100011101010001110001110110000001111000011000010100011101100111110001110110000011000111010100011100011101100000011101111101111001000111011000000100011101100111110001110110000011000111010100011100011101100000011110000110000101000111011001111100011101100000110001110101000111000111011000000111011111011110001011110 8ec08ecf8ec18ea38ec0f0c28ecf8ec18ea38ec0efbc8ec08ecf8ec18ea38ec0f0c28ecf8ec18ea38ec0efbc5e
UTF-8 タマチ」タ霏マチ」タ鐚タマチ」タ霏マチ」タ鐚^ 11101111101111101000000011101111101111101000111111101111101111101000000111101111101111011010001111101111101111101000000011101001100111001000111111101111101111101000111111101111101111101000000111101111101111011010001111101111101111101000000011101001100100001001101011101111101111101000000011101111101111101000111111101111101111101000000111101111101111011010001111101111101111101000000011101001100111001000111111101111101111101000111111101111101111101000000111101111101111011010001111101111101111101000000011101001100100001001101001011110 efbe80efbe8fefbe81efbda3efbe80e99c8fefbe8fefbe81efbda3efbe80e9909aefbe80efbe8fefbe81efbda3efbe80e99c8fefbe8fefbe81efbda3efbe80e9909a5e
UHC ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)