What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ëþÀçÉþíù·ÞëþÀçÉþíù·Ü^ 111010111111111011000000111001111100100111111110111011011111100110110111110111101110101111111110110000001110011111001001111111101110110111111001101101111101110001011110 ebfec0e7c9feedf9b7deebfec0e7c9feedf9b7dc5e
SJIS-WIN ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
EUC-JP ëþÀçÉþíù?ÞëþÀçÉþíù?Ü^ 100011111010101110110011100011111010100111010000100011111010101010100010100011111010101110101110100011111010101010110001100011111010100111010000100011111010101110111111100011111010101111100011001111111000111110101001101100001000111110101011101100111000111110101001110100001000111110101010101000101000111110101011101011101000111110101010101100011000111110101001110100001000111110101011101111111000111110101011111000110011111110001111101010101110010001011110 8fabb38fa9d08faaa28fabae8faab18fa9d08fabbf8fabe33f8fa9b08fabb38fa9d08faaa28fabae8faab18fa9d08fabbf8fabe33f8faae45e
UTF-8 ëþÀçÉþíù·ÞëþÀçÉþíù·Ü^ 1100001110101011110000111011111011000011100000001100001110100111110000111000100111000011101111101100001110101101110000111011100111000010101101111100001110011110110000111010101111000011101111101100001110000000110000111010011111000011100010011100001110111110110000111010110111000011101110011100001010110111110000111001110001011110 c3abc3bec380c3a7c389c3bec3adc3b9c2b7c39ec3abc3bec380c3a7c389c3bec3adc3b9c2b7c39c5e
UHC ?þ???þ??·Þ?þ???þ??·?^ 00111111101010011010110100111111001111110011111110101001101011010011111100111111101000011010010010101000101011010011111110101001101011010011111100111111001111111010100110101101001111110011111110100001101001000011111101011110 3fa9ad3f3f3fa9ad3f3fa1a4a8ad3fa9ad3f3f3fa9ad3f3fa1a43f5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
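
A conversion like the one in the table can be approximated offline. The following is a minimal sketch in Python, not the page's actual implementation. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed mappings to Python's built-in codecs, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (hex 3f), so results for unmappable characters may differ from what the page shows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("^").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.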