To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ðúÐ莨Äñ޽ޢéëÞïŽßÂëÓø 11110000111110101101000011101000100011101010100011000100111100011000111010111101100011101010001011101001111010111101111011101111100011101101111111000010111010111101001111111000 f0fad0e88ea8c4f18ebd8ea2e9ebdeef8edfc2ebd3f8
SJIS-WIN ?????¨?????¢?????????? 001111110011111100111111001111110011111110000001010011100011111100111111001111110011111100111111100000011001000100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f814e3f3f3f3f3f81913f3f3f3f3f3f3f3f3f3f
EUC-JP ðú?è?¨Äñ???¢éëÞï?ßÂëÓø 10001111101010011100001110001111101010111110001000111111100011111010101110110010001111111010000110101111100011111010101010100011100011111010101111010000001111110011111100111111101000011111000110001111101010111011000110001111101010111011001110001111101010011011000010001111101010111100000100111111100011111010100111001110100011111010101010100100100011111010101110110011100011111010101011010001100011111010100111001100 8fa9c38fabe23f8fabb23fa1af8faaa38fabd03f3f3fa1f18fabb18fabb38fa9b08fabc13f8fa9ce8faaa48fabb38faad18fa9cc
UTF-8 ðúÐ莨Äñ޽ޢéëÞïŽßÂëÓø 1100001110110000110000111011101011000011100100001100001110101000110000101000111011000010101010001100001110000100110000111011000111000010100011101100001010111101110000101000111011000010101000101100001110101001110000111010101111000011100111101100001110101111110000101000111011000011100111111100001110000010110000111010101111000011100100111100001110111000 c3b0c3bac390c3a8c28ec2a8c384c3b1c28ec2bdc28ec2a2c3a9c3abc39ec3afc28ec39fc382c3abc393c3b8
UHC ð?Ð??¨???½????Þ??ß???ø 1010100110100011001111111010100010100010001111110011111110100001101001110011111100111111001111111010100011110110001111110011111100111111001111111010100010101101001111110011111110101001101011000011111100111111001111111010100110101010 a9a33fa8a23f3fa1a73f3f3fa8f63f3f3f3fa8ad3f3fa9ac3f3f3fa9aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)