To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 韆タ螯ゥ煮詔ニ鞴?賤筰煮焦贖 111010001110011011000000111001011010011010101001100011101100111110001111110110011100011011101000111001000011111111100110110010111110001010101001100011101100111110001111110001011110011011011100 e8e6c0e5a6a98ecf8fd9c6e8e43fe6cbe2a98ecf8fc5e6dc
EUC-JP 韆タ螯ゥ煮詔ニ鞴渻賤筰煮焦贖 1111000011101000100011101100000011101010101010001000111010101001101111001101000110111110110110111000111011000110111100001110011010001111110001111110111111101100110011011110010010101011101111001101000110111110110001111110110011011110 f0e88ec0eaa88ea9bcd1bedb8ec6f0e68fc7efeccde4abbcd1bec7ecde
UTF-8 韆タ螯ゥ煮詔ニ鞴渻賤筰煮焦贖 111010011001111110000110111011111011111010000000111010001001111010101111111011111011110110101001111001111000010110101110111010001010100110010100111011111011111010000110111010011001111010110100111001101011100010111011111010001011001110100100111001111010110110110000111001111000010110101110111001111000010010100110111010001011010010010110 e99f86efbe80e89eafefbda9e785aee8a994efbe86e99eb4e6b8bbe8b3a4e7adb0e785aee784a6e8b496
UHC 韆???煮詔???賤?煮焦贖 111101001100011100111111001111110011111111101101101101001111000011011111001111110011111100111111111101001100000100111111111011011011010011110101101001011110000111011011 f4c73f3f3fedb4f0df3f3f3ff4c13fedb4f5a5e1db

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)