To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 茴域、茴域、^ 11100100101000001000100011100110100000010100000111100100101000001000100011100110100000010100000101011110 e4a088e68141e4a088e681415e
EUC-JP 茴域、茴域、^ 11101000101000101011000011101000101000011010001011101000101000101011000011101000101000011010001001011110 e8a2b0e8a1a2e8a2b0e8a1a25e
UTF-8 茴域、茴域、^ 11101000100011001011010011100101100111111001111111100011100000001000000111101000100011001011010011100101100111111001111111100011100000001000000101011110 e88cb4e59f9fe38081e88cb4e59f9fe380815e
UHC 茴域、茴域、^ 11111100111011011110011010110100101000011010001011111100111011011110011010110100101000011010001001011110 fcede6b4a1a2fcede6b4a1a25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)