To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN 堯?錚坤堯?錚鵠^ 111010101001111100111111111010000100001010001101101000111110101010011111001111111110100001000010100011011001010001011110 ea9f3fe8428da3ea9f3fe8428d945e
EUC-JP 堯?錚坤堯?錚鵠^ 111101001010000100111111111011111010001110111010101001011111010010100001001111111110111110100011101110011111010001011110 f4a13fefa3baa5f4a13fefa3b9f45e
UTF-8 堯렭錚坤堯렭錚鵠^ 11100101101000001010111111101011101000001010110111101001100011001001101011100101100111011010010011100101101000001010111111101011101000001010110111101001100011001001101011101001101101011010000001011110 e5a0afeba0ade98c9ae59da4e5a0afeba0ade98c9ae9b5a05e
UHC 堯렭錚坤堯렭錚鵠^ 1110100011101011100011101011101011101110101101101100110111011110111010001110101110001110101110101110111010110110110011011101110001011110 e8eb8ebaeeb6cddee8eb8ebaeeb6cddc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)