To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????^ | 00111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f5e |
| SJIS-WIN | ?孟吻?孟吻^ | 0011111110010110110100001001010110101011001111111001011011010000100101011010101101011110 | 3f96d095ab3f96d095ab5e |
| EUC-JP | ?孟吻?孟吻^ | 0011111111001100110100101100101010101101001111111100110011010010110010101010110101011110 | 3fccd2caad3fccd2caad5e |
| UTF-8 | 料孟吻料孟吻^ | 11101111101001101011111011100101101011011001111111100101100100001011101111101111101001101011111011100101101011011001111111100101100100001011101101011110 | efa6bee5ad9fe590bbefa6bee5ad9fe590bb5e |
| UHC | 料孟吻料孟吻^ | 11101000111101111101100011101011110110011111110011101000111101111101100011101011110110011111110001011110 | e8f7d8ebd9fce8f7d8ebd9fc5e |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
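
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the converter's own implementation (which is not shown on this page). The codec names chosen for each charset (`cp932` for SJIS-win, `cp949` for UHC, and so on) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the behavior visible in the table, although individual mappings may differ between libraries.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
# These pairings are assumptions; the original tool may use different tables.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what actually survived the conversion.
    shown = raw.decode(codec, errors="replace")
    # Each byte is written as eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def encode_table(text):
    """Encode the same text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print only the label and hex column to keep the output ASCII-safe.
    for label, (_, _, hex_string) in encode_table("孟吻^").items():
        print(f"{label}\t{hex_string}")
```

For example, `encode_row("孟^", "iso-8859-1")` yields `?^` with the bytes `3f5e`, because 孟 has no ISO-8859-1 representation, while the same call with `"utf-8"` yields the bytes `e5ad9f5e`.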