To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 顴擘壘顴擘壘^ 11101001010000011001110110100100100110101101110011101001010000011001110110100100100110101101110001011110 e9419da49adce9419da49adc5e
EUC-JP 顴擘壘顴擘壘^ 11110001101000101101101010100110110101001101111011110001101000101101101010100110110101001101111001011110 f1a2daa6d4def1a2daa6d4de5e
UTF-8 顴擘壘顴擘壘^ 11101001101000011011010011100110100100111001100011100101101000111001100011101001101000011011010011100110100100111001100011100101101000111001100001011110 e9a1b4e69398e5a398e9a1b4e69398e5a3985e
UHC ?擘壘?擘壘^ 0011111111011011111110111101011110100100001111111101101111111011110101111010010001011110 3fdbfbd7a43fdbfbd7a45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)