To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???[???[^ 001111110011111100111111010110110011111100111111001111110101101101011110 3f3f3f5b3f3f3f5b5e
SJIS-WIN ??彦[??彦[^ 0011111100111111100101010100011001011011001111110011111110010101010001100101101101011110 3f3f95465b3f3f95465b5e
EUC-JP ??彦[??彦[^ 0011111100111111110010011010011101011011001111110011111111001001101001110101101101011110 3f3fc9a75b3f3fc9a75b5e
UTF-8 뤳떞彦[뤳떞彦[^ 111010111010010010110011111010111001011010011110111001011011110110100110010110111110101110100100101100111110101110010110100111101110010110111101101001100101101101011110 eba4b3eb969ee5bda65beba4b3eb969ee5bda65b5e
UHC 뤳떞彦[뤳떞彦[^ 100011111110000110001011101101001110010111101001010110111000111111100001100010111011010011100101111010010101101101011110 8fe18bb4e5e95b8fe18bb4e5e95b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)