To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ’CM’H’Mi’CM’H’MiB 1001001001000011010011011001001001001000100100100100110101101001100100100100001101001101100100100100100010010010010011010110100101000010 92434d9248924d6992434d9248924d6942
SJIS-WIN ?CM?H?Mi?CM?H?MiB 0011111101000011010011010011111101001000001111110100110101101001001111110100001101001101001111110100100000111111010011010110100101000010 3f434d3f483f4d693f434d3f483f4d6942
EUC-JP ?CM?H?Mi?CM?H?MiB 0011111101000011010011010011111101001000001111110100110101101001001111110100001101001101001111110100100000111111010011010110100101000010 3f434d3f483f4d693f434d3f483f4d6942
UTF-8 ’CM’H’Mi’CM’H’MiB 1100001010010010010000110100110111000010100100100100100011000010100100100100110101101001110000101001001001000011010011011100001010010010010010001100001010010010010011010110100101000010 c292434dc29248c2924d69c292434dc29248c2924d6942
UHC ?CM?H?Mi?CM?H?MiB 0011111101000011010011010011111101001000001111110100110101101001001111110100001101001101001111110100100000111111010011010110100101000010 3f434d3f483f4d693f434d3f483f4d6942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)