To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???CM???H???M???BB 001111110011111100111111010000110100110100111111001111110011111101001000001111110011111100111111010011010011111100111111001111110100001001000010 3f3f3f434d3f3f3f483f3f3f4d3f3f3f4242
SJIS-WIN 晶ェCM晶ェH晶ェM晶ェBB 1000111110111011111101001000111010101010010000110100110110001111101110111111010010001110101010100100100010001111101110111111010010001110101010100100110110001111101110111111010010001110101010100100001001000010 8fbbf48eaa434d8fbbf48eaa488fbbf48eaa4d8fbbf48eaa4242
EUC-JP 晶?ェCM晶?ェH晶?ェM晶?ェBB 1011111010111101001111111000111010101010010000110100110110111110101111010011111110001110101010100100100010111110101111010011111110001110101010100100110110111110101111010011111110001110101010100100001001000010 bebd3f8eaa434dbebd3f8eaa48bebd3f8eaa4dbebd3f8eaa4242
UTF-8 晶ェCM晶ェH晶ェM晶ェBB 111001101001100110110110111011101000110010111101111011111011110110101010010000110100110111100110100110011011011011101110100011001011110111101111101111011010101001001000111001101001100110110110111011101000110010111101111011111011110110101010010011011110011010011001101101101110111010001100101111011110111110111101101010100100001001000010 e699b6ee8cbdefbdaa434de699b6ee8cbdefbdaa48e699b6ee8cbdefbdaa4de699b6ee8cbdefbdaa4242
UHC 晶??CM晶??H晶??M晶??BB 11101111110111000011111100111111010000110100110111101111110111000011111100111111010010001110111111011100001111110011111101001101111011111101110000111111001111110100001001000010 efdc3f3f434defdc3f3f48efdc3f3f4defdc3f3f4242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)