To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 鳶??映??悅??D鳶??映??悅??D^ 100100111100111000111111001111111000100101100110001111110011111111111010101111010011111100111111010001001001001111001110001111110011111110001001011001100011111100111111111110101011110100111111001111110100010001011110 93ce3f3f89663f3ffabd3f3f4493ce3f3f89663f3ffabd3f3f445e
EUC-JP 鳶??映?????D鳶??映?????D^ 11000110110100000011111100111111101100011100011100111111001111110011111100111111001111110100010011000110110100000011111100111111101100011100011100111111001111110011111100111111001111110100010001011110 c6d03f3fb1c73f3f3f3f3f44c6d03f3fb1c73f3f3f3f3f445e
UTF-8 鳶녺룑映든땰悅롨뮓D鳶녺룑映든땰悅롨뮓D^ 111010011011001110110110111010111000010110111010111010111010001110010001111001101001100010100000111010111001001110100000111010111001010110110000111001101000001010000101111010111010000110101000111010111010111010010011010001001110100110110011101101101110101110000101101110101110101110100011100100011110011010011000101000001110101110010011101000001110101110010101101100001110011010000010100001011110101110100001101010001110101110101110100100110100010001011110 e9b3b6eb85baeba391e698a0eb93a0eb95b0e68285eba1a8ebae9344e9b3b6eb85baeba391e698a0eb93a0eb95b0e68285eba1a8ebae93445e
UHC 鳶녺룑映든땰悅롨뮓D鳶녺룑映든땰悅롨뮓D^ 111001101110100110000110111001111000111110001110111001111011000110110101111001111000101110000110111001101110110110001110111010001001001010011111010001001110011011101001100001101110011110001111100011101110011110110001101101011110011110001011100001101110011011101101100011101110100010010010100111110100010001011110 e6e986e78f8ee7b1b5e78b86e6ed8ee8929f44e6e986e78f8ee7b1b5e78b86e6ed8ee8929f445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)