To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???鷹ダ◇?碇ダ?????鷹ダ????? 00111111001111110011111110010001111010011000001101011111100000011001111000111111100100101111010010000011010111110011111100111111001111110011111100111111100100011110100110000011010111110011111100111111001111110011111100111111 3f3f3f91e9835f819e3f92f4835f3f3f3f3f3f91e9835f3f3f3f3f3f
EUC-JP ???鷹ダ◇?碇ダ?薏???鷹ダ?薏??? 0011111100111111001111111100001011101011101001011100000010100001111111100011111111000100111101101010010111000000001111111000111111011001110111100011111100111111001111111100001011101011101001011100000000111111100011111101100111011110001111110011111100111111 3f3f3fc2eba5c0a1fe3fc4f6a5c03f8fd9de3f3f3fc2eba5c03f8fd9de3f3f3f
UTF-8 룶엌룫鷹ダ◇룫碇ダ룫薏룶엌룫鷹ダ룫薏룶엌룫 111010111010001110110110111011001001011110001100111010111010001110101011111010011011011110111001111000111000001110000000111000101001011110000111111010111010001110101011111001111010001010000111111000111000001110000000111010111010001110101011111010001001011010001111111010111010001110110110111011001001011110001100111010111010001110101011111010011011011110111001111000111000001110000000111010111010001110101011111010001001011010001111111010111010001110110110111011001001011110001100111010111010001110101011 eba3b6ec978ceba3abe9b7b9e38380e29787eba3abe7a287e38380eba3abe8968feba3b6ec978ceba3abe9b7b9e38380eba3abe8968feba3b6ec978ceba3ab
UHC 룶엌룫鷹ダ◇룫碇ダ룫薏룶엌룫鷹ダ룫薏룶엌룫 100011111010101110111110111111011000111110100010111010111110110110101011110000001010000111011110100011111010001011101111111011011010101111000000100011111010001011101011111110111000111110101011101111101111110110001111101000101110101111101101101010111100000010001111101000101110101111111011100011111010101110111110111111011000111110100010 8fabbefd8fa2ebedabc0a1de8fa2efedabc08fa2ebfb8fabbefd8fa2ebedabc08fa2ebfb8fabbefd8fa2

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
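
The conversion in the table can be reproduced offline. The following is a minimal sketch in Python, not the code behind this page. It assumes Python's built-in codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) correspond to the charsets listed above, and the function name `to_bit_strings` is made up for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which would explain the runs of `3f` in the ISO-8859-1 row.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("鷹", codec)
        print(label, binary, hexadecimal)
```

For the single character 鷹 this gives `91e9` in SJIS-Win, `c2eb` in EUC-JP, and `e9b7b9` in UTF-8, matching the corresponding byte pairs and triples in the table rows above.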