What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????D??????????D^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000100001111110011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f3f445e
SJIS-WIN ??ビ?萸??ビ??D??ビ?萸??ビ??D^ 0011111100111111100000110111001000111111111001001100111000111111001111111000001101110010001111110011111101000100001111110011111110000011011100100011111111100100110011100011111100111111100000110111001000111111001111110100010001011110 3f3f83723fe4ce3f3f83723f3f443f3f83723fe4ce3f3f83723f3f445e
EUC-JP ??ビ?萸??ビ??D??ビ?萸??ビ??D^ 0011111100111111101001011101001100111111111010001101000000111111001111111010010111010011001111110011111101000100001111110011111110100101110100110011111111101000110100000011111100111111101001011101001100111111001111110100010001011110 3f3fa5d33fe8d03f3fa5d33f3f443f3fa5d33fe8d03f3fa5d33f3f445e
UTF-8 룶핊ビ룫萸룶핊ビ룫琉D룶핊ビ룫萸룶핊ビ룫琉D^ 111010111010001110110110111011011001010110001010111000111000001110010011111010111010001110101011111010001001000010111000111010111010001110110110111011011001010110001010111000111000001110010011111010111010001110101011111011111010011110001100010001001110101110100011101101101110110110010101100010101110001110000011100100111110101110100011101010111110100010010000101110001110101110100011101101101110110110010101100010101110001110000011100100111110101110100011101010111110111110100111100011000100010001011110 eba3b6ed958ae38393eba3abe890b8eba3b6ed958ae38393eba3abefa78c44eba3b6ed958ae38393eba3abe890b8eba3b6ed958ae38393eba3abefa78c445e
UHC 룶핊ビ룫萸룶핊ビ룫琉D룶핊ビ룫萸룶핊ビ룫琉D^ 10001111101010111100000010001111101010111101001110001111101000101110101110101101100011111010101111000000100011111010101111010011100011111010001011101011101001000100010010001111101010111100000010001111101010111101001110001111101000101110101110101101100011111010101111000000100011111010101111010011100011111010001011101011101001000100010001011110 8fabc08fabd38fa2ebad8fabc08fabd38fa2eba4448fabc08fabd38fa2ebad8fabc08fabd38fa2eba4445e

SJIS-win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-win = CP932) and on UNIX (EUC-JP).
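
The conversion shown in the table can be reproduced offline. The following is a minimal Python sketch, not the code behind this page. The function name `to_bit_strings` and the `CODECS` mapping are made up for illustration, and the choice of Python codec for each charset label (in particular `cp932` for SJIS-win and `cp949` for UHC) is an assumption that may differ from this tool in edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how such characters appear in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("ビD^", codec)
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("ビ", "cp932")` returns the hexadecimal string `8372`, and the same character in UTF-8 gives `e38393`, matching the byte sequences for that character in the table.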