To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 澳?????節?????澳?????辱ц? 1110000001010011001111110011111100111111001111110011111110010000110111110011111100111111001111110011111100111111111000000101001100111111001111110011111100111111001111111001000001001010100001001000100000111111 e0533f3f3f3f3f90df3f3f3f3f3fe0533f3f3f3f3f904a84883f
EUC-JP 澳?????節??旿??澳?????辱ц? 11011111101101000011111100111111001111110011111100111111110000001110000100111111001111111000111111000001111101000011111100111111110111111011010000111111001111110011111100111111001111111011111110101011101001111110100000111111 dfb43f3f3f3f3fc0e13f3f8fc1f43f3fdfb43f3f3f3f3fbfaba7e83f
UTF-8 澳뉛쉈樂뽫뵽節뤄슛旿울쉽澳뉛쉈樂뽬쎗辱ц뜵 1110011010111110101100111110101110001001100110111110110010001001100010001110111110100110101111111110101110111101101010111110101110110101101111011110011110101111100000001110101110100100100001001110110010001010100110111110011010010111101111111110110010011010101110001110110010001001101111011110011010111110101100111110101110001001100110111110110010001001100010001110111110100110101111111110101110111101101011001110110010001110100101111110100010111110101100011101000110000110111010111001110010110101 e6beb3eb899bec8988efa6bfebbdabebb5bde7af80eba484ec8a9be697bfec9ab8ec89bde6beb3eb899bec8988efa6bfebbdacec8e97e8beb1d186eb9cb5
UHC 澳뉛쉈樂뽫뵽節뤄슛旿울쉽澳뉛쉈樂뽬쎗辱ц뜵 111001111111111010000111111011111011110110100101111010001111100110010110111001111001010010111011111011111011110110110111111011111011110110111000111001111111101010111111111011111011110110110001111001111111111010000111111011111011110110100101111010001111100110010110111010001001101110111110111010011011010010101100111010001000110110110011 e7fe87efbda5e8f996e794bbefbdb7efbdb8e7fabfefbdb1e7fe87efbda5e8f996e89bbee9b4ace88db3

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
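
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name `encode_all` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {label: (binary_string, hex_string)} for each charset."""
    result = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        result[label] = (bits, data.hex())
    return result


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("あ").items():
        print(label, bits, hex_string)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.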