To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 穩??澳?????揶?穩??澳?????揶?B 1110001001110010001111110011111111100000010100110011111100111111001111110011111100111111100111011000100000111111111000100111001000111111001111111110000001010011001111110011111100111111001111110011111110011101100010000011111101000010 e2723f3fe0533f3f3f3f3f9d883fe2723f3fe0533f3f3f3f3f9d883f42
EUC-JP 穩??澳??煐??揶?穩??澳??煐??揶?B 111000111101001100111111001111111101111110110100001111110011111110001111110010011111100000111111001111111101100111101000001111111110001111010011001111110011111111011111101101000011111100111111100011111100100111111000001111110011111111011001111010000011111101000010 e3d33f3fdfb43f3f8fc9f83f3fd9e83fe3d33f3fdfb43f3f8fc9f83f3fd9e83f42
UTF-8 穩븝쉿澳묈만煐면뿥揶뭭穩븝쉿澳묈만煐면뿥揶뭭B 11100111101010011010100111101011101110001001110111101100100010011011111111100110101111101011001111101011101011001000100011101011101001111000110011100111100001011001000011101011101010011011010011101011101111111010010111100110100011111011011011101011101011011010110111100111101010011010100111101011101110001001110111101100100010011011111111100110101111101011001111101011101011001000100011101011101001111000110011100111100001011001000011101011101010011011010011101011101111111010010111100110100011111011011011101011101011011010110101000010 e7a9a9ebb89dec89bfe6beb3ebac88eba78ce78590eba9b4ebbfa5e68fb6ebadade7a9a9ebb89dec89bfe6beb3ebac88eba78ce78590eba9b4ebbfa5e68fb6ebadad42
UHC 穩븝쉿澳묈만煐면뿥揶뭭穩븝쉿澳묈만煐면뿥揶뭭B 111010001011000110111010111011111011110110110010111001111111111010010001111001011011100010111000111001111011110010111000111010011001011110100101111001011010101010010010011101101110100010110001101110101110111110111101101100101110011111111110100100011110010110111000101110001110011110111100101110001110100110010111101001011110010110101010100100100111011001000010 e8b1baefbdb2e7fe91e5b8b8e7bcb8e997a5e5aa9276e8b1baefbdb2e7fe91e5b8b8e7bcb8e997a5e5aa927642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)