Which bit string does a character (or string of characters) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 銹???舍獅??銹?也銹???舍獅??銹?也^ 111001111111011000111111001111110011111111100100011100011000111010000010001111110011111111100111111101100011111110010110111001111110011111110110001111110011111100111111111001000111000110001110100000100011111100111111111001111111011000111111100101101110011101011110 e7f63f3f3fe4718e823f3fe7f63f96e7e7f63f3f3fe4718e823f3fe7f63f96e75e
EUC-JP 銹???舍獅??銹?也銹???舍獅??銹?也^ 111011101111100000111111001111110011111111100111110100101011101111100010001111110011111111101110111110000011111111001100111010011110111011111000001111110011111100111111111001111101001010111011111000100011111100111111111011101111100000111111110011001110100101011110 eef83f3f3fe7d2bbe23f3feef83fcce9eef83f3f3fe7d2bbe23f3feef83fcce95e
UTF-8 銹뤒뷜헤舍獅헥쨘銹롐也銹뤒뷜헤舍獅헥쨘銹롐也^ 11101001100010101011100111101011101001001001001011101011101101111001110011101101100101111010010011101000100010001000110111100111100011011000010111101101100101111010010111101100101010001001100011101001100010101011100111101011101000011001000011100100101110011001111111101001100010101011100111101011101001001001001011101011101101111001110011101101100101111010010011101000100010001000110111100111100011011000010111101101100101111010010111101100101010001001100011101001100010101011100111101011101000011001000011100100101110011001111101011110 e98ab9eba492ebb79ced97a4e8888de78d85ed97a5eca898e98ab9eba190e4b99fe98ab9eba492ebb79ced97a4e8888de78d85ed97a5eca898e98ab9eba190e4b99f5e
UHC 銹뤒뷜헤舍獅헥쨘銹롐也銹뤒뷜헤舍獅헥쨘銹롐也^ 111000101100100010001111110000101011101011100010110001111110110011011110111011001101111011100010110001111110110111000010101110101110001011001000100011101101011011100101101001011110001011001000100011111100001010111010111000101100011111101100110111101110110011011110111000101100011111101101110000101011101011100010110010001000111011010110111001011010010101011110 e2c88fc2bae2c7ecdeecdee2c7edc2bae2c88ed6e5a5e2c88fc2bae2c7ecdeecdee2c7edc2bae2c88ed6e5a55e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
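
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The helper names to_bit_strings and convert are hypothetical, and the mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (hex 3f), which appears to be what the table does as well.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("也^").items():
        print(label, binary, hexadecimal)
```

ASCII characters such as "^" yield the same single byte (5e) in all five charsets, while a character such as 也 yields a different multi-byte sequence in each charset that contains it.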