Re: spidering with swish

From: Bill Moseley <moseley(at)>
Date: Wed Jan 05 2005 - 21:03:41 GMT
On Wed, Jan 05, 2005 at 12:15:48PM -0800, Lance Perry wrote:
> I am spidering a site (spidering is being called from the swish indexing).
> The site contains .exe and .zip files. I DO NOT want those files to be
> indexed (or even downloaded).

You do it the same way as the example in the docs for skipping .gif,
.jpeg and .png files, but specify \.exe and \.zip instead. Alternatively,
use robots.txt to list the files.
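
As a rough illustration of the pattern test described above, here is a
minimal Python sketch. It is not the spider's actual configuration (that
is written in Perl, and the docs example is the reference); the names
should_fetch and SKIP_PATTERN are hypothetical and only show the idea of
rejecting a URL by its path extension before anything is downloaded.

```python
import re
from urllib.parse import urlsplit

# Hypothetical names for illustration; this is not swish-e's API.
# Matches paths ending in .exe or .zip, ignoring case.
SKIP_PATTERN = re.compile(r"\.(?:exe|zip)$", re.IGNORECASE)


def should_fetch(url: str) -> bool:
    """Return False for URLs whose path ends in .exe or .zip."""
    # Test only the path, so a query string does not hide the extension.
    path = urlsplit(url).path
    return SKIP_PATTERN.search(path) is None
```

Testing the path rather than the whole URL means something like
/files/a.zip?session=1 is still skipped.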

> --robots.txt--
> User-agent: *
> Disallow: /downloads/cisco-vpn/*.exe$

That's not valid robots.txt syntax.  You can't use wildcard or regex
patterns; a Disallow value is matched as a plain path prefix.
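
To show why the quoted rule has no effect, here is a small Python sketch
of literal prefix matching as described in the original robots.txt
convention. It assumes the robots.txt parser in use follows that
convention (some newer parsers do accept * and $); the function name
is_disallowed is hypothetical.

```python
from typing import Sequence


def is_disallowed(path: str, disallow_rules: Sequence[str]) -> bool:
    """Literal prefix match of a URL path against Disallow values.

    An empty Disallow value blocks nothing, so it is skipped.
    """
    return any(bool(rule) and path.startswith(rule) for rule in disallow_rules)
```

Under this matching, "/downloads/cisco-vpn/*.exe$" is compared character
for character and never matches a real file path, while a plain prefix
such as "/downloads/cisco-vpn/" blocks everything under that directory.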

Bill Moseley

