On Wed, Sep 15, 2004 at 01:28:23PM -0700, Richard Morin wrote:
> I decided to spider my web pages, using the the method given
> in the "INSTALL - Swish-e Installation Instructions":
>
> # Example for spidering
> # Use the "spider.pl" program included with Swish-e
> IndexDir spider.pl
>
> # Define what site to index
> SwishProgParameters default http://...
>
> When I ran the program, I saw the following messages:
>
> rdm@flora02 $ swish-e -S prog -c swish2.conf
> Indexing Data Source: "External-Program"
> Indexing "spider.pl"
> External Program found: /u/gl/rdm/local/lib/swish-e/spider.pl
> No SWISH filters found
Hum, "No SWISH filters found" -- I wonder if that's because you don't
have catdoc or pdftotext or if the SWISH::Filters::* modules cannot be
found. My guess it's the first and I won't worry about it.
> /u/gl/rdm/local/lib/swish-e/spider.pl: Reading parameters from
> 'default'
> RobotRules: Unexpected line: Sutton
> ...
>
> Clearly, some program found "Sutton" somewhere and wasn't happy,
> but this isn't enough information to allow the user to debug
> anything. Could we have a more comprehensive error message?
No, sorry. That's not a message generated from any of the code we
control -- rather from (I suspect) the module that parsers the
robots.txt file. Do you have an invalid line in your robots.txt file
with the word "Sutton"?
--
Bill Moseley
moseley@hank.org
Unsubscribe from or help with the swish-e list:
http://swish-e.org/Discussion/
Help with Swish-e:
http://swish-e.org/current/docs
swish-e@sunsite.berkeley.edu
Received on Wed Sep 15 13:39:57 2004