On Tue, May 13, 2003 at 08:12:00AM -0700, Jody Cleveland wrote:
> Well, I've got someone who wants me to index:
> Which is on our test windows 2000 server. I run swish-e on a redhat 8 server
> and spider that location. When I do that, I get this message:
> ./spider.pl: Reading parameters from
> -- Starting to spider:
> http://18.104.22.168/www/keetra/wip/digitization/picbooks/current/pdfs/ --
> Summary for:
> Skipped: 1 (1.0/sec)
> Indexing Data Source: "External-Program"
> Indexing "stdin"
> Removing very common words...
> no words removed.
> Writing main index...
> err: No unique words indexed!
> So, since that didn't work, I had her copy all her files to
> http://22.214.171.124/picbooks and that works fine. Is swish-e only happy
> with one subdirectory, or is there a configuration somewhere I need to
Sorry, I don't really follow your question.
If you want to know why something is not sent to swish-e by the spider run
SPIDER_DEBUG=skipped swish-e -S prog ....
before running it and it will tell you why it was skipped.
Received on Tue May 13 16:56:24 2003