Skip to main content.
home | support | download

Back to List Archive

Re: Spider taking too long to index?

From: Bill Moseley <moseley(at)not-real.hank.org>
Date: Tue Oct 08 2002 - 23:26:50 GMT
At 02:46 PM 10/08/02 -0700, David VanHook wrote:
>
>OK, here's a question -- I've got to run this huge index twice: once for the
>regular index, and another time for the Fuzzy index.  Is there any way to
>only run the spider.pl part of it once, and then somehow get the results to
>both the non-fuzzy and the fuzzy SWISH indexing?  Since the work is really
>in the grabbing of all the HTML files, not in the indexing part.

How about

   ./spider.pl | gzip > docs.gz

   gzip -dc docs.gz | ./swish-e -c fuzzy -S prog -i stdin
   gzip -dc docs.gz | ./swish-e -c fuzzy -S prog -i stdin



-- 
Bill Moseley
mailto:moseley@hank.org
Received on Tue Oct 8 23:30:42 2002