Skip to main content.
home | support | download

Back to List Archive

Re: swish-e 2.1 hangs for a very long time

From: Michael <michael(at)not-real.insulin-pumpers.org>
Date: Sat Jul 13 2002 - 09:19:54 GMT
> 7-12 download of 2.1
> swish-e 2.1 hangs for a very long time
>

forgot to mention. The index is fine and search works on it without 
difficulty. I'd just like to eliminate the horrendous CPU load during 
the hang time.
 
> What did I do wrong?
> 
> here's the scenario
> 
> /usr/local/bin/swish-e \
>    -i http://members.aol.com/CamelsRFun \
>    -c swish-e/SPIDER.GENERIC.CONFIG \
>    -f swish-e/spider.CamelsRFun.index.tmp -v 3 -S http 
> Parsing config file 'swish-e/SPIDER.GENERIC.CONFIG' 
> Indexing Data Source: "HTTP-Crawler" 
> Indexing "http://members.aol.com/CamelsRFun" 
> retrieving http://members.aol.com/CamelsRFun (0)... 
> retrieving http://members.aol.com/CamelsRFun/ (0)...
> 
> Gets stuck here for maybe 5-10 minutes with 99% CPU usage
> but no packets are being sent/received via the network.
> It then moves on in what appears to be a normal fashion
> 
> Note run time:
> 
> Removing very common words...
>   Getting IgnoreLimit stopwords: Complete                          
> no words removed. Writing main index... Sorting words ... Sorting
> 3188 words alphabetically Writing header ... Writing index entries
> ...
>   Writing word text: Complete
>   Writing word hash: Complete
>   Writing word data: Complete
> 3188 unique words indexed.
> 7 properties sorted.                                              65
> files indexed.  321038 total bytes.  22202 total words. Elapsed
> time: 00:13:56 CPU time: 00:00:01 Indexing done!
> 
> 
> Config file....
> IndexDir http://www.insulin-pumpers.org
> IndexFile ./swish.index
> IndexName "Insulin Pumpers Mail Archive"
> IndexDescription "no other index was specified." 
> IndexPointer "www.insulin-pumpers.org"
> IndexAdmin "webmaster@insulin-pumpers.org"
> MetaNames author description datamodified
> IndexReport 3
> UseStemming yes
> PropertyNames author description datamodified
> IgnoreTotalWordCountWhenRanking yes
> MinWordLimit 4
> WordCharacters abcdefghijklmnopqrstuvwxyz0123456789.-_'"
> IgnoreLimit 80 1000
> IndexComments 0
> MaxDepth 4
> Delay 5
> TmpDir ./
> 
> Michael@Insulin-Pumpers.org
> 
Received on Sat Jul 13 09:23:25 2002