> 7-12 download of 2.1
> swish-e 2.1 hangs for a very long time
forgot to mention. The index is fine and search works on it without
difficulty. I'd just like to eliminate the horrendous CPU load during
the hang time.
> What did I do wrong?
> here's the scenario
> /usr/local/bin/swish-e \
> -i http://members.aol.com/CamelsRFun \
> -c swish-e/SPIDER.GENERIC.CONFIG \
> -f swish-e/spider.CamelsRFun.index.tmp -v 3 -S http
> Parsing config file 'swish-e/SPIDER.GENERIC.CONFIG'
> Indexing Data Source: "HTTP-Crawler"
> Indexing "http://members.aol.com/CamelsRFun"
> retrieving http://members.aol.com/CamelsRFun (0)...
> retrieving http://members.aol.com/CamelsRFun/ (0)...
> Gets stuck here for maybe 5-10 minutes with 99% CPU usage
> but no packets are being sent/received via the network.
> It then moves on in what appears to be a normal fashion
> Note run time:
> Removing very common words...
> Getting IgnoreLimit stopwords: Complete
> no words removed. Writing main index... Sorting words ... Sorting
> 3188 words alphabetically Writing header ... Writing index entries
> Writing word text: Complete
> Writing word hash: Complete
> Writing word data: Complete
> 3188 unique words indexed.
> 7 properties sorted. 65
> files indexed. 321038 total bytes. 22202 total words. Elapsed
> time: 00:13:56 CPU time: 00:00:01 Indexing done!
> Config file....
> IndexDir http://www.insulin-pumpers.org
> IndexFile ./swish.index
> IndexName "Insulin Pumpers Mail Archive"
> IndexDescription "no other index was specified."
> IndexPointer "www.insulin-pumpers.org"
> IndexAdmin "email@example.com"
> MetaNames author description datamodified
> IndexReport 3
> UseStemming yes
> PropertyNames author description datamodified
> IgnoreTotalWordCountWhenRanking yes
> MinWordLimit 4
> WordCharacters abcdefghijklmnopqrstuvwxyz0123456789.-_'"
> IgnoreLimit 80 1000
> IndexComments 0
> MaxDepth 4
> Delay 5
> TmpDir ./
Received on Sat Jul 13 09:23:25 2002