swish-e 2.1 hangs for a very long time

From: Michael <michael(at)not-real.insulin-pumpers.org>
Date: Sat Jul 13 2002 - 08:52:56 GMT
I'm using the 7-12 download of 2.1, and swish-e hangs for a very long time.

What did I do wrong?

Here's the scenario:

/usr/local/bin/swish-e \
   -i http://members.aol.com/CamelsRFun \
   -c swish-e/SPIDER.GENERIC.CONFIG \
   -f swish-e/spider.CamelsRFun.index.tmp -v 3 -S http 
Parsing config file 'swish-e/SPIDER.GENERIC.CONFIG' 
Indexing Data Source: "HTTP-Crawler" 
Indexing "http://members.aol.com/CamelsRFun" 
retrieving http://members.aol.com/CamelsRFun (0)... 
retrieving http://members.aol.com/CamelsRFun/ (0)...

It gets stuck here for maybe 5-10 minutes at 99% CPU usage,
but no packets are sent or received over the network.
It then moves on in what appears to be a normal fashion.

Note the run time:

Removing very common words...
  Getting IgnoreLimit stopwords: Complete
no words removed.
Writing main index...
Sorting words ...
Sorting 3188 words alphabetically
Writing header ...
Writing index entries ...
  Writing word text: Complete
  Writing word hash: Complete
  Writing word data: Complete
3188 unique words indexed.
7 properties sorted.
65 files indexed.  321038 total bytes.  22202 total words.
Elapsed time: 00:13:56 CPU time: 00:00:01
Indexing done!
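
To put the two figures in the summary side by side, here is a minimal Python sketch that converts the reported elapsed and CPU times to seconds. The helper names `to_seconds` and `parse_times` are made up for illustration and are not part of swish-e; the only input is the text of the summary itself.

```python
import re

# Summary text copied from the swish-e output above.
summary = "Elapsed time: 00:13:56 CPU time: 00:00:01"

def to_seconds(hms):
    """Convert an HH:MM:SS string to a number of seconds."""
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds

def parse_times(text):
    """Return (elapsed_seconds, cpu_seconds) from a swish-e summary line."""
    match = re.search(
        r"Elapsed\s+time:\s*(\d+:\d+:\d+)\s+CPU\s+time:\s*(\d+:\d+:\d+)", text
    )
    if match is None:
        raise ValueError("no elapsed/CPU time found")
    return to_seconds(match.group(1)), to_seconds(match.group(2))

elapsed, cpu = parse_times(summary)
print(f"elapsed: {elapsed} s, reported CPU: {cpu} s, difference: {elapsed - cpu} s")
```

The report accounts for only one second of CPU time out of almost fourteen minutes of elapsed time, so nearly all of the run was spent outside whatever that CPU figure measures.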


Config file:
IndexDir http://www.insulin-pumpers.org
IndexFile ./swish.index
IndexName "Insulin Pumpers Mail Archive"
IndexDescription "no other index was specified." 
IndexPointer "www.insulin-pumpers.org"
IndexAdmin "webmaster@insulin-pumpers.org"
MetaNames author description datamodified
IndexReport 3
UseStemming yes
PropertyNames author description datamodified
IgnoreTotalWordCountWhenRanking yes
MinWordLimit 4
WordCharacters abcdefghijklmnopqrstuvwxyz0123456789.-_'"
IgnoreLimit 80 1000
IndexComments 0
MaxDepth 4
Delay 5
TmpDir ./
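
As a rough cross-check of how much of the run time the `Delay 5` setting alone could explain, here is a small Python sketch. It rests on two assumptions that the output above does not confirm: that `Delay` is a wait in seconds applied once per request, and that the number of requests equals the 65 files indexed (the crawler may have made more). The function name `delay_budget` is hypothetical.

```python
# Values taken from the config file and the indexing summary above.
delay_seconds = 5                 # "Delay 5"
files_indexed = 65                # "65 files indexed."
elapsed_seconds = 13 * 60 + 56    # "Elapsed time: 00:13:56"

def delay_budget(delay, requests):
    """Total wait in seconds, ASSUMING one `delay`-second pause per request."""
    return delay * requests

waiting = delay_budget(delay_seconds, files_indexed)
remaining = elapsed_seconds - waiting

print(f"delay budget: {waiting} s of {elapsed_seconds} s elapsed")
print(f"not explained by Delay: {remaining} s")
```

Under those assumptions the delay accounts for 325 of the 836 seconds, leaving about 511 seconds (roughly eight and a half minutes) unexplained. That is an estimate only, not a diagnosis.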

Michael@Insulin-Pumpers.org
Received on Sat Jul 13 08:56:27 2002