Skip to main content.
home | support | download

Back to List Archive

Re: swish-e on a large scale

From: Peter Karman <karman(at)not-real.cray.com>
Date: Thu Sep 30 2004 - 18:40:21 GMT
Aaron Levitt wrote on 09/30/2004 01:17 PM:

> Last but not least... the results of the indexer's first run:
> 
> 475,944 unique words indexed.
> 5 properties sorted.
> 637,449 files indexed.  2,932,324,538 total bytes.  231,714,672 total 
> words.
> Elapsed time: 47:57:02 CPU time: 04:30:30
> Indexing done!
> 


One thing you might consider is judicious use of StopWords. That will 
help keep your indexes much smaller, though it won't necessarily speed 
up the indexing time.
-- 
Peter Karman - 651-605-9009 - karman@cray.com
Received on Thu Sep 30 11:40:35 2004