On Tue, 4 Feb 2003, Timothy Smith wrote:
> I did it again with -v3 and it has run all night, currently with a size 428M
> and has stopped at:
>
>
> Removing very common words...
> Getting IgnoreLimit stopwords: links
Sounds like you hit on a bug.
> Perhaps I should comment out having stopwords?
At least do not use IgnoreLimit. If I used stopwords at all it's
typically a very short list.
I also see that using IgnoreLimit is much slower:
With IgnoreLimit 20 250:
1347 files indexed. 25832099 total bytes. 2346049 total words.
Elapsed time: 00:01:11 CPU time: 00:01:11
Without:
1347 files indexed. 25832099 total bytes. 2346049 total words.
Elapsed time: 00:00:20 CPU time: 00:00:20
--
Bill Moseley moseley@hank.org
Received on Tue Feb 4 14:06:03 2003