You probably should take a look at Josh Rabinowitz's
article from USENIX this summer.
The paper is called
"IndexingWithSwishe-Rabinowitz.pdf" and is available
It discusses using a 3 GHz Pentium 4 with 2 GB of RAM to
index up to 5 GB of data with swish-e (and also with
MySQL). Tables of index sizes and graphs of response
times are included.
That paper should provide you with some data points to
Can't comment on Lucene.
--- Venkatraju <email@example.com> wrote:
> I am evaluating open source search engines for use in a project where
> the data to be indexed could get pretty big (a few GB of documents,
> each of about 10-20KB). I would like to hear the experiences of
> anybody who has used Swish-E in such a scenario. Any hints/tips or
> caveats to be aware of? What kind of search performance can I expect
> (given that I run the search on a recent machine with lots of RAM)?
> Also, has anyone compared Swish-E vs. Lucene in terms of scalability
> and performance?
> Thanks in advance,
Received on Mon Nov 15 06:48:05 2004