Peter Karman wrote on 17/10/2009 05:31 AM:
> I created a corpus of 1M tiny docs. Then I ran the query:
> % swish-e -w 'a or (not a)'
> which should return all the docs in the index:
> # SWISH format: 2.5.8
> # Search words: a or ( not a )
> # Removed stopwords:
> # Number of hits: 1000000
> # Search time: 0.410 seconds
> # Run time: 0.416 seconds
> so yes, I expect that the more hits you get, the longer it will take to
> over them all and calculate a rank for each.
I have found a solution for this, I have created a search result cache for
common queries if there are more results than 50000. In this cache results
are taken from the cache, cache is automatically deleted after reindexing
and rebuild the first time the query is made.
Thomas den Braber
Users mailing list
Received on Sat Oct 17 07:24:00 2009