I agree with Rainer's opinion about summaries.
Just another point of view: if the summary is stored with
the file path, all the file-related data is contiguous in the
index file, which should make retrieval faster (less I/O is expected).
If we use properties, we need at least one extra I/O operation
because the data is not contiguous.
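
To make the I/O argument concrete, here is a rough sketch in Python. It is not swish-e's real index format: the record layouts, the `BLOCK` size and all function names are made up for illustration, and one `read_at()` call stands in for one I/O operation. It also assumes a whole record fits in one block.

```python
import io
import struct

BLOCK = 512  # hypothetical read size; one read_at() call models one I/O operation


class CountingReader:
    """In-memory stand-in for an index file that counts positioned reads."""

    def __init__(self, data):
        self._buf = io.BytesIO(data)
        self.reads = 0

    def read_at(self, offset, size=BLOCK):
        self._buf.seek(offset)
        self.reads += 1
        return self._buf.read(size)


def build_contiguous(path, summary):
    # Layout (assumed): [u16 path_len][u16 summary_len][path][summary]
    p, s = path.encode(), summary.encode()
    return struct.pack("<HH", len(p), len(s)) + p + s


def read_contiguous(reader, offset=0):
    block = reader.read_at(offset)  # path and summary arrive in the same read
    plen, slen = struct.unpack_from("<HH", block, 0)
    path = block[4:4 + plen].decode()
    summary = block[4 + plen:4 + plen + slen].decode()
    return path, summary


def build_split(path, summary, gap=4096):
    # Layout (assumed): [u16 path_len][u32 summary_offset][u16 summary_len][path]
    # followed by unrelated bytes (the gap), then the summary stored elsewhere.
    p, s = path.encode(), summary.encode()
    header_size = struct.calcsize("<HIH")
    summary_offset = header_size + len(p) + gap
    record = struct.pack("<HIH", len(p), summary_offset, len(s)) + p
    return record + b"\0" * gap + s


def read_split(reader, offset=0):
    block = reader.read_at(offset)  # first read: the file record
    plen, soff, slen = struct.unpack_from("<HIH", block, 0)
    header_size = struct.calcsize("<HIH")
    path = block[header_size:header_size + plen].decode()
    summary = reader.read_at(soff, slen).decode()  # second read: the summary
    return path, summary
```

With these toy layouts, reading one file entry costs one read in the contiguous case and two in the split case, which is the extra I/O I mean above. A real index with caching or read-ahead may behave differently.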
BTW, this makes me wonder why swish-e uses just one single
index file. The only reason that comes to my mind is simplicity, but...
- The index file as a whole is limited to 2 GB (well, I know that our
sites are probably not like Google).
- Updating, inserting and deleting are really hard to do. They should be
easier with several files, e.g. one for the header and words, another
one for word data, another one for file data and another one for
What do you think?
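
A small sketch of where a 2 GB ceiling can come from. This assumes the limit is caused by file offsets being kept in signed 32-bit integers; I have not checked that this is the actual cause in swish-e, and the helper name is hypothetical.

```python
import struct

# Largest value a signed 32-bit integer can hold: 2**31 - 1 bytes, just under 2 GiB.
MAX_OFFSET = 2**31 - 1


def fits_in_signed_32(offset):
    """Return True if the offset can be stored as a signed 32-bit integer."""
    try:
        struct.pack("<i", offset)  # "<i" is a little-endian signed 32-bit int
        return True
    except struct.error:
        return False
```

Under that assumption, `fits_in_signed_32(MAX_OFFSET)` is true and `fits_in_signed_32(MAX_OFFSET + 1)` is false. With several files, each file would have its own offset space, so the ceiling would apply per file rather than to the whole index.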
Received on Wed Nov 15 18:43:45 2000