Skip to main content.
home | support | download

Back to List Archive

IndexContents and StoreDescription for doc PDF PPT XLS files

From: David Larkin <david.larkin(at)not-real.djl.co.uk>
Date: Fri Dec 02 2005 - 21:31:31 GMT
I've got swish-e to index a directory with mixed content (HTML DOC PDF XLS PPT files) and swish.cgi produces half sensible output.

At first it gave "(null)" where i'd expect to see the context of the string i was searching for.

So, i added 

StoreDescription HTML* <body> 20000
StoreDescription TXT* 20000
StoreDescription XML* <desc> 20000

and the "(null)" dissapeared , but still no context 

so i added

IndexContents HTML* .htm .html .shtml
IndexContents TXT* .txt .log .text
IndexContents XML* .xml

and now i get the context i expect for HTM files.

Can i get it to work for other filetypes ?

The documentation suggests HTML,TXT,XML are only legal arguments to StoreDescription.

Regards
David
Received on Fri Dec 2 13:31:32 2005