I've got swish-e to index a directory with mixed content (HTML DOC PDF XLS PPT files) and swish.cgi produces half sensible output.
At first it gave "(null)" where i'd expect to see the context of the string i was searching for.
So, i added
StoreDescription HTML* <body> 20000
StoreDescription TXT* 20000
StoreDescription XML* <desc> 20000
and the "(null)" dissapeared , but still no context
so i added
IndexContents HTML* .htm .html .shtml
IndexContents TXT* .txt .log .text
IndexContents XML* .xml
and now i get the context i expect for HTM files.
Can i get it to work for other filetypes ?
The documentation suggests HTML,TXT,XML are only legal arguments to StoreDescription.
Received on Fri Dec 2 13:31:32 2005