Skip to main content.
home | support | download

Back to List Archive

Re: IndexContents and StoreDescription for doc PDF PPT XLS files

From: David Larkin <david.larkin(at)not-real.djl.co.uk>
Date: Fri Dec 02 2005 - 21:45:17 GMT
More by luck than judgement I found that the following works, although i don't claim to understand why.

IndexContents HTML* .htm .html .shtml .pdf .doc .ppt .xls
StoreDescription HTML* <body> 20000

David


> I've got swish-e to index a directory with mixed content (HTML DOC PDF XLS PPT files) and swish.cgi produces half sensible output.
> 
> At first it gave "(null)" where i'd expect to see the context of the string i was searching for.
> 
> So, i added 
> 
> StoreDescription HTML* <body> 20000
> StoreDescription TXT* 20000
> StoreDescription XML* <desc> 20000
> 
> and the "(null)" dissapeared , but still no context 
> 
> so i added
> 
> IndexContents HTML* .htm .html .shtml
> IndexContents TXT* .txt .log .text
> IndexContents XML* .xml
> 
> and now i get the context i expect for HTM files.
> 
> Can i get it to work for other filetypes ?
> 
> The documentation suggests HTML,TXT,XML are only legal arguments to StoreDescription.
> 
> Regards
> David
Received on Fri Dec 2 13:45:17 2005