Re: [swish-e] Seeding a swish-e index

From: Sean <schliden(at)>
Date: Mon Sep 08 2008 - 01:48:04 GMT
> Swish-e doesn't index Word or PDF documents in their original formats. They must
> be converted to html, text, or xml. That's what SWISH::Filter does, for example.

Yes, of course. In my case, I'm using catdoc.exe and pdftotext.exe.

> So the same ranking strategy applies. SWISH::Filter (for example) assigns
> <title> and <meta> where it can for things like PDF and MS docs.

I guess my question should have been: where and how are <title> and <meta> assigned?
That is probably beyond the scope of this forum, though, when the converters are catdoc and pdftotext.
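
For what it's worth, here is a rough sketch of one way the assignment could be done by hand when the converter only emits plain text. It is not what SWISH::Filter does internally; it assumes the text from catdoc or pdftotext has already been captured as a string, and the helper names (guess_title, wrap_as_html, prog_record) are made up for illustration. The Path-Name and Content-Length headers follow the -S prog input format as I understand it from the Swish-e docs, and any meta names used would presumably also need to be listed under MetaNames in the config.

```python
import html


def guess_title(text, fallback):
    """Use the first non-blank line of the extracted text as a title guess."""
    for line in text.splitlines():
        if line.strip():
            return line.strip()
    return fallback  # e.g. the file name, when the text is empty


def wrap_as_html(text, title, meta=None):
    """Wrap plain text in minimal HTML carrying <title> and <meta> tags."""
    meta = meta or {}
    meta_tags = "".join(
        '<meta name="{}" content="{}">\n'.format(
            html.escape(name, quote=True), html.escape(value, quote=True)
        )
        for name, value in sorted(meta.items())
    )
    return (
        "<html><head>\n"
        "<title>{}</title>\n"
        "{}"
        "</head><body>\n<pre>{}</pre>\n</body></html>\n"
    ).format(html.escape(title), meta_tags, html.escape(text))


def prog_record(path, doc):
    """Prefix a document with headers for a -S prog style feed (assumed format)."""
    body = doc.encode("utf-8")
    # Content-Length is counted in bytes, so encode before measuring.
    header = "Path-Name: {}\nContent-Length: {}\n\n".format(path, len(body))
    return header.encode("utf-8") + body
```

A wrapper script would run the converter on each file, pass the output through guess_title and wrap_as_html, and write prog_record(...) to stdout for swish-e to read.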

Thanks again Peter.

Received on Sun Sep 7 21:48:05 2008