> Swish-e doesn't index Word or PDF documents in their original formats. They must
> be converted to html, text, or xml. That's what SWISH::Filter does, for example.
Yes of course. In my case, using catdoc.exe and pdftotext.exe
>
> So the same ranking strategy applies. SWISH::Filter (for example) assigns
> <title> and <meta> where it can for things like PDF and MS docs.
I guess my question should have been more along the lines of where and how are <title> and <meta>
assigned, but that is probably beyond the scope of this forum when talking about catdoc and pdftotext.
Thanks again Peter.
_______________________________________________
Users mailing list
Users@lists.swish-e.org
http://lists.swish-e.org/listinfo/users
Received on Sun Sep 7 21:48:05 2008