Re: Parsing doc, xls and excel files with swish-e and libxml2

From: Animesh Bansriyar <animesh(at)not-real.arithme.net>
Date: Tue Jun 28 2005 - 07:28:24 GMT

Dave,

David L Norris <dave@webaugur.com> wrote:
> "Native" filters are installed by the Swish-e Windows installer for Word
> (catdoc) and PDF (pdftotext) documents.  You can use catdoc, wvware,
> xpdf, or any other program that converts a document to Text, HTML, or
> XML with a FileFilter directive during indexing:
>   http://swish-e.org/docs/swish-config.html#item_filefilter
> 

By a native filter I meant a library tightly coupled with swish-e itself, so
that swish-e could call the library's functions to do all the needed parsing,
rather than a third-party filter.

Thanks for the detailed explanation. It solves a lot of problems for me.

Regards,
Animesh
Received on Tue Jun 28 00:28:26 2005