Skip to main content.
home | support | download

Back to List Archive

Re: wvare DOC -> HTML filter

From: Roman Chyla <r.ca(at)not-real.post.cz>
Date: Mon May 24 2004 - 07:06:18 GMT
I was using wvware with Greenstone digital library for more than
6 months It is a very good filter - but not for fast indexing (in
my opinion, terribly slow)

I think that would be good for swish-e to have them both - people
may choose what suit their needs

roman


----- PŮVODNÍ ZPRÁVA -----
Od: "David L Norris" <dave@webaugur.com>
Komu: "Multiple recipients of list"
<swish-e@sunsite.berkeley.edu> Předmět: [SWISH-E] Re: wvare DOC
-> HTML
Datum: 23.5.2004 - 16:52:06

> On Sun, 2004-05-23 at 00:15, Bill Moseley
> wrote:
> > > package SWISH::Filters::Doc2html;
> > Did you compare this with how well (or
> > not well) catdoc does?
> > When would someone want this over catdoc?
> 
> I think one would almost always want Wv
> over catdoc.
> 
> catdoc doesn't handle embedded objects well
> or at all.  On complex
> documents large portions of text are
> ignored by catdoc.  URLs, indices,
> text frames, etc.
> 
> --=20
> David Norris
> http://www.webaugur.com/dave/
> ICQ - 412039
> 
> 
> 
>
*********************************************************************
> Due to deletion of content types excluded
> from this list by policy,
> this multipart message was reduced to a
> single part, and from there
> to a plain text message.
>
*********************************************************************
> 
Received on Mon May 24 00:06:22 2004