Really? So you can just use the regular XML parser? That's cool.
Note that I found other odt filters that don't require openoffice:
The first one is a perl script and it seems to work. Haven't tried the other one. However, since the regular XML parser will do the trick, they may not be necessary.
On Tue, Oct 14, 2008 at 09:47:26AM +0300, Lars Noodén wrote:
> > You could try unzipping the odt and then indexing the content.xml.
> That should also work
> > Not sure how well libxml2 will handle Openoffice's xml.
> It's regular XML, specifically it's OpenDocument Format.
> FWIW, I started on a filter for ODF a while back but got swamped before
> I could polish it. You are welcome to use whatever is useful:
> (Note: I was also aiming to index the metadata and, if they exist,
> certain pre-defined custom fields.)
> The OpenDocument Fellowship(*) has sponsored a number of ODF conversion
> tools. I'm not sure if the viewer is useful in this case:
> but the other tools might be
> (*) Member
> Users mailing list
Users mailing list
Received on Tue Oct 14 13:21:42 2008