RE: Parsing Excel (was: last modified date in swish-e index file)

From: <Rainer.Scherg(at)not-real.rexroth.de>
Date: Fri Jun 01 2001 - 14:49:04 GMT
> -----Original Message-----
> From: Bill Moseley [mailto:moseley@hank.org]
> Sent: Friday, June 01, 2001 4:07 PM
> To: Multiple recipients of list
> Subject: [SWISH-E] Parsing Excel (was: last modified date in swish-e
> index file)
> 
> 
> At 03:38 AM 06/01/01 -0700, Rainer.Scherg@rexroth.de wrote:
> >> If you are indexing .doc (Word) files, then there's the 
> >> catdoc program to
> >> extract out the text.  I believe I saw a utility to extract 
> >> out xls files, but I don't remember anything specific.  
> 
> I haven't tried either of these, but there are two Excel 
> modules on CPAN.
> 
> Spreadsheet::ParseExcel claims to extract the data from Excel docs.
> 
> "XML::Excel provides functions to easily transform Excel 
> documents into XML."
> But XML::Excel uses Spreadsheet::ParseExcel.
> 
> I'm curious.  What kind of content are you searching for in 
> your spreadsheets?

Normally just the content of the XLS cells.

Known problems with XLS filters:
  - XLS format versions(!), which differ by Office version.
  - Multi-sheet XLS files.
  - Macro/VBScript-enabled sheets.

BTW: some XLS filters are just CSV tools. 8-/
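
For illustration only, here is a rough sketch in Python of what such a filter has to hand to the indexer: the cell text of every sheet, not just the first one. It does not read XLS at all; it assumes the sheets have already been extracted (for example, exported as CSV), and the names sheets_to_text and csv_to_rows are made up for this example.

```python
import csv
import io


def csv_to_rows(csv_text):
    """Parse one sheet that was exported as CSV into a list of rows."""
    return list(csv.reader(io.StringIO(csv_text)))


def sheets_to_text(sheets):
    """Flatten {sheet_name: rows} into plain text for an indexer.

    Hypothetical helper: emits a marker line per sheet, then one line
    per non-empty row, with empty cells dropped.
    """
    lines = []
    for name, rows in sheets.items():
        lines.append(f"[sheet: {name}]")
        for row in rows:
            # Keep only cells that contain some text.
            cells = [str(c).strip() for c in row
                     if c is not None and str(c).strip()]
            if cells:
                lines.append(" ".join(cells))
    return "\n".join(lines)


if __name__ == "__main__":
    workbook = {
        "Q1": csv_to_rows("part,qty\nvalve,3\n"),
        "Q2": [["pump", None, 7], ["", None]],
    }
    print(sheets_to_text(workbook))
```

A filter that only handles the CSV case would stop after the first sheet; looping over all sheets is what covers the multi-sheet problem above. Format versions and macros are outside what this sketch handles.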


cu - rainer
 


Received on Fri Jun 1 14:52:51 2001