Hmm, indexing PDF files works fine for us (as I said: several thousand PDF
docs).
But I'm using the filesystem index mode. The spidering mode has not been
tested (which is why it is still beta). I would like to have some
feedback on this, even though the code change is the same as for the
filesystem index feature...
What does not work (AFAIK) is extracting links from PDF files to HTML pages.
For this, you need a good filter which converts PDF to HTML instead
of plain text...
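
To illustrate the idea: once a filter has turned a PDF into HTML, collecting the links is the easy part. Below is a minimal sketch in Python, assuming the converter writes ordinary <a href="..."> anchors. It is not part of SWISH-E; the names LinkCollector and extract_links and the sample markup are made up for the example.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects href values from <a> tags (hypothetical helper, not SWISH-E code)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser passes tag and attribute names in lower case.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html_text):
    """Return all link targets found in the converted HTML, in document order."""
    collector = LinkCollector()
    collector.feed(html_text)
    collector.close()
    return collector.links


# Assumed output of a PDF-to-HTML filter; real converters may differ.
sample = (
    '<html><body><p>See <a href="http://www.jalgi.com">the site</a> and '
    '<a href="docs/manual.html">the manual</a>.</p></body></html>'
)
```

Calling extract_links(sample) gives the link targets in the order they appear, which could then be handed to the spider as further documents to fetch. With a filter that emits only plain text, the anchors are gone and there is nothing left to collect.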
cu Rainer
-----Original Message-----
From: Ibon Aizpurua
Sent: Thursday, July 22, 1999 9:08 AM
To: Multiple recipients of list
Subject: [SWISH-E] indexin PDF files
Hi,
I'm trying to index the PDF files we have on the server.
As you know, PDF files can contain links just like HTML
files. Is it possible to extract those links so the linked files can be
indexed later? If not, any ideas on how to develop this?
Another problem is that I have downloaded the SWISH-E version
enhanced with filtering capabilities, and it can't index PDF files.
Rainer?
Ibon
http://www.jalgi.com
Received on Thu Jul 22 07:22:14 1999