Skip to main content.
home | support | download

Back to List Archive

spidering PDF files

From: Richard Morin <rdm(at)not-real.slac.stanford.edu>
Date: Tue Sep 21 2004 - 21:11:31 GMT
I have wandered through several Swish-e documents, trying to
figure out how to spider PDF files.  AFAICT, the current plan
involves adding a line to spider.config:

   filter_content  => \&filter_content,

but I suspect that more than this is required.  Help?

-r

P.S.  I already have spider.pl working, using a moderately
       customized spider.config file; I just want to get it
       to filter and index PDF files.
Received on Tue Sep 21 14:12:20 2004