Thank you so much for your help.
Sorry about my_pdf2html.pl, it is the same as _pdf2html.pl. The only difference was I dumped the converted html into a file to see its meta contents.
I made the changes to swish.conf as you said. After that the output screen had one more line to show some information, but the result is still not as good as I expected. (Please see the attached two file).
I appreciate for any more helps,
--- On Thu, 10/15/09, Peter Karman <email@example.com> wrote:
> From: Peter Karman <firstname.lastname@example.org>
> Subject: Re: [swish-e] How swish-e returns PDF's meta description
> To: "Swish-e Users Discussion List" <email@example.com>
> Date: Thursday, October 15, 2009, 12:46 AM
> Daqi Li wrote on 10/14/09 2:54 PM:
> > Hi,
> > I have swish-e-2.4.7 on Linux Fedora core 8 (see below
> uname -a).
> > I have PDF documents that have the summaries in their
> meta description (or keywords). When I do searches, if the
> keyword is found in a pdf body or title, I need swish-e
> returns its size, last modify date, etc. as well as the meta
> description (or keywords). Here are the things I did:
> > 1. I copied your swish.cgi to /var/www/cgi-bin.
> > 2. created .swishcgi.conf in /var/www/cgi-bin (as the
> > 3. Created swish.conf in /var/www/cgi-bin (as the
> > 3. Ran the command to index the files:
> > swish-e -c swish.conf
> > 4. Then browsed to the URL http://localhost/cgi-bin/swish.cgi.
> > Here is The search result I got:
> > 1 09-71298_Spina_mem_op-signed.pdf -- rank: 1000
> > Title: 09-71298_Spina_mem_op-signed.pdf
> > Last Modified Date: 2009-10-14 12:44:14 EDT
> > Document Size: 127153
> > Description: (null)
> > Keywords:
> try these changes in your swish.conf:
> # don't know what my_pdf2html.pl looks like, but
> # does the trick
> FileFilter .pdf /your/path/to/swish-filter-test '-headers
> -content %p'
> # add the missing * after the parser type
> StoreDescription HTML* <meta> 1000
> StoreDescription TXT* 1000
> Peter Karman . http://peknet.com/ . peter(at)not-real.peknet.com
> Users mailing list
Users mailing list
Received on Thu Oct 15 09:19:21 2009