I changed 'description' to 'swishdescription' in the .swishcgi.conf file and got the return data for swishdescription as following:
Description: Path-Name: /var/www/html/nyeb/opinions/ast_bu/09-71298_Spina_mem_op-signed.pdf Content-Length: 22353 ...
I think this long line contains much more data than I saw. I need the contents of 'keywords' in it. Is there a way to retrieve only the keywords?
Another question, is there a way to run a command line to return this data, so that I can see what is in swishdescription? Something like:
/usr/local/bin/swish-e -w "$keyphrase" -f $index -x ... (or other option)?
Thanks in advance,
--- On Thu, 10/15/09, Peter Karman <email@example.com> wrote:
> From: Peter Karman <firstname.lastname@example.org>
> Subject: Re: [swish-e] How swish-e returns PDF's meta description
> To: "Swish-e Users Discussion List" <email@example.com>
> Date: Thursday, October 15, 2009, 9:45 AM
> Daqi Li wrote on 10/15/09 8:19 AM:
> > Hi Peter,
> > Thank you so much for your help.
> > Sorry about my_pdf2html.pl, it is the same as
> _pdf2html.pl. The only difference was I dumped the converted
> html into a file to see its meta contents.
> > I made the changes to swish.conf as you said. After
> that the output screen had one more line to show some
> information, but the result is still not as good as I
> expected. (Please see the attached two file).
> > I appreciate for any more helps,
> in your swishcgi.conf file, try changing instances of
> 'description' to
> Peter Karman . http://peknet.com/ . peter(at)not-real.peknet.com
> Users mailing list
Users mailing list
Received on Thu Oct 15 13:36:56 2009