As you said, I used this command line to get swishdescription:
swish-e -w "09" -f /var/www/ast.swish-e -x "%d\n"
and then use perl to get the content of the meta name="keywords" from it. It seems to work fine to me (see the attached screenshot), except the double quote was changed to weird chars. Is there any way to fix it?
Your this line helped me much:
FileFilter .pdf /usr/local/bin/swish-filter-test '-headers -content %p'
What does swish-filter-test and '-headers -content %p' do?
--- On Thu, 10/15/09, Peter Karman <firstname.lastname@example.org> wrote:
> From: Peter Karman <email@example.com>
> Subject: Re: [swish-e] How swish-e returns PDF's meta description
> To: "Swish-e Users Discussion List" <firstname.lastname@example.org>
> Date: Thursday, October 15, 2009, 1:52 PM
> Daqi Li wrote on 10/15/09 12:36 PM:
> > I changed 'description' to 'swishdescription' in the
> .swishcgi.conf file and got the return data for
> swishdescription as following:
> > Description: Path-Name:
> Content-Length: 22353 ...
> > I think this long line contains much more data than I
> saw. I need the contents of 'keywords' in it. Is there a way
> to retrieve only the keywords?
> if you just want the keywords, then take 'swishdescription'
> out of the list of
> properties to return and just list 'keywords'.
> > Another question, is there a way to run a command line
> to return this data, so that I can see what is in
> swishdescription? Something like:
> > /usr/local/bin/swish-e -w "$keyphrase" -f $index -x
> ... (or other option)?
> yes, the -x option is what you want. try it and see.
> Peter Karman . http://peknet.com/ . peter(at)not-real.peknet.com
> Users mailing list
Users mailing list
Received on Thu Oct 15 17:07:23 2009