I tried your suggestion. It works!
----- Original Message -----
From: "Bill Moseley" <firstname.lastname@example.org>
To: "Swish-e Users Discussion List" <email@example.com>
Sent: Wednesday, July 02, 2008 4:59 PM
Subject: Re: [swish-e] problem indexing pdf
> On Wed, Jul 02, 2008 at 12:42:39PM +0200, Manasa Kandula wrote:
>> The PDF file on the website has been successfully converted to HTML.
>> But once I index the output of the spider
>> (swish-e -f index.swish-e -c swish.config -S prog -i stdin < output1.txt),
>> the parts whose pathnames end with the pdf extension do not get indexed
>> (in this example, it is the entire document that doesn't get indexed).
> What happens if you don't specify the config file?
> swish-e -S prog -i stdin < output1.txt
> Bill Moseley
Received on Fri Jul 4 05:00:30 2008