Hey Bill,
I tried your suggestion. It works!
Thanks
----- Original Message -----
From: "Bill Moseley" <moseley@hank.org>
To: "Swish-e Users Discussion List" <users@lists.swish-e.org>
Sent: Wednesday, July 02, 2008 4:59 PM
Subject: Re: [swish-e] problem indexing pdf
> On Wed, Jul 02, 2008 at 12:42:39PM +0200, Manasa Kandula wrote:
>> The pdf file on the website has been successfully converted to html
>> format.
>> But once I index the output of the spider
>> (swish-e -f index.swish-e -c swish.config -S prog -i stdin < output1.txt),
>> the part whose pathname ends with the pdf extension does not get indexed
>> (in this example it is the entire document that doesn't get indexed).
>
> What happens if you don't specify the config file?
>
> swish-e -S prog -i stdin < output1.txt
>
>
> --
> Bill Moseley
> moseley@hank.org
>
> Unsubscribe from or help with the swish-e list:
> http://swish-e.org/Discussion/
>
> Help with Swish-e:
> http://swish-e.org/current/docs
>
> _______________________________________________
> Users mailing list
> Users@lists.swish-e.org
> http://lists.swish-e.org/listinfo/users
>
Received on Fri Jul 4 05:00:30 2008