Skip to main content.
home | support | download

Back to List Archive

Re: Geting "status: 500" while indexing some pages

From: Bill Moseley <moseley(at)not-real.hank.org>
Date: Wed Feb 02 2005 - 20:25:27 GMT
On Wed, Feb 02, 2005 at 03:20:11PM -0500, Juan Carlos Avila / MTBASE wrote:
> That's one of the strange issues: my web server reports:
> 
> 192.9.202.1 - - [02/Feb/2005:15:15:43 -0500] "GET 
> /casos/VerCasoIdx?caso_numero=6896 HTTP/1.1" *200 *15816

If you set base_url to 

   base_url => 'http://your_server/casos/VerCasoIdx?caso_numero=6896',

and then set max_files to 1 or some other way to only process one file
then run the spider with debugging then you are saying that the web
server's log shows 200 and the spider shows 500?

Try running the spider with:

   SPIDER_DEBUG=headers ./spider.pl spider.conf

(if you have a shell that allows that -- otherwise set debug => 'headers'
in your spider config).

Then look at the actual response back from the server.

-- 
Bill Moseley
moseley@hank.org

Unsubscribe from or help with the swish-e list: 
   http://swish-e.org/Discussion/

Help with Swish-e:
   http://swish-e.org/current/docs
   swish-e@sunsite.berkeley.edu
Received on Wed Feb 2 12:25:28 2005