Re: Possible bug in spider.pl

From: Bill Moseley <moseley(at)not-real.hank.org>
Date: Thu Feb 28 2002 - 18:15:57 GMT
At 10:05 AM 02/28/02 -0800, Adam Edelman wrote:
>I've been using spider.pl from swish-e-2.1-dev-25-2002-02-12 and noticed
>that the subroutine I defined under filter_content was never getting called.

Thanks for the spot - and sorry for the trouble.

http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/swishe/swish-e/prog-bin/spider.pl.diff?r1=1.33&r2=1.34

The comment for that change is "Fix filter_response bug I created yesterday". Geez.



>Taking a look at the spider code I found the following:
>
>    } else {
>        return $links_extracted unless check_user_function( 'content', $uri, $server, $response, \$content );
>
>I changed this to:
>
>    } else {
>        return $links_extracted unless check_user_function( 'filter_content', $uri, $server, $response, \$content );
>
>and now the subroutine is called.  Incidentally, I found the same problem in
>swish-e-2.1-dev-25-2002-02-26.  Thanks.
>
>Adam Edelman
>
>
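
For anyone reading this in the archive, here is a minimal sketch of why the wrong key name made the callback silently disappear. It is not the actual spider.pl code: the check_user_function() below is a simplified, hypothetical stand-in that assumes callbacks are looked up by key name in the server config hash, and the URL and content are made up.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical, simplified stand-in for spider.pl's check_user_function():
# look up a callback in the server config by key name and call it.
# If nothing is registered under that key, return true (nothing to veto).
sub check_user_function {
    my ( $name, $uri, $server, @args ) = @_;
    my $callback = $server->{$name} or return 1;
    return $callback->( $uri, $server, @args );
}

my $calls = 0;

# Server config as a user might write it; only the key name matters here.
my $server = {
    filter_content => sub {
        my ( $uri, $server, $response, $content_ref ) = @_;
        $calls++;
        return 1;    # true means "keep processing this document"
    },
};

my $content = '<html>example</html>';

# Wrong key: there is no 'content' entry, so the callback is skipped.
check_user_function( 'content', 'http://localhost/', $server, undef, \$content );
print "after 'content' lookup: $calls call(s)\n";

# Right key: the user's callback is found and invoked.
check_user_function( 'filter_content', 'http://localhost/', $server, undef, \$content );
print "after 'filter_content' lookup: $calls call(s)\n";
```

In this sketch the first lookup leaves the counter untouched and only the second one reaches the user's subroutine, which matches what Adam saw before and after his change.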
-- 
Bill Moseley
mailto:moseley@hank.org
Received on Thu Feb 28 18:18:21 2002