Skip to main content.
home | support | download

Back to List Archive

Re: [swish-e] Version 2.4.5 Error

From: Bill Moseley <moseley(at)not-real.hank.org>
Date: Wed Sep 12 2007 - 20:44:11 GMT
On Wed, Sep 12, 2007 at 03:40:04PM -0500, Peter Karman wrote:
> So likely there's an issue with spider.pl and how it is calculating length()
> for docs with unreliable encodings. That's my guess anyway. spider.pl could
> probably be made smarter about sanity checking the docs for length and
> encoding, and made to fail gracefully somehow. I know there's been talk here
> lately about some of the encoding stuff it does.

The spider just needs to *always* decode on input, then encode back to
the original charset, and then use length() to report the length.
That seems like the most simple and correct way to go.  Seems right to
you, Peter?

-- 
Bill Moseley
moseley@hank.org

Unsubscribe from or help with the swish-e list: 
   http://swish-e.org/Discussion/

Help with Swish-e:
   http://swish-e.org/current/docs

_______________________________________________
Users mailing list
Users@lists.swish-e.org
http://lists.swish-e.org/listinfo/users
Received on Wed Sep 12 16:44:11 2007