Skip to main content.
home | support | download

Back to List Archive

RE: Indexing/Spider problem found and fixed

From: Smith, Doug <Doug.Smith(at)not-real.HaverstickConsulting.com>
Date: Mon Jan 27 2003 - 00:06:58 GMT
-----Original Message-----
From: Bill Moseley [mailto:moseley@hank.org]
Sent: Sunday, January 26, 2003 10:58 AM
To: Smith, Doug
Cc: Multiple recipients of list
Subject: Re: [SWISH-E] Indexing/Spider problem found and fixed

> So what was happening?  When you say "fail" did Perl give an error
> message?

Hi Bill,

Yes, I'm sorry, I left that part out.  Here is the command and the following error message:

--------------
[root@cincyweb search]# perl pdf2html.pm cpd_party.pdf 'title' > cpd_party.html
Malformed UTF-8 character (unexpected continuation byte 0xad, with no preceding start byte) in transliteration (tr///) at pdf2html.pm line 201, <GEN1> chunk 1.
--------------

This happens when LANG=en_US.UTF-8.  If LANG=en_US, the error goes away, and the spider completes perfectly.

Thanks,

Doug
Received on Mon Jan 27 00:07:16 2003