On Thu, Jul 29, 2004 at 10:14:43AM +0100, Jonas Wolf wrote:
> I did some more testing, and indeed swish-e is doing everything correctly.
> The XML parser recognises @ sequences fine, but breaks down on
> characters below 32, such as , which is also correct behaviour. (As a
> side note, this never generates an error message, it just stops indexing
> the document at that point - Can you force error messages?).
Try setting ParserWarnLevel in the config file.
I'm not 100% sure of the behavior, but I think libxml2 will just abort
processing.
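
Something like the following in the swish-e config file should make the
libxml2 parser report what it finds. This is a sketch from memory: the
value 9 (report everything) is my assumption about the most verbose
setting, so check the SWISH-CONFIG docs for the levels your version
accepts.

```
# swish-e config file fragment (sketch, not a complete config)
# Ask the libxml2-based parser to report errors and warnings
# instead of silently stopping; 9 is assumed to mean "everything".
ParserWarnLevel 9
```
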
> The problem seems to be HTML::Entities::encode_entities, which
> generates these invalid character sequences.
Can you post a complete example?
--
Bill Moseley
moseley@hank.org
Received on Thu Jul 29 11:49:18 2004