Skip to main content.
home | support | download

Back to List Archive

Re: Fw: Re: 8-bit chars

From: John Angel <angel_john(at)not-real.hotmail.com>
Date: Sun Dec 14 2003 - 18:55:13 GMT
> There is NO WAY to store more than one encoding in the index as it is
> currently designed.
>
> And that's exactly what you are asking to do.  You want to have libxml2
> convert the document back to it's original encoding when storing the
> words in the index -- "as-is" -- and that's trying to store more than
> one encoding in the index at the same time.


Yes, that is exactly what I am asking to do.

Forget about encodings, you won't see the wider picture.

Think how can we index documents presented in 3 different languages (without
utf-8 support)? This is the only solution, and it works.
Received on Sun Dec 14 18:55:22 2003