Skip to main content.
home | support | download

Back to List Archive

Re: Fw: Re: 8-bit chars

From: Frances Coakley <frances(at)>
Date: Sun Dec 14 2003 - 19:15:06 GMT
> > There is NO WAY to store more than one encoding in the index as it is
> > currently designed.

Doesnt the meta charset give you the coding used in the original document - 
assuming that the 8bit chars are the more unusual chars then it is possible 
that a word in Icelandic charset maps onto the same sequence of 8 bit chars 
as would a different word in the Norse charset.  But if the searcher is 
viewing with the charset Icelandic set then searching for Meta 
Charset=Icelandic and word=whatever will find the Icelandic word.  Those 
pages not encoded under the Icelandic charset cannot by definition contain 
that char.
Or have I misunderstood the problem ?
Frances Coakley - website
Received on Sun Dec 14 19:15:12 2003