Skip to main content.
home | support | download

Back to List Archive

Re: Indexing differs for 2 lines swapped in file

From: <moseley(at)not-real.hank.org>
Date: Sun Oct 26 2003 - 12:43:44 GMT
On Sun, Oct 26, 2003 at 04:34:26AM -0800, Dominique Phommahaxay wrote:

> 1. Test 1
> =========
> When record number 15650 containing J2Ee is at the end of the file
> C:\private\work\jway\BTTITLE01312003\BTTITLE01312003-1.CSV, the
> redirected output command into a file:
> 
> C:\private\work\jway>swish-e -i BTTITLE01312003\BTTITLE01312003-1.CSV -T indexed_words > swish_indexed_words-J2Ee-not-found.out
> 
> does not contain J2Ee.

I couldn't really tell from the diff output below, but if you look at 
that output can you see some record where the data just stops being 
processed?

> The search does not work. The zipped (Winzip) file for
> BTTITLE01312003-1.CSV is a 1,755,903 bytes, which is I think the
> smallest file size I managed to create (from the original 54Mb zipped
> file) and find the limit between when the search for J2Ee is
> successful or not based the its position in the file. Please let me
> know if I can make it available for testing to you only (and not the
> entire forum) using moseley@hank.org as the receipient.

Sure, that's fine

-- 
Bill Moseley
moseley@hank.org
Received on Sun Oct 26 12:55:57 2003