Skip to main content.
home | support | download

Back to List Archive

Problem swish-e not finding words present in index

From: John P. Rouillard <rouilj(at)not-real.cs.umb.edu>
Date: Mon Sep 01 2003 - 20:37:08 GMT
Hi all:
This occurs with both a 2.1dev25 and a 2.4pr1 release.

Running:
/tools/swish_e-2.1dev25/bin/swish-e -w guest -TINDEX_WORDS_FULL -f hypermail.idx | less

I find:

guest
 Meta:10 http://XXXXX/mailing-lists/ZZZZZ/0016.html Freq:4 Pos/Struct
:138/9,172/9,201/9,211/9
 Meta:10 http://XXXXX/mailing-lists/YYYYYY/0082.html Freq:1 Pos/Struct
:122/9
 Meta:10 http://XXXXX/mailing-lists/YYYYYY/0113.html Freq:1 Pos/Struct
:56/9
 Meta:10 http://XXXXX/mailing-lists/YYYYYY/0121.html Freq:1 Pos/Struct
:59/9
 Meta:10 http://XXXXX/mailing-lists/YYYYYY/0162.html Freq:1 Pos/Struct
:60/9
 Meta:10 http://XXXXX/mailing-lists/YYYYYY/0206.html Freq:1 Pos/Struct
:212/9
 Meta:10 http://XXXXX/mailing-lists/YYYYYY/0213.html Freq:1 Pos/Struct
:44/9
 Meta:10 http://XXXXX/mailing-lists/YYYYYY/0269.html Freq:1 Pos/Struct
:55/9
 Meta:10 http://XXXXX/mailing-lists/YYYYYY/0271.html Freq:1 Pos/Struct
:71/9
 Meta:10 http://XXXXX/mailing-lists/YYYYYY/0461.html Freq:1 Pos/Struct
:121/9
 Meta:10 http://XXXXX/mailing-lists/YYYYYY/0473.html Freq:1 Pos/Struct
:60/9
 Meta:10 http://XXXXX/mailing-lists/YYYYYY/0644.html Freq:1 Pos/Struct
:50/9
 Meta:10 http://XXXXX/mailing-lists/YYYYYY/0741.html Freq:1 Pos/Struct
:64/9
 Meta:10 http://XXXXX/mailing-lists/YYYYYY/0811.html Freq:2 Pos/Struct
:51/9,84/9
 Meta:10 http://XXXXX/mailing-lists/YYYYYY/0825.html Freq:2 Pos/Struct
:67/9,92/9

Running:

/tools/swish_e-2.1dev25/bin/swish-e -w guest -f hypermail.idx 
# SWISH format: 2.1-dev-25
# Search words: guest
err: no results
.

Huh?? Any idea what's happening here? The same thing happens if I use
/tools/swish_e-2.4.0_pr1/bin/swish-e

The file header is:

# Swish-e format: 2.1-dev-25
# 
# Name: XXXXX Majordomo Mailing list archives
# Saved as: hypermail.idx
# Counts: 10147 words, 980 files
# Indexed on: 2003-09-01 16:21:29 EDT
# Description: Index of XXXXX Majordomo mailing list archives
# Pointer: http://XXXX/mailing-lists
# Maintained by: admin@XXXXX
# DocumentProperties: Enabled
# Stemming Applied: 0
# Soundex Applied: 0
# IgnoreTotalWordCountWhenRanking: 1
# WordCharacters: 0123456789abcdefghijklmnopqrstuvwxyz
# MinWordLimit: 1
# MaxWordLimit: 40
# BeginCharacters: 0123456789abcdefghijklmnopqrstuvwxyz
# EndCharacters: 0123456789abcdefghijklmnopqrstuvwxyz
# IgnoreFirstChar: 
# IgnoreLastChar: 

The WordCharacters, BeginCharacters, and EndCharaters also include
some 8 bit characters that aren't pasted here.

				-- rouilj
John Rouillard
===========================================================================
My employers don't acknowledge my existence much less my opinions.
Received on Mon Sep 1 20:37:36 2003