>From: Bill Moseley <moseley@hank.org>
>Date: Thu, 13 Nov 2003 15:22:36 -0800
>On Thu, Nov 13, 2003 at 11:19:43PM +0000, Dave Moreau wrote:
> > >Date: Wed, 12 Nov 2003 23:02:56 -0800 (PST)
> > >
> > >On Wed, Nov 12, 2003 at 06:23:30PM -0800, Dave Moreau wrote:
> > >> I am having a problem with how swish-e indexes my XML files. I have
> > >> formatting information on poem files like the following:
> > >>
> > >> <line indent="3em">fills ...
> > >>
> > >> Which indexes the word 3emfills!
> > >
> > >Can you provide an example?
> > >
> > >moseley@bumby:~$ cat c
> > >ParserWarnLevel 9
> > >DefaultContents XML2
> > >UndefinedMetaTags index
> > >
> > >moseley@bumby:~$ cat 1.xml
> > ><xml>
> > ><line indent="3em">fills</line>
> > ></xml>
> > >
> > >moseley@bumby:~$ swish-e -c c -i 1.xml -T indexed_words -v0
> > > Adding:[1:swishdefault(1)] 'fills' Pos:5 Stuct:0x1 ( FILE )
> > >
> > >
> > >--
> > >Bill Moseley
> > >moseley@hank.org
Here's an example
D:\SWISH-E2_4\test>type t
IndexFile D:\swish-e2_4\test\i
IndexDir D:\swish-e2_4\test
UndefinedMetaTags index
UndefinedXMLAttributes index
IndexOnly .xml
IndexContents XML2 .xml
D:\SWISH-E2_4\test>type test.xml
<stanza>
<line>Winds</line>
<line indent="3em"> steal warmth;</line>
<line>Silence echoes</line>
<line indent="3em">the thunder of one heart beating--but</line>
</stanza>
D:\SWISH-E2_4\test>..\swish-e -c t -T indexed_words -v3
Parsing config file 't'
Indexing Data Source: "File-System"
Indexing "D:\swish-e2_4\test"
Checking dir "D:/swish-e2_4/test"...
test.xml - Using XML2 parser - Adding:[1:swishdefault(1)] 'winds'
Pos:5 Stuct:0x1 ( FILE )
Adding:[1:swishdefault(1)] '3em' Pos:12 Stuct:0x1 ( FILE )
Adding:[1:swishdefault(1)] 'steal' Pos:13 Stuct:0x1 ( FILE )
Adding:[1:swishdefault(1)] 'warmth' Pos:14 Stuct:0x1 ( FILE )
Adding:[1:swishdefault(1)] 'silence' Pos:15 Stuct:0x1 ( FILE )
Adding:[1:swishdefault(1)] 'echoes' Pos:16 Stuct:0x1 ( FILE )
Adding:[1:swishdefault(1)] '3emthe' Pos:21 Stuct:0x1 ( FILE )
Adding:[1:swishdefault(1)] 'thunder' Pos:22 Stuct:0x1 ( FILE )
Adding:[1:swishdefault(1)] 'of' Pos:23 Stuct:0x1 ( FILE )
Adding:[1:swishdefault(1)] 'one' Pos:24 Stuct:0x1 ( FILE )
Adding:[1:swishdefault(1)] 'heart' Pos:25 Stuct:0x1 ( FILE )
Adding:[1:swishdefault(1)] 'beating' Pos:26 Stuct:0x1 ( FILE )
Adding:[1:swishdefault(1)] 'but' Pos:27 Stuct:0x1 ( FILE )
(13 words)
Removing very common words...
no words removed.
Writing main index...
Sorting words ...
Sorting 13 words alphabetically
Writing header ...
Writing index entries ...
Writing word text: Complete
Writing word hash: Complete
Writing word data: Complete
13 unique words indexed.
4 properties sorted.
1 file indexed. 175 total bytes. 13 total words.
Elapsed time: 00:00:00 CPU time: 00:00:00
Indexing done!
You can see, it indexed 3emthe
dave
_________________________________________________________________
The new MSN 8: advanced junk mail protection and 2 months FREE*
http://join.msn.com/?page=features/junkmail
Received on Fri Nov 14 00:19:32 2003