Re: [XWarn] Re: Re: attribute value attaching to words

From: Dave Moreau <davidjmoreau(at)not-real.hotmail.com>
Date: Fri Nov 14 2003 - 00:19:27 GMT
>From: Bill Moseley <moseley@hank.org>
>Date: Thu, 13 Nov 2003 15:22:36 -0800
>On Thu, Nov 13, 2003 at 11:19:43PM +0000, Dave Moreau wrote:
> > >Date: Wed, 12 Nov 2003 23:02:56 -0800 (PST)
> > >
> > >On Wed, Nov 12, 2003 at 06:23:30PM -0800, Dave Moreau wrote:
> > >> I am having a problem with how swish-e indexes my XML files. I have
> > >> formatting information on poem files like the following:
> > >>
> > >>   <line indent="3em">fills ...
> > >>
> > >> Which indexes the word 3emfills!
> > >
> > >Can you provide an example?
> > >
> > >moseley@bumby:~$ cat c
> > >ParserWarnLevel 9
> > >DefaultContents XML2
> > >UndefinedMetaTags index
> > >
> > >moseley@bumby:~$ cat 1.xml
> > ><xml>
> > ><line indent="3em">fills</line>
> > ></xml>
> > >
> > >moseley@bumby:~$ swish-e -c c -i 1.xml -T indexed_words -v0
> > >    Adding:[1:swishdefault(1)]   'fills'   Pos:5  Stuct:0x1 ( FILE )
> > >
> > >
> > >--
> > >Bill Moseley
> > >moseley@hank.org

Here's an example:

D:\SWISH-E2_4\test>type t
IndexFile       D:\swish-e2_4\test\i
IndexDir        D:\swish-e2_4\test
UndefinedMetaTags       index
UndefinedXMLAttributes  index
IndexOnly .xml
IndexContents   XML2    .xml

D:\SWISH-E2_4\test>type test.xml
<stanza>
<line>Winds</line>
<line indent="3em"> steal warmth;</line>
<line>Silence echoes</line>
<line indent="3em">the thunder of one heart beating--but</line>
</stanza>

D:\SWISH-E2_4\test>..\swish-e -c t -T indexed_words -v3
Parsing config file 't'
Indexing Data Source: "File-System"
Indexing "D:\swish-e2_4\test"

Checking dir "D:/swish-e2_4/test"...
  test.xml - Using XML2 parser -     Adding:[1:swishdefault(1)]   'winds'   Pos:5  Stuct:0x1 ( FILE )
    Adding:[1:swishdefault(1)]   '3em'   Pos:12  Stuct:0x1 ( FILE )
    Adding:[1:swishdefault(1)]   'steal'   Pos:13  Stuct:0x1 ( FILE )
    Adding:[1:swishdefault(1)]   'warmth'   Pos:14  Stuct:0x1 ( FILE )
    Adding:[1:swishdefault(1)]   'silence'   Pos:15  Stuct:0x1 ( FILE )
    Adding:[1:swishdefault(1)]   'echoes'   Pos:16  Stuct:0x1 ( FILE )
    Adding:[1:swishdefault(1)]   '3emthe'   Pos:21  Stuct:0x1 ( FILE )
    Adding:[1:swishdefault(1)]   'thunder'   Pos:22  Stuct:0x1 ( FILE )
    Adding:[1:swishdefault(1)]   'of'   Pos:23  Stuct:0x1 ( FILE )
    Adding:[1:swishdefault(1)]   'one'   Pos:24  Stuct:0x1 ( FILE )
    Adding:[1:swishdefault(1)]   'heart'   Pos:25  Stuct:0x1 ( FILE )
    Adding:[1:swishdefault(1)]   'beating'   Pos:26  Stuct:0x1 ( FILE )
    Adding:[1:swishdefault(1)]   'but'   Pos:27  Stuct:0x1 ( FILE )
(13 words)

Removing very common words...
no words removed.
Writing main index...
Sorting words ...
Sorting 13 words alphabetically
Writing header ...
Writing index entries ...
  Writing word text: Complete
  Writing word hash: Complete
  Writing word data: Complete
13 unique words indexed.
4 properties sorted.
1 file indexed.  175 total bytes.  13 total words.
Elapsed time: 00:00:00 CPU time: 00:00:00
Indexing done!

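As you can see, it indexed '3emthe': the value of the indent attribute was
joined to the first word of that line. In the other indented line, where the
text starts with a space, '3em' was indexed as a separate word.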
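
To make the symptom easier to reason about, here is a rough Python sketch that reproduces the same word list from test.xml. It is only a toy model, not swish-e's code: the function name `words` and its `separator` parameter are made up for illustration, and the assumption is simply that the attribute value and the element text end up in one buffer with nothing between them.

```python
import re
import xml.etree.ElementTree as ET

# Same content as test.xml from the transcript above.
XML = """<stanza>
<line>Winds</line>
<line indent="3em"> steal warmth;</line>
<line>Silence echoes</line>
<line indent="3em">the thunder of one heart beating--but</line>
</stanza>"""


def words(xml_text, separator):
    """Hypothetical helper: gather attribute values and text, then split into words.

    `separator` is whatever gets placed between an attribute value and the
    text that follows it. Element order is only right for flat documents
    like this one, which is enough for the illustration.
    """
    chunks = []
    for elem in ET.fromstring(xml_text).iter():
        for value in elem.attrib.values():
            chunks.append(value)
            chunks.append(separator)
        if elem.text:
            chunks.append(elem.text)
        if elem.tail:
            chunks.append(elem.tail)
    # Lowercase and keep runs of letters and digits as words.
    return re.findall(r"[a-z0-9]+", "".join(chunks).lower())


glued = words(XML, "")    # nothing between attribute value and text
spaced = words(XML, " ")  # a space between attribute value and text

print(glued)
print(spaced)
```

With an empty separator the list contains '3emthe' (and '3em' only where the line text begins with a space), which matches the 13 words in the indexer output. With a space as the separator, '3em' and 'the' come out as separate words.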

dave

Received on Fri Nov 14 00:19:32 2003