On Sat, Feb 10, 2007 at 12:14:40AM +1000, Matt Paine wrote:
> > Adding:[1:swishdefault(1)] 'hello' Pos:5 Stuct:0x29 ( HEADING BODY FILE )
> Adding:[1:swishdefault(1)] 'hello' Pos:1 Stuct:0x21 ( HEADING > FILE )
> One thing I'm noticing is the first thing to get indexed is HEADING
> FILE, where as in your indexing its HEADING BODY FILE. By putting <body>
> tags around the html I can get it to say that, but I still cant get the
> <id> tag or the type tag to index as a META BODY FILE like yours.
Then perhaps it's your version of libxml2. Libxml2 is doing the
parsing and we are parsing an invalid html file, so maybe different
versions of libxml2 handle it differently.
I'm running 2.6.27 on Debian.
$ swish-e -c c -i doc.html -T parsed_tags -v0
<id> (meta [id])
<id> (property [id])
<name> (undefined meta name - no action)
<type> (meta [type])
<type> (property [type])
Unsubscribe from or help with the swish-e list:
Help with Swish-e:
Users mailing list
Received on Fri Feb 9 10:29:34 2007