On Fri, 7 Feb 2003, John Almberg wrote:
> This sounded promising, but didn't work.
>
> The javascript already is in comments . . . like:
>
> <javascript>
> <!--
> stuff
> // -->
> </javascript>
Which parser are you using? Here is what I get with a script wrapped
in an HTML comment:
$ cat 1.html
<html>
<head>
<title>somecompany</title>
<!-- comment
<script language="JavaScript">
// a javascript comment
alert("Yikes, Javacript!");
</script>
// -->
</head>
<body>
bodyword
</body>
</html>
$ ./swish-e -i 1.html -T indexed_words -v0
Adding:[1:swishdefault(1)] 'somecompany' Pos:2 Stuct:0x7 ( HEAD TITLE FILE )
Adding:[1:swishdefault(1)] 'bodyword' Pos:5 Stuct:0x9 ( BODY FILE )
That's using the libxml2 parser. Only the title and body words are
indexed; nothing inside the comment is.
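
For comparison, here is a rough sketch of the same check done outside of swish-e, using Python's standard html.parser module. This is not how swish-e or libxml2 work internally; the WordCollector class and indexed_words function are made-up names for illustration. It only shows the general idea that a parser which treats everything between <!-- and --> as a comment never hands that text to the word collector.

```python
from html.parser import HTMLParser

# Same test document as 1.html above.
DOC = """<html>
<head>
<title>somecompany</title>
<!-- comment
<script language="JavaScript">
// a javascript comment
alert("Yikes, Javacript!");
</script>
// -->
</head>
<body>
bodyword
</body>
</html>"""


class WordCollector(HTMLParser):
    """Hypothetical helper: collects whitespace-separated words from text nodes."""

    def __init__(self):
        super().__init__()
        self.words = []

    def handle_data(self, data):
        # Text between tags ends up here; comment text does not.
        self.words.extend(data.split())

    def handle_comment(self, data):
        # Everything between <!-- and --> arrives here and is ignored.
        pass


def indexed_words(html):
    collector = WordCollector()
    collector.feed(html)
    collector.close()
    return collector.words


if __name__ == "__main__":
    print(indexed_words(DOC))
```

With this parser only 'somecompany' and 'bodyword' are collected, which matches the swish-e output above. A parser that does not recognize the comment the same way could give a different result, which is why it matters which one is in use.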
--
Bill Moseley moseley@hank.org
Received on Fri Feb 7 20:14:47 2003