Skip to main content.
home | support | download

Back to List Archive

XML and XHTML support

From: Heimo Hanninen <heimohanninen(at)not-real.yahoo.co.uk>
Date: Fri Dec 12 2003 - 12:27:54 GMT
Hello there

A simple question. is it possible and how to
index and search string in context (XML element or
attribute). 

I am testing windows binaries (latest 2.4.0) on
command line, and
believe having libxml2 on board by default?

slow learner?

PROBLEMOS
I just couldn't't quite figure it out from
documentation how to set up:
-config file
-run indexing (this I guess follows the normal path)
-do proper context searching
-does the source have to be XML (XHTM)? I guess yes in
order to parse correctly...

CONTEXT SEARCH GOALS (examples):

-find docs which created later than 10-10-2000 
-find <DIV> elements where @type="maintenance"

-and combining: find string "turbine" and <DIV>
element where @type="maintenance"



SOURCE:
a snippet from source HTML (currently but could be
easily generated as XHTML):

   <head>
      <meta http-equiv="Content-Type"
content="text/html; charset=utf-8">
      <meta name="created" content="11-12-2002">
      <title>Org - Some IETM - General
Information</title>
   </head>
   <body onload="showIDs(parent.parent.topbar.IDs);
showChanges(parent.parent.topbar.Changes, true)">

<DIV id="TipLayer"
style="visibility:hidden;position:absolute;z-index:1000;top:-100"></DIV>
      <div type="maintenance" class="module">
          <h1 class="part"><a class="ids"
name="p-000ml001-0001-0001"></a>PART 1 - GENERAL
INFORMATION</h1>
         <h1 class="section">Section 1 - General SSW
Information</h1>
         <div class="module">
            <h2 class="module"><a class="ids"
name="o-000ml-0101"></a>Contents of this book</h2>
               <p class="para">This publication
                <span id="changemarker">contains
information</span>...


Thanks, Heimo Hänninen, newbie

PS. thanks for the answer to previous quastion about
running IE & SWISH locally without a web server...
dropped the idea...putting everythin on IIS.

________________________________________________________________________
BT Yahoo! Broadband - Save £80 when you order online today. Hurry! Offer ends 21st December 2003. The way the internet was meant to be. http://uk.rd.yahoo.com/evt=21064/*http://btyahoo.yahoo.co.uk
Received on Fri Dec 12 12:28:02 2003