Quick thought for you. if you can add the XML to the files, you should be
able to write a program that uses those tags to present each chapter (or
even page) individually to swish-e while indexing (programmatically, not via
the file system) and still refer to the location of the composite file,
possibly even using anchor tags (mybook.htm#chap3 for example).
[mailto:email@example.com] On Behalf Of Lucas Meadows
Sent: Friday, January 08, 2010 7:28 PM
Subject: [swish-e] Indexing page and chapter of a book (single document)
I have a large number of text files (books) that I am trying to make
searchable with swish-e.
I would like search results to display the page and chapter in which a match
with the search term occurred (in addition to the book itself). Again, I
would prefer to not have to break the files down into separate
page/chapters, even though it would allow me to index by page and chapter
without additional fuss.
My question is whether it is possible to leverage swish-e's support for xml
tags and add some kind of xml to the files so that they can be indexed in a
way that captures the relationship of the terms to the pages and chapters
that contain them.
Many thanks in advance!
Users mailing list
Received on Sat Jan 9 08:45:16 2010