François Tissandier wrote on 10/06/2011 06:58 AM:
>> Get them from your web server's log files. Or (better) add some code to
>> your search script to log the search terms separately. We do this to let
>> us add a search cloud pop-up of the top ten most recently-searched terms.
> Mmm good idea, but that's not what I want to do. I want to propose the
> most popular keywords from my content, not from the searches ! And those
> keywords are in the index, so I thought there is maybe a way to extract
> the most popular ones. By "popular" I mean "keywords appearing the most
> often". Sorry if my frenglish is not clear !
you probably want:
per this email thread:
Note though, that you should really not equate "popularity" with
"frequency" -- especially if you are not using StopWords -- because e.g.
the frequency of the word 'the' will skew your definition of "popular".
I don't use a StopWords list because my use cases demand precision. If
it were me, I would research the actual frequency of words in my
collection using the countwords.pl script, and then identify a "sweet
spot" range of frequency that ignores what would otherwise be StopWords.
Peter Karman . http://peknet.com/ . peter(at)not-real.peknet.com
Users mailing list
Received on Thu Oct 06 2011 - 13:58:35 GMT