Re: Grouping results

From: John Angel <angel_john(at)not-real.hotmail.com>
Date: Wed Dec 03 2003 - 11:42:57 GMT
Hi Bill,

It sounds too complicated for me; I am not a programmer. It was meant as a
feature request :)

Thanks anyway,
John


>From: Bill Moseley <moseley@hank.org>
>To: John Angel <angel_john@hotmail.com>
>CC: Multiple recipients of list <swish-e@sunsite.berkeley.edu>
>Subject: Re: [SWISH-E] Re: Grouping results
>Date: Mon, 1 Dec 2003 14:36:01 -0800
>
>On Mon, Dec 01, 2003 at 01:47:15PM -0800, John Angel wrote:
> > That way there will be less than 10 results per page.
> >
> > E.g. what if all 10 results on page are from the same site, there will be
> > only 2 results displayed?
>
>Well, that's what I meant when I said you would need to do some post
>processing.  So instead of saying pages start at 0, 10, 20,... you would
>have to track better and just offer previous and next.
>
>So on the first page you fetch enough results to make a complete page.
>Then look ahead for the first record on the "next" page and then pass
>that as the starting location in your links (to the next page).
>"Previous Page" would also need to be tracked in links because you can't
>just subtract 20 from the current location.
>
>Regardless, you would want to use the API so you can easily scan through
>all the results.
>
>BTW -- the result list that swish maintains doesn't have backwards
>links, IIRC.  SwishSeek() just starts at the beginning of the linked
>list and walks (runs?) the list looking for the requested entry.  When
>searching multiple indexes (and sorting by path) swish has to read all
>the pathnames off disk when sorting.  So, in other words, you
>may want to avoid seeking too many times.
>
>
> >
> >
> > >From: Bill Moseley <moseley@hank.org>
> > >Reply-To: moseley@hank.org
> > >To: Multiple recipients of list <swish-e@sunsite.berkeley.edu>
> > >Subject: [SWISH-E] Re: Grouping results
> > >Date: Tue, 25 Nov 2003 13:28:52 -0800 (PST)
> > >
> > >On Tue, Nov 25, 2003 at 01:24:41PM -0800, Bill Moseley wrote:
> > > > On Sun, Nov 23, 2003 at 12:45:23PM -0800, John Angel wrote:
> > > > > Is it possible to group results by site like on Google (to display
> > > > > only 2 hits from the same site, not all of them)?
> > > >
> > > > Did I already respond to this?
> > > >
> > > > You would have to post-process;  Need to think about what to do if
> > > > showing a page of results at a time -- you might come up short.
> > > >
> > > > Fake code:
> > > >
> > > > my %seen;
> > > > while ( my $result = next_result() ) {
> > > >     my $uri = URI->new( $result->swishdocpath );
> > > >     next if $seen{ $uri->host }++ == 2;
> > >
> > >I assume you want something more like >= 2.
> > >
> > >
> > > >     show_result( $result );
> > > > }
> > > >
> > > > --
> > > > Bill Moseley
> > > > moseley@hank.org
> > > >
> > > >
> > >
> > >--
> > >Bill Moseley
> > >moseley@hank.org
> > >
> >
> >
> >
>
>--
>Bill Moseley
>moseley@hank.org
>
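
For reference, the post-processing Bill describes (cap the hits per host, fill
each page completely, and carry the next starting location in the link) could
be sketched roughly as below. This is only a sketch under assumptions: the
ranked results are taken to be a plain list of URL-style swishdocpath strings
already fetched through the API, the per-host counts are assumed to reset on
every page, and host_of() and build_page() are hypothetical helper names, not
part of swish-e. It uses the ">= 2" style test suggested in the follow-up.

```perl
use strict;
use warnings;

# Hypothetical helper: extract the host from a URL-style path.
# Returns '' when the path does not look like scheme://host/...
sub host_of {
    my ($path) = @_;
    return $path =~ m{^\w+://([^/:]+)} ? lc $1 : '';
}

# Build one page starting at offset $start of the ranked list.
# Returns (\@rows_to_show, $next_start); $next_start is undef when
# the list is exhausted, so no "next" link should be offered.
sub build_page {
    my ( $paths, $start, $page_size, $max_per_host ) = @_;
    my ( %seen, @page );
    my $i = $start;
    while ( $i < @$paths && @page < $page_size ) {
        my $path = $paths->[ $i++ ];
        # Skip once this host already has $max_per_host rows on the page.
        next if $seen{ host_of($path) }++ >= $max_per_host;
        push @page, $path;
    }
    my $next = $i < @$paths ? $i : undef;
    return ( \@page, $next );
}

# Usage: five hits from one site, then two other sites.
my @paths = ( ( map { "http://a.example/$_" } 1 .. 5 ),
    'http://b.example/1', 'http://c.example/1' );
my @history;    # start offsets of pages already shown, for "previous" links
my $start = 0;
while ( defined $start ) {
    my ( $page, $next ) = build_page( \@paths, $start, 3, 2 );
    print "start=$start: @$page\n";
    push @history, $start;
    $start = $next;
}
```

Because pages no longer begin at fixed multiples of the page size, the
"previous" link has to come from the recorded start offsets (@history above,
or the same values passed along in the links) rather than from arithmetic on
the current location. Scanning forward once per page also keeps the number of
seeks down.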

Received on Wed Dec 3 11:43:08 2003