Sorting the Stop word lists

Started by Simon Riggs over 18 years ago, 3 messages
#1Simon Riggs
simon@2ndquadrant.com

I notice we sort the stop word list after we read it into memory.

Wouldn't it be easier to

1. Sort the stopword lists in the main distribution

2. Require them to be sorted

3. Remove the sort from readstoplist()

We should at the very least do (1) to improve the sort speed at startup.
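For readers following along, here is a minimal sketch of the load-then-sort pattern under discussion. It is not the actual readstoplist() code: the StopWords struct and the function names are hypothetical, and plain strcmp() ordering is assumed. The list is sorted once after loading so that later lookups can use binary search.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical container for illustration; not PostgreSQL's own struct. */
typedef struct
{
    const char **words;   /* array of stop words */
    size_t       len;     /* number of entries */
} StopWords;

/* Comparator for qsort()/bsearch(): elements are pointers to strings. */
static int
cmp_words(const void *a, const void *b)
{
    const char *const *x = a;
    const char *const *y = b;

    return strcmp(*x, *y);
}

/* Sort the list once, right after it has been read into memory. */
static void
stopwords_sort(StopWords *s)
{
    if (s->len > 1)
        qsort(s->words, s->len, sizeof(char *), cmp_words);
}

/* Binary search; only valid if the list is already sorted with cmp_words. */
static int
stopwords_contains(const StopWords *s, const char *key)
{
    if (s->len == 0)
        return 0;
    return bsearch(&key, s->words, s->len, sizeof(char *), cmp_words) != NULL;
}
```

Proposal (3) would amount to dropping the stopwords_sort() call and trusting that the file on disk is already in the order that stopwords_contains() expects.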

--
Simon Riggs
2ndQuadrant http://www.2ndQuadrant.com

#2Teodor Sigaev
teodor@sigaev.ru
In reply to: Simon Riggs (#1)
Re: Sorting the Stop word lists

> 1. Sort the stopword lists in the main distribution
> 2. Require them to be sorted
> 3. Remove the sort from readstoplist()

I don't believe that will be a big win in performance; the lists are rather
small. And we would need to add a check that the lists are sorted.
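The check mentioned here could look roughly like the sketch below. It is illustrative only: stopwords_is_sorted() is a hypothetical name, and strcmp() ordering is assumed, which would have to match whatever comparison the lookup code actually uses.

```c
#include <stddef.h>
#include <string.h>

/*
 * Return 1 if words[0..len-1] is in non-decreasing strcmp() order, else 0.
 * One linear pass over adjacent pairs; empty and one-element lists pass.
 */
static int
stopwords_is_sorted(const char *const *words, size_t len)
{
    for (size_t i = 1; i < len; i++)
    {
        if (strcmp(words[i - 1], words[i]) > 0)
            return 0;
    }
    return 1;
}
```

The pass is linear in the list length, so on lists this small it saves little over simply sorting, and it adds a failure case that would need to be reported to the user.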

--
Teodor Sigaev E-mail: teodor@sigaev.ru
WWW: http://www.sigaev.ru/

#3Tom Lane
tgl@sss.pgh.pa.us
In reply to: Simon Riggs (#1)
Re: Sorting the Stop word lists

Simon Riggs <simon@2ndquadrant.com> writes:

> I notice we sort the stop word list after we read it into memory.

I see nothing wrong with that; it only happens once per backend session,
and it makes maintenance of the files easier.

regards, tom lane