Sorting the Stop word lists
Started by Simon Riggs, over 18 years ago, 3 messages
I notice we sort the stop word list after we read it into memory.
Wouldn't it be easier to
1. Sort the stopword lists in the main distribution
2. Require them to be sorted
3. Remove the sort from readstoplist()
We should at the very least do (1) to improve the sort speed at startup.
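
For reference, the sort-on-load pattern under discussion can be sketched roughly as below. This is a minimal illustration, not the actual readstoplist() code: the StopListSketch struct and the names cmp_words, sort_stop_list and is_stop_word are hypothetical. The point it shows is that the lookup relies on the list being ordered, so the order has to come from somewhere, either a sort at load time or a pre-sorted file.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for a loaded stop word list (not the PostgreSQL struct). */
typedef struct
{
    char  **words;
    size_t  len;
} StopListSketch;

/* Compare two array elements, each of which is a pointer to a C string. */
static int
cmp_words(const void *a, const void *b)
{
    return strcmp(*(char *const *) a, *(char *const *) b);
}

/* Sort once at load time; this is the step the proposal would remove. */
static void
sort_stop_list(StopListSketch *s)
{
    assert(s != NULL);
    if (s->len > 1)
        qsort(s->words, s->len, sizeof(char *), cmp_words);
}

/* Binary search; only correct if the list is already in cmp_words order. */
static int
is_stop_word(const StopListSketch *s, const char *key)
{
    assert(s != NULL && key != NULL);
    if (s->len == 0)
        return 0;
    return bsearch(&key, s->words, s->len, sizeof(char *), cmp_words) != NULL;
}
```

If the sort were dropped, the files would have to be sorted in exactly the order the comparator uses (plain byte order in this sketch), otherwise lookups could silently miss words.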
--
Simon Riggs
2ndQuadrant http://www.2ndQuadrant.com
Re: Sorting the Stop word lists
> 1. Sort the stopword lists in the main distribution
> 2. Require them to be sorted
> 3. Remove the sort from readstoplist()
I don't believe that will be a big win in performance; the lists are rather small. And
we would then need to add a check that the list is sorted.
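
A sketch of what such a check might look like, assuming the words are held as an array of C strings; the names cmp_str_ptrs, words_are_sorted and ensure_sorted are hypothetical and not taken from the PostgreSQL source. It makes one linear pass and falls back to qsort() only when the input turns out to be unsorted:

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/* Compare two array elements, each of which is a pointer to a C string. */
static int
cmp_str_ptrs(const void *a, const void *b)
{
    return strcmp(*(char *const *) a, *(char *const *) b);
}

/* Return 1 if the words are in non-decreasing strcmp() order, else 0. */
static int
words_are_sorted(char *const *words, size_t len)
{
    assert(words != NULL || len == 0);
    for (size_t i = 1; i < len; i++)
    {
        if (strcmp(words[i - 1], words[i]) > 0)
            return 0;
    }
    return 1;
}

/* Sort only when the check fails; return 1 if a sort was performed. */
static int
ensure_sorted(char **words, size_t len)
{
    if (words_are_sorted(words, len))
        return 0;
    qsort(words, len, sizeof(char *), cmp_str_ptrs);
    return 1;
}
```

The check costs len - 1 comparisons per list, so for small lists the saving over always sorting is modest, which is consistent with the point above.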
--
Teodor Sigaev E-mail: teodor@sigaev.ru
WWW: http://www.sigaev.ru/