1. 07 Jun, 2010 3 commits
  2. 05 Jun, 2010 1 commit
  3. 04 Jun, 2010 2 commits
  4. 03 Jun, 2010 18 commits
  5. 02 Jun, 2010 1 commit
  6. 01 Jun, 2010 6 commits
  7. 31 May, 2010 7 commits
  8. 30 May, 2010 2 commits
    • Tom Lane's avatar
      Fix misuse of Lossy Counting (LC) algorithm in compute_tsvector_stats(). · bc0f0809
      Tom Lane authored
      We must filter out hashtable entries with frequencies less than those
      specified by the algorithm, else we risk emitting junk entries whose
      actual frequency is much less than other lexemes that did not get
      tabulated.  This is bad enough by itself, but even worse is that
      tsquerysel() believes that the minimum frequency seen in pg_statistic is a
      hard upper bound for lexemes not included, and was thus underestimating
      the frequency of non-MCEs.
      
      Also, set the threshold frequency to something with a little bit of theory
      behind it, to wit assume that the input distribution is approximately
      Zipfian.  This might need adjustment in future, but some preliminary
      experiments suggest that it's not too unreasonable.
      
      Back-patch to 8.4, where this code was introduced.
      
      Jan Urbanski, with some editorialization by Tom
      bc0f0809
    • Tom Lane's avatar
      Change the notation for calling functions with named parameters from · b12b7a90
      Tom Lane authored
      "val AS name" to "name := val", as per recent discussion.
      
      This patch catches everything in the original named-parameters patch,
      but I'm not certain that no other dependencies snuck in later (grepping
      the source tree for all uses of AS soon proved unworkable).
      
      In passing I note that we've dropped the ball at least once on keeping
      ecpg's lexer (as opposed to parser) in sync with the backend.  It would
      be a good idea to go through all of pgc.l and see if it's in sync now.
      I didn't attempt that at the moment.
      b12b7a90