1. 05 Sep, 2011 6 commits
  2. 04 Sep, 2011 7 commits
    • Tom Lane's avatar
      Change get_variable_numdistinct's API to flag default estimates explicitly. · 4c2777d0
      Tom Lane authored
      Formerly, callers tested for DEFAULT_NUM_DISTINCT, which had the problem
      that a perfectly solid estimate might be mistaken for a content-free
      default.
      4c2777d0
    • Tom Lane's avatar
      Dig down into sub-selects to look for column statistics. · 1cb108ef
      Tom Lane authored
      If a sub-select's output column is a simple Var, recursively look for
      statistics applying to that Var, and use them if available.  The need for
      this was foreseen ages ago, but we didn't have enough infrastructure to do
      it with reasonable speed until just now.
      
      We punt and stick with default estimates if the subquery uses set
      operations, GROUP BY, or DISTINCT, since those operations would change the
      underlying column statistics (particularly, the relative frequencies of
      different values) beyond recognition.  This means that the types of
      sub-selects for which this improvement applies are fairly limited, since
      most subqueries satisfying those restrictions would have gotten flattened
      into the parent query anyway.  But it does help for some cases, such as
      subqueries with ORDER BY or LIMIT.
      1cb108ef
    • Tom Lane's avatar
      Can't print PlannerGlobal's subroots list in outfuncs. · 698df335
      Tom Lane authored
      Since the subroots will surely link back to the same glob struct, this
      necessarily leads to infinite recursion.  Doh.  Found while trying to
      debug some other code.
      698df335
    • Tom Lane's avatar
      Clean up the #include mess a little. · 1609797c
      Tom Lane authored
      walsender.h should depend on xlog.h, not vice versa.  (Actually, the
      inclusion was circular until a couple hours ago, which was even sillier;
      but Bruce broke it in the expedient rather than logically correct
      direction.)  Because of that poor decision, plus blind application of
      pgrminclude, we had a situation where half the system was depending on
      xlog.h to include such unrelated stuff as array.h and guc.h.  Clean up
      the header inclusion, and manually revert a lot of what pgrminclude had
      done so things build again.
      
      This episode reinforces my feeling that pgrminclude should not be run
      without adult supervision.  Inclusion changes in header files in particular
      need to be reviewed with great care.  More generally, it'd be good if we
      had a clearer notion of module layering to dictate which headers can sanely
      include which others ... but that's a big task for another day.
      1609797c
    • Tom Lane's avatar
      Remove unnecessary and circular #include. · f116b1f5
      Tom Lane authored
      storage/proc.h should not include replication/syncrep.h, especially not
      when the latter includes storage/proc.h; but in any case this was a pretty
      poor thing from a modular layering standpoint.
      f116b1f5
    • Bruce Momjian's avatar
      walsender.h doesn't need xlog.h, per Tom. · 5bce637a
      Bruce Momjian authored
      5bce637a
    • Bruce Momjian's avatar
      Move AllowCascadeReplication() define from xlog.h to replication include · 85e6e166
      Bruce Momjian authored
      file.
      
      Per suggestion from Alvaro.
      85e6e166
  3. 03 Sep, 2011 3 commits
    • Bruce Momjian's avatar
      Remove find_lt sgml tool, as it is not needed. · ca598c18
      Bruce Momjian authored
      Per suggestion from Peter.
      ca598c18
    • Tom Lane's avatar
      Fix typo in pg_srand48 (srand48 in older branches). · 48e4b8dc
      Tom Lane authored
      ">" should be ">>".  This typo results in failure to use all of the bits
      of the provided seed.
      
      This might rise to the level of a security bug if we were relying on
      srand48 for any security-critical purposes, but we are not --- in fact,
      it's not used at all unless the platform lacks srandom(), which is
      improbable.  Even on such a platform the exposure seems minimal.
      
      Reported privately by Andres Freund.
      48e4b8dc
    • Tom Lane's avatar
      Rearrange planner to save the whole PlannerInfo (subroot) for a subquery. · b3aaf908
      Tom Lane authored
      Formerly, set_subquery_pathlist and other creators of plans for subqueries
      saved only the rangetable and rowMarks lists from the lower-level
      PlannerInfo.  But there's no reason not to remember the whole PlannerInfo,
      and indeed this turns out to simplify matters in a number of places.
      
      The immediate reason for doing this was so that the subroot will still be
      accessible when we're trying to extract column statistics out of an
      already-planned subquery.  But now that I've done it, it seems like a good
      code-beautification effort in its own right.
      
      I also chose to get rid of the transient subrtable and subrowmark fields in
      SubqueryScan nodes, in favor of having setrefs.c look up the subquery's
      RelOptInfo.  That required changing all the APIs in setrefs.c to pass
      PlannerInfo not PlannerGlobal, which was a large but quite mechanical
      transformation.
      
      One side-effect not foreseen at the beginning is that this finally broke
      inheritance_planner's assumption that replanning the same subquery RTE N
      times would necessarily give interchangeable results each time.  That
      assumption was always pretty risky, but now we really have to make a
      separate RTE for each instance so that there's a place to carry the
      separate subroots.
      b3aaf908
  4. 02 Sep, 2011 4 commits
  5. 01 Sep, 2011 19 commits
  6. 31 Aug, 2011 1 commit
    • Tom Lane's avatar
      Improve eqjoinsel's ndistinct clamping to work for multiple levels of join. · 97930cf5
      Tom Lane authored
      This patch fixes an oversight in my commit
      7f3eba30 of 2008-10-23.  That patch
      accounted for baserel restriction clauses that reduced the number of rows
      coming out of a table (and hence the number of possibly-distinct values of
      a join variable), but not for join restriction clauses that might have been
      applied at a lower level of join.  To account for the latter, look up the
      sizes of the min_lefthand and min_righthand inputs of the current join,
      and clamp with those in the same way as for the base relations.
      
      Noted while investigating a complaint from Ben Chobot, although this in
      itself doesn't seem to explain his report.
      
      Back-patch to 8.4; previous versions used different estimation methods
      for which this heuristic isn't relevant.
      97930cf5