1. 16 Nov, 2013 3 commits
    • Tom Lane's avatar
      Allow aggregates to provide estimates of their transition state data size. · 6cb86143
      Tom Lane authored
      Formerly the planner had a hard-wired rule of thumb for guessing the amount
      of space consumed by an aggregate function's transition state data.  This
      estimate is critical to deciding whether it's OK to use hash aggregation,
      and in many situations the built-in estimate isn't very good.  This patch
      adds a column to pg_aggregate wherein a per-aggregate estimate can be
      provided, overriding the planner's default, and infrastructure for setting
      the column via CREATE AGGREGATE.
      
      It may be that additional smarts will be required in future, perhaps even
      a per-aggregate estimation function.  But this is already a step forward.
      
      This is extracted from a larger patch to improve the performance of numeric
      and int8 aggregates.  I (tgl) thought it was worth reviewing and committing
      this infrastructure separately.  In this commit, all built-in aggregates
      are given aggtransspace = 0, so no behavior should change.
      
      Hadi Moshayedi, reviewed by Pavel Stehule and Tomas Vondra
      6cb86143
    • Peter Eisentraut's avatar
      55c3d86a
    • Tom Lane's avatar
      Remove pgbench's hardwired limit on line length in custom script files. · 61a07bae
      Tom Lane authored
      pgbench formerly failed on lines longer than BUFSIZ, unexpectedly
      splitting them into multiple commands.  Allow it to work with any
      length of input line.
      
      Sawada Masahiko
      61a07bae
  2. 15 Nov, 2013 10 commits
    • Tom Lane's avatar
      Fix incorrect loop counts in tidbitmap.c. · f1f21b2d
      Tom Lane authored
      A couple of places that should have been iterating over WORDS_PER_CHUNK
      words were iterating over WORDS_PER_PAGE words instead.  This thinko
      accidentally failed to fail, because (at least on common architectures
      with default BLCKSZ) WORDS_PER_CHUNK is a bit less than WORDS_PER_PAGE,
      and the extra words being looked at were always zero so nothing happened.
      Still, it's a bug waiting to happen if anybody ever fools with the
      parameters affecting TIDBitmap sizes, and it's a small waste of cycles
      too.  So back-patch to all active branches.
      
      Etsuro Fujita
      f1f21b2d
    • Tom Lane's avatar
      Speed up printing of INSERT statements in pg_dump. · 97e1ec46
      Tom Lane authored
      In --inserts and especially --column-inserts mode, we can get a useful
      speedup by generating the common prefix of all a table's INSERT commands
      just once, and then printing the prebuilt string for each row.  This avoids
      multiple invocations of fmtId() and other minor fooling around.
      
      David Rowley
      97e1ec46
    • Tom Lane's avatar
      Clean up password prompting logic in streamutil.c. · 3172eea0
      Tom Lane authored
      The previous coding was fairly unreadable and drew double-free warnings
      from clang.  I believe the double free was actually not reachable, because
      PQconnectionNeedsPassword is coded to not return true if a password was
      provided, so that the loop can't iterate more than twice.  Nonetheless
      it seems worth rewriting.  No back-patch since this is just cosmetic.
      3172eea0
    • Tom Lane's avatar
      Compute correct em_nullable_relids in get_eclass_for_sort_expr(). · f3b3b8d5
      Tom Lane authored
      Bug #8591 from Claudio Freire demonstrates that get_eclass_for_sort_expr
      must be able to compute valid em_nullable_relids for any new equivalence
      class members it creates.  I'd worried about this in the commit message
      for db9f0e1d, but claimed that it wasn't a
      problem because multi-member ECs should already exist when it runs.  That
      is transparently wrong, though, because this function is also called by
      initialize_mergeclause_eclasses, which runs during deconstruct_jointree.
      The example given in the bug report (which the new regression test item
      is based upon) fails because the COALESCE() expression is first seen by
      initialize_mergeclause_eclasses rather than process_equivalence.
      
      Fixing this requires passing the appropriate nullable_relids set to
      get_eclass_for_sort_expr, and it requires new code to compute that set
      for top-level expressions such as ORDER BY, GROUP BY, etc.  We store
      the top-level nullable_relids in a new field in PlannerInfo to avoid
      computing it many times.  In the back branches, I've added the new
      field at the end of the struct to minimize ABI breakage for planner
      plugins.  There doesn't seem to be a good alternative to changing
      get_eclass_for_sort_expr's API signature, though.  There probably aren't
      any third-party extensions calling that function directly; moreover,
      if there are, they probably need to think about what to pass for
      nullable_relids anyway.
      
      Back-patch to 9.2, like the previous patch in this area.
      f3b3b8d5
    • Tom Lane's avatar
      Prevent leakage of cached plans and execution trees in plpgsql DO blocks. · c7b849a8
      Tom Lane authored
      plpgsql likes to cache query plans and simple-expression execution state
      trees across calls.  This is a considerable win for multiple executions
      of the same function.  However, it's useless for DO blocks, since by
      definition those are executed only once and discarded.  Nonetheless,
      we were allowing a DO block's expression execution trees to survive
      until end of transaction, resulting in a significant intra-transaction
      memory leak, as reported by Yeb Havinga.  Worse, if the DO block exited
      with an error, the compiled form of the block's code was leaked till
      end of session --- along with subsidiary plancache entries.
      
      To fix, make DO blocks keep their expression execution trees in a private
      EState that's deleted at exit from the block, and add a PG_TRY block
      to plpgsql_inline_handler to make sure that memory cleanup happens
      even on error exits.  Also add a regression test covering error handling
      in a DO block, because my first try at this broke that.  (The test is
      not meant to prove that we don't leak memory anymore, though it could
      be used for that with a much larger loop count.)
      
      Ideally we'd back-patch this into all versions supporting DO blocks;
      but the patch needs to add a field to struct PLpgSQL_execstate, and that
      would break ABI compatibility for third-party plugins such as the plpgsql
      debugger.  Given the small number of complaints so far, fixing this in
      HEAD only seems like an acceptable choice.
      c7b849a8
    • Tom Lane's avatar
      Minor comment corrections for sequence hashtable patch. · 80e3a470
      Tom Lane authored
      There were enough typos in the comments to annoy me ...
      80e3a470
    • Kevin Grittner's avatar
      Fix buffer overrun in isolation test program. · 7cb964ac
      Kevin Grittner authored
      Commit 061b88c7 saved argv0 to a
      global buffer without ensuring that it was zero terminated,
      allowing references to it to overrun the buffer and access other
      memory.  This probably would not have presented any security risk,
      but could have resulted in very confusing failures if the path to
      the executable was very long.
      
      Reported by David Rowley
      7cb964ac
    • Robert Haas's avatar
      doc: Restore proper alphabetical order. · 71dd54ad
      Robert Haas authored
      Colin 't Hart
      71dd54ad
    • Heikki Linnakangas's avatar
      Fix bogus hash table creation. · 5cb719be
      Heikki Linnakangas authored
      Andres Freund
      5cb719be
    • Heikki Linnakangas's avatar
      Use a hash table to store current sequence values. · 21025d4a
      Heikki Linnakangas authored
      This speeds up nextval() and currval(), when you touch a lot of different
      sequences in the same backend.
      
      David Rowley
      21025d4a
  3. 14 Nov, 2013 3 commits
  4. 13 Nov, 2013 8 commits
  5. 12 Nov, 2013 4 commits
  6. 11 Nov, 2013 4 commits
    • Tom Lane's avatar
      Fix failure with whole-row reference to a subquery. · ebefbb5f
      Tom Lane authored
      Simple oversight in commit 1cb108ef ---
      recursively examining a subquery output column is only sane if the
      original Var refers to a single output column.  Found by Kevin Grittner.
      ebefbb5f
    • Tom Lane's avatar
      Fix ruleutils pretty-printing to not generate trailing whitespace. · 0b7e660d
      Tom Lane authored
      The pretty-printing logic in ruleutils.c operates by inserting a newline
      and some indentation whitespace into strings that are already valid SQL.
      This naturally results in leaving some trailing whitespace before the
      newline in many cases; which can be annoying when processing the output
      with other tools, as complained of by Joe Abbate.  We can fix that in
      a pretty localized fashion by deleting any trailing whitespace before
      we append a pretty-printing newline.  In addition, we have to modify the
      code inserted by commit 2f582f76 so that
      we also delete trailing whitespace when transposing items from temporary
      buffers into the main result string, when a temporary item starts with a
      newline.
      
      This results in rather voluminous changes to the regression test results,
      but it's easily verified that they are only removal of trailing whitespace.
      
      Back-patch to 9.3, because the aforementioned commit resulted in many
      more cases of trailing whitespace than had occurred in earlier branches.
      0b7e660d
    • Tom Lane's avatar
      Re-allow duplicate aliases within aliased JOINs. · 648bd05b
      Tom Lane authored
      Although the SQL spec forbids duplicate table aliases, historically
      we've allowed queries like
          SELECT ... FROM tab1 x CROSS JOIN (tab2 x CROSS JOIN tab3 y) z
      on the grounds that the aliased join (z) hides the aliases within it,
      therefore there is no conflict between the two RTEs named "x".  The
      LATERAL patch broke this, on the misguided basis that "x" could be
      ambiguous if tab3 were a LATERAL subquery.  To avoid breaking existing
      queries, it's better to allow this situation and complain only if
      tab3 actually does contain an ambiguous reference.  We need only remove
      the check that was throwing an error, because the column lookup code
      is already prepared to handle ambiguous references.  Per bug #8444.
      648bd05b
    • Magnus Hagander's avatar
      Don't abort pg_basebackup when receiving empty WAL block · 705556a6
      Magnus Hagander authored
      This is a similar fix as c6ec8793aa59d1842082e14b4b4aae7d4bd883fd
      9.2. This should never happen in 9.3 and newer since the special case
      cannot happen there, but this patch synchronizes up the code so there
      is no confusion on why they're different. An empty block is as harmless
      in 9.3 as it was in 9.2, and can safely be ignored.
      705556a6
  7. 10 Nov, 2013 1 commit
  8. 09 Nov, 2013 1 commit
    • Robert Haas's avatar
      Fix ECPG compiler warning. · dca09ac5
      Robert Haas authored
      Commit 9b4d52f2 failed to notice
      that pg_regress_ecpg needed updating.
      
      This patch was independently submitted by both David Rowley
      and Andres Freund.
      dca09ac5
  9. 08 Nov, 2013 6 commits
    • Heikki Linnakangas's avatar
      Fix race condition in GIN posting tree page deletion. · ac4ab97e
      Heikki Linnakangas authored
      If a page is deleted, and reused for something else, just as a search is
      following a rightlink to it from its left sibling, the search would continue
      scanning whatever the new contents of the page are. That could lead to
      incorrect query results, or even something more curious if the page is
      reused for a different kind of a page.
      
      To fix, modify the search algorithm to lock the next page before releasing
      the previous one, and refrain from deleting pages from the leftmost branch
      of the tree.
      
      Add a new Concurrency section to the README, explaining why this works.
      There is a lot more one could say about concurrency in GIN, but that's for
      another patch.
      
      Backpatch to all supported versions.
      ac4ab97e
    • Robert Haas's avatar
      doc: Clarify under what circumstances pg_dump needs superuser access. · 636b868f
      Robert Haas authored
      Inspired by, but different from, a patch from Ivan Lezhnjov IV
      636b868f
    • Robert Haas's avatar
      Fix pg_isolation_regress to work outside its build directory. · 9b4d52f2
      Robert Haas authored
      This makes it possible to, for example, use the isolation tester to
      test a contrib module.
      
      Andres Freund
      9b4d52f2
    • Robert Haas's avatar
      Add the notion of REPLICA IDENTITY for a table. · 07cacba9
      Robert Haas authored
      Pending patches for logical replication will use this to determine
      which columns of a tuple ought to be considered as its candidate key.
      
      Andres Freund, with minor, mostly cosmetic adjustments by me
      07cacba9
    • Tom Lane's avatar
      Make contain_volatile_functions/contain_mutable_functions look into SubLinks. · b97ee66c
      Tom Lane authored
      This change prevents us from doing inappropriate subquery flattening in
      cases such as dangerous functions hidden inside a sub-SELECT in the
      targetlist of another sub-SELECT.  That could result in unexpected behavior
      due to multiple evaluations of a volatile function, as in a recent
      complaint from Etienne Dube.  It's been questionable from the very
      beginning whether these functions should look into subqueries (as noted in
      their comments), and this case seems to provide proof that they should.
      
      Because the new code only descends into SubLinks, not SubPlans or
      InitPlans, the change only affects the planner's behavior during
      prepjointree processing and not later on --- for example, you can still get
      it to use a volatile function in an indexqual if you wrap the function in
      (SELECT ...).  That's a historical behavior, for sure, but it's reasonable
      given that the executor's evaluation rules for subplans don't depend on
      whether there are volatile functions inside them.  In any case, we need to
      constrain the behavioral change as narrowly as we can to make this
      reasonable to back-patch.
      b97ee66c
    • Tom Lane's avatar
      Fix subtly-wrong volatility checking in BeginCopyFrom(). · 060b22a9
      Tom Lane authored
      contain_volatile_functions() is best applied to the output of
      expression_planner(), not its input, so that insertion of function
      default arguments and constant-folding have been done.  (See comments
      at CheckMutability, for instance.)  It's perhaps unlikely that anyone
      will notice a difference in practice, but still we should do it properly.
      
      In passing, change variable type from Node* to Expr* to reduce the net
      number of casts needed.
      
      Noted while perusing uses of contain_volatile_functions().
      060b22a9