1. 07 Jun, 2016 8 commits
  2. 06 Jun, 2016 6 commits
    • Tom Lane's avatar
      Don't reset changes_since_analyze after a selective-columns ANALYZE. · f64340e7
      Tom Lane authored
      If we ANALYZE only selected columns of a table, we should not postpone
      auto-analyze because of that; other columns may well still need stats
      updates.  As committed, the counter is left alone if a column list is
      given, whether or not it includes all analyzable columns of the table.
      Per complaint from Tomasz Ostrowski.
      
      It's been like this a long time, so back-patch to all supported branches.
      
      Report: <ef99c1bd-ff60-5f32-2733-c7b504eb960c@ato.waw.pl>
      f64340e7
    • Robert Haas's avatar
      Stop the executor if no more tuples can be sent from worker to leader. · c6dbf1fe
      Robert Haas authored
      If a Gather node has read as many tuples as it needs (for example, due
      to Limit) it may detach the queue connecting it to the worker before
      reading all of the worker's tuples.  Rather than let the worker
      continue to generate and send all of the results, have it stop after
      sending the next tuple.
      
      More could be done here to stop the worker even quicker, but this is
      about as well as we can hope to do for 9.6.
      
      This is in response to a problem report from Andreas Seltenreich.
      Commit 44339b89 should be actually be
      sufficient to fix that example even without this change, but it seems
      better to do this, too, since we might otherwise waste quite a large
      amount of effort in one or more workers.
      
      Discussion: CAA4eK1KOKGqmz9bGu+Z42qhRwMbm4R5rfnqsLCNqFs9j14jzEA@mail.gmail.com
      
      Amit Kapila
      c6dbf1fe
    • Robert Haas's avatar
      shm_mq: After a send fails with SHM_MQ_DETACHED, later ones should too. · 44339b89
      Robert Haas authored
      Prior to this patch, it was occasionally possible, after shm_mq_sendv
      had previously returned SHM_MQ_DETACHED, for a later shm_mq_sendv
      operation to fail an assertion instead of just again returning
      SHM_MQ_ATTACHED.  From the shm_mq code's point of view, it was
      expecting to be called again with the same arguments, since the
      previous operation had only partially completed.  However, a caller
      who isn't using non-blocking mode won't be prepared to repeat the call
      with the same arguments, and this code shouldn't expect that they
      will.  Repair in such a way that we'll be OK whether the next call
      uses the same arguments or not.
      
      Found by Andreas Seltenreich.  Analysis and sketch of fix by Amit
      Kapila.  Patch by me, reviewed by Amit Kapila.
      44339b89
    • Robert Haas's avatar
      pg_upgrade: Don't overwrite existing files. · e191a690
      Robert Haas authored
      For historical reasons, copyFile and rewriteVisibilityMap took a force
      argument which was always passed as true, meaning that any existing
      file should be overwritten.  However, it seems much safer to instead
      fail if a file we need to write already exists.
      
      While we're at it, remove the "force" argument altogether, since it was
      never passed as anything other than true (and now we would never pass
      it as anything other than false, if we kept it).
      
      Noted by Andres Freund during post-commit review of the patch that added
      rewriteVisibilityMap, commit 7087166a,
      but this also changes the behavior when copying files without rewriting
      them.
      
      Patch by Masahiko Sawada.
      e191a690
    • Robert Haas's avatar
      Fix typo. · 932b97a0
      Robert Haas authored
      Jim Nasby
      932b97a0
    • Robert Haas's avatar
      pg_upgrade: Improve error checking in rewriteVisibilityMap. · aba89430
      Robert Haas authored
      In the old logic, if read() were to return an error, we'd silently stop
      rewriting the visibility map at that point in the file.  That's safe,
      but reporting the error is better, so do that instead.
      
      Report by Andres Freund.  Patch by Masahiko Sawada, with one correction
      by me.
      aba89430
  3. 05 Jun, 2016 2 commits
    • Peter Eisentraut's avatar
      Fix whitespace · 6201a8ef
      Peter Eisentraut authored
      6201a8ef
    • Tom Lane's avatar
      Properly initialize SortSupport for ORDER BY rechecks in nodeIndexscan.c. · 8a859691
      Tom Lane authored
      Fix still another bug in commit 35fcb1b3: it failed to fully initialize
      the SortSupport states it introduced to allow the executor to re-check
      ORDER BY expressions containing distance operators.  That led to a null
      pointer dereference if the sortsupport code tried to use ssup_cxt.  The
      problem only manifests in narrow cases, explaining the lack of previous
      field reports.  It requires a GiST-indexable distance operator that lacks
      SortSupport and is on a pass-by-ref data type, which among core+contrib
      seems to be only btree_gist's interval opclass; and it requires the scan
      to be done as an IndexScan not an IndexOnlyScan, which explains how
      btree_gist's regression test didn't catch it.  Per bug #14134 from
      Jihyun Yu.
      
      Peter Geoghegan
      
      Report: <20160511154904.2603.43889@wrigleys.postgresql.org>
      8a859691
  4. 04 Jun, 2016 1 commit
  5. 03 Jun, 2016 12 commits
    • Tom Lane's avatar
      Fix grammar's AND/OR flattening to work with operator_precedence_warning. · 05104f69
      Tom Lane authored
      It'd be good for "(x AND y) AND z" to produce a three-child AND node
      whether or not operator_precedence_warning is on, but that failed to
      happen when it's on because makeAndExpr() didn't look through the added
      AEXPR_PAREN node.  This has no effect on generated plans because prepqual.c
      would flatten the AND nest anyway; but it does affect the number of parens
      printed in ruleutils.c, for example.  I'd already fixed some similar
      hazards in parse_expr.c in commit abb16465, but didn't think to search
      gram.y for problems of this ilk.  Per gripe from Jean-Pierre Pelletier.
      
      Report: <fa0535ec6d6428cfec40c7e8a6d11156@mail.gmail.com>
      05104f69
    • Tom Lane's avatar
      Inline the easy cases in MakeExpandedObjectReadOnly(). · d50183c5
      Tom Lane authored
      This attempts to buy back some of whatever performance we lost from fixing
      bug #14174 by inlining the initial checks in MakeExpandedObjectReadOnly()
      into the callers.  We can do that in a macro without creating multiple-
      evaluation hazards, so it's pretty much free notationally; and the amount
      of code added to callers should be minimal as well.  (Testing a value can't
      take many more instructions than passing it to a subroutine.)
      
      Might as well inline DatumIsReadWriteExpandedObject() while we're at it.
      
      This is an ABI break for callers, so it doesn't seem safe to put into 9.5,
      but I see no reason not to do it in HEAD.
      d50183c5
    • Tom Lane's avatar
      Mark read/write expanded values as read-only in ValuesNext(), too. · 9eaf5be5
      Tom Lane authored
      Further thought about bug #14174 motivated me to try the case of a
      R/W datum being returned from a VALUES list, and sure enough it was
      broken.  Fix that.
      
      Also add a regression test case exercising the same scenario for
      FunctionScan.  That's not broken right now, because the function's
      result will get shoved into a tuplestore between generation and use;
      but it could easily become broken whenever we get around to optimizing
      FunctionScan better.
      
      There don't seem to be any other places where we put the result of
      expression evaluation into a virtual tuple slot that could then be
      the source for Vars of further expression evaluation, so I think
      this is the end of this bug.
      9eaf5be5
    • Tom Lane's avatar
      Mark read/write expanded values as read-only in ExecProject(). · 69f526aa
      Tom Lane authored
      If a plan node output expression returns an "expanded" datum, and that
      output column is referenced in more than one place in upper-level plan
      nodes, we need to ensure that what is returned is a read-only reference
      not a read/write reference.  Otherwise one of the referencing sites could
      scribble on or even delete the expanded datum before we have evaluated the
      others.  Commit 1dc5ebc9, which introduced this feature, supposed
      that it'd be sufficient to make SubqueryScan nodes force their output
      columns to read-only state.  The folly of that was revealed by bug #14174
      from Andrew Gierth, and really should have been immediately obvious
      considering that the planner will happily optimize SubqueryScan nodes
      out of the plan without any regard for this issue.
      
      The safest fix seems to be to make ExecProject() force its results into
      read-only state; that will cover every case where a plan node returns
      expression results.  Actually we can delegate this to ExecTargetList()
      since we can recursively assume that plain Vars will not reference
      read-write datums.  That should keep the extra overhead down to something
      minimal.  We no longer need ExecMakeSlotContentsReadOnly(), which was
      introduced only in support of the idea that just a few plan node types
      would need to do this.
      
      In the future it would be nice to have the planner account for this problem
      and inject force-to-read-only expression evaluation nodes into only the
      places where there's a risk of multiple evaluation.  That's not a suitable
      solution for 9.5 or even 9.6 at this point, though.
      
      Report: <20160603124628.9932.41279@wrigleys.postgresql.org>
      69f526aa
    • Robert Haas's avatar
      Remove bogus code to apply PathTargets to partial paths. · 04ae11f6
      Robert Haas authored
      The partial paths that get modified may already have been used as
      part of a GatherPath which appears in the path list, so modifying
      them is not a good idea at this stage - especially because this
      code has no check that the PathTarget is in fact parallel-safe.
      
      When partial aggregation is being performed, this is actually
      harmless because we'll end up replacing the pathtargets here with
      the correct ones within create_grouping_paths().  But if we've got
      a query tree containing only scan/join operations then this can
      result in incorrectly pushing down parallel-restricted target
      list entries.  If those are, for example, references to subqueries,
      that can crash the server; but it's wrong in any event.
      
      Amit Kapila
      04ae11f6
    • Robert Haas's avatar
      Mark PostmasterPid as PGDLLIMPORT. · cac83219
      Robert Haas authored
      This is so that extensions can use it.
      
      Michael Paquier
      cac83219
    • Kevin Grittner's avatar
      Add new snapshot fields to serialize/deserialize functions. · 370a46fc
      Kevin Grittner authored
      The "snapshot too old" condition was not being recognized when
      using a copied snapshot, since the original timestamp and lsn were
      not being passed along.  Noticed when testing the combination of
      "snapshot too old" with parallel query execution.
      370a46fc
    • Robert Haas's avatar
      Fix comment to be more accurate. · 6436a853
      Robert Haas authored
      Now that we skip vacuuming all-frozen pages, this comment needs
      updating.
      
      Masahiko Sawada
      6436a853
    • Tom Lane's avatar
      Suppress -Wunused-result warnings about write(), again. · 6c72a28e
      Tom Lane authored
      Adopt the same solution as in commit aa90e148, but this time
      let's put the ugliness inside the write_stderr() macro, instead of
      expecting each call site to deal with it.  Back-port that decision
      into psql/common.c where I got the macro from in the first place.
      
      Per gripe from Peter Eisentraut.
      6c72a28e
    • Greg Stark's avatar
      Fix various common mispellings. · e1623c39
      Greg Stark authored
      Mostly these are just comments but there are a few in documentation
      and a handful in code and tests. Hopefully this doesn't cause too much
      unnecessary pain for backpatching. I relented from some of the most
      common like "thru" for that reason. The rest don't seem numerous
      enough to cause problems.
      
      Thanks to Kevin Lyda's tool https://pypi.python.org/pypi/misspellings
      e1623c39
    • Tom Lane's avatar
      Measure Bloom index signature-length reloption in bits, not words. · ee4af347
      Tom Lane authored
      Per discussion, this is a more understandable and future-proof way of
      exposing the setting to users.  On-disk, we can still store it in words,
      so as to not break on-disk compatibility with beta1.
      
      Along the way, clean up the code associated with Bloom reloptions.
      Provide explicit macros for default and maximum lengths rather than
      having magic numbers buried in multiple places in the code.  Drop
      the adjustBloomOptions() code altogether: it was useless in view of
      the fact that reloptions.c already performed default-substitution and
      range checking for the options.  Rename a couple of macros and types
      for more clarity.
      
      Discussion: <23767.1464926580@sss.pgh.pa.us>
      ee4af347
    • Robert Haas's avatar
      Cosmetic improvements to freeze map code. · fdfaccfa
      Robert Haas authored
      Per post-commit review comments from Andres Freund, improve variable
      names, comments, and in one place, slightly improve the code structure.
      
      Masahiko Sawada
      fdfaccfa
  6. 02 Jun, 2016 4 commits
    • Greg Stark's avatar
      Be conservative about alignment requirements of struct epoll_event. · a3b30763
      Greg Stark authored
      Use MAXALIGN size/alignment to guarantee that later uses of memory are
      aligned correctly. E.g. epoll_event might need 8 byte alignment on some
      platforms, but earlier allocations like WaitEventSet and WaitEvent might
      not sized to guarantee that when purely using sizeof().
      
      Found by myself while testing on an Sun Ultra 5 (Sparc IIi) with some
      editorializing by Andres Freund.
      
      In passing fix a couple typos in the area
      a3b30763
    • Kevin Grittner's avatar
      C comment improvement & typo fix. · 4edb7bd2
      Kevin Grittner authored
      Thomas Munro
      4edb7bd2
    • Tom Lane's avatar
      Redesign handling of SIGTERM/control-C in parallel pg_dump/pg_restore. · e652273e
      Tom Lane authored
      Formerly, Unix builds of pg_dump/pg_restore would trap SIGINT and similar
      signals and set a flag that was tested in various data-transfer loops.
      This was prone to errors of omission (cf commit 3c8aa665); and even if
      the client-side response was prompt, we did nothing that would cause
      long-running SQL commands (e.g. CREATE INDEX) to terminate early.
      Also, the master process would effectively do nothing at all upon receipt
      of SIGINT; the only reason it seemed to work was that in typical scenarios
      the signal would also be delivered to the child processes.  We should
      support termination when a signal is delivered only to the master process,
      though.
      
      Windows builds had no console interrupt handler, so they would just fall
      over immediately at control-C, again leaving long-running SQL commands to
      finish unmolested.
      
      To fix, remove the flag-checking approach altogether.  Instead, allow the
      Unix signal handler to send a cancel request directly and then exit(1).
      In the master process, also have it forward the signal to the children.
      On Windows, add a console interrupt handler that behaves approximately
      the same.  The main difference is that a single execution of the Windows
      handler can send all the cancel requests since all the info is available
      in one process, whereas on Unix each process sends a cancel only for its
      own database connection.
      
      In passing, fix an old problem that DisconnectDatabase tends to send a
      cancel request before exiting a parallel worker, even if nothing went
      wrong.  This is at least a waste of cycles, and could lead to unexpected
      log messages, or maybe even data loss if it happened in pg_restore (though
      in the current code the problem seems to affect only pg_dump).  The cause
      was that after a COPY step, pg_dump was leaving libpq in PGASYNC_BUSY
      state, causing PQtransactionStatus() to report PQTRANS_ACTIVE.  That's
      normally harmless because the next PQexec() will silently clear the
      PGASYNC_BUSY state; but in a parallel worker we might exit without any
      additional SQL commands after a COPY step.  So add an extra PQgetResult()
      call after a COPY to allow libpq to return to PGASYNC_IDLE state.
      
      This is a bug fix, IMO, so back-patch to 9.3 where parallel dump/restore
      were introduced.
      
      Thanks to Kyotaro Horiguchi for Windows testing and code suggestions.
      
      Original-Patch: <7005.1464657274@sss.pgh.pa.us>
      Discussion: <20160602.174941.256342236.horiguchi.kyotaro@lab.ntt.co.jp>
      e652273e
    • Kevin Grittner's avatar
      Fix btree mark/restore bug. · 7392eed7
      Kevin Grittner authored
      Commit 2ed5b87f introduced a bug in
      mark/restore, in an attempt to optimize repeated restores to the
      same page.  This caused an assertion failure during a merge join
      which fed directly from an index scan, although the impact would
      not be limited to that case.  Revert the bad chunk of code from
      that commit.
      
      While investigating this bug it was discovered that a particular
      "paranoia" set of the mark position field would not prevent bad
      behavior; it would just make it harder to diagnose.  Change that
      into an assertion, which will draw attention to any future problem
      in that area more directly.
      
      Backpatch to 9.5, where the bug was introduced.
      
      Bug #14169 reported by Shinta Koyanagi.
      Preliminary analysis by Tom Lane identified which commit caused
      the bug.
      7392eed7
  7. 01 Jun, 2016 1 commit
    • Tom Lane's avatar
      Clean up some minor inefficiencies in parallel dump/restore. · 763eec6b
      Tom Lane authored
      Parallel dump did a totally pointless query to find out the name of each
      table to be dumped, which it already knows.  Parallel restore runs issued
      lots of redundant SET commands because _doSetFixedOutputState() was invoked
      once per TOC item rather than just once at connection start.  While the
      extra queries are insignificant if you're dumping or restoring large
      tables, it still seems worth getting rid of them.
      
      Also, give the responsibility for selecting the right client_encoding for
      a parallel dump worker to setup_connection() where it naturally belongs,
      instead of having ad-hoc code for that in CloneArchive().  And fix some
      minor bugs like use of strdup() where pg_strdup() would be safer.
      
      Back-patch to 9.3, mostly to keep the branches in sync in an area that
      we're still finding bugs in.
      
      Discussion: <5086.1464793073@sss.pgh.pa.us>
      763eec6b
  8. 31 May, 2016 5 commits
    • Peter Eisentraut's avatar
      doc: Update version() and current_date output in tutorial · 9ee56dfe
      Peter Eisentraut authored
      While the version number is automatically updated in the example output,
      the other details looked a bit dated.
      
      suggested by mike2.schneider@gmail.com
      9ee56dfe
    • Tom Lane's avatar
      Avoid useless closely-spaced writes of statistics files. · 22b27b4c
      Tom Lane authored
      The original intent in the stats collector was that we should not write out
      stats data oftener than every PGSTAT_STAT_INTERVAL msec.  Backends will not
      make requests at all if they see the existing data is newer than that, and
      the stats collector is supposed to disregard requests having a cutoff_time
      older than its most recently written data, so that close-together requests
      don't result in multiple writes.  But the latter part of that got broken
      in commit 187492b6, so that if two backends concurrently decide
      the existing stats are too old, the collector would write the data twice.
      (In principle the collector's logic would still merge requests as long as
      the second one arrives before we've actually written data ... but since
      the message collection loop would write data immediately after processing
      a single inquiry message, that never happened in practice, and in any case
      the window in which it might work would be much shorter than
      PGSTAT_STAT_INTERVAL.)
      
      To fix, improve pgstat_recv_inquiry so that it checks whether the cutoff
      time is too old, and doesn't add a request to the queue if so.  This means
      that we do not need DBWriteRequest.request_time, because the decision is
      taken before making a queue entry.  And that means that we don't really
      need the DBWriteRequest data structure at all; an OID list of database
      OIDs will serve and allow removal of some rather verbose and crufty code.
      
      In passing, improve the comments in this area, which have been rather
      neglected.  Also change backend_read_statsfile so that it's not silently
      relying on MyDatabaseId to have some particular value in the autovacuum
      launcher process.  It accidentally worked as desired because MyDatabaseId
      is zero in that process; but that does not seem like a dependency we want,
      especially with no documentation about it.
      
      Although this patch is mine, it turns out I'd rediscovered a known bug,
      for which Tomas Vondra had already submitted a patch that's functionally
      equivalent to the non-cosmetic aspects of this patch.  Thanks to Tomas
      for reviewing this version.
      
      Back-patch to 9.3 where the bug was introduced.
      
      Prior-Discussion: <1718942738eb65c8407fcd864883f4c8@fuzzy.cz>
      Patch: <4625.1464202586@sss.pgh.pa.us>
      22b27b4c
    • Peter Eisentraut's avatar
      Fix whitespace · aa14bc41
      Peter Eisentraut authored
      aa14bc41
    • Tom Lane's avatar
      Fix typo in CREATE DATABASE syntax synopsis. · 6d69ea33
      Tom Lane authored
      Misplaced "]", evidently a thinko in commit 213335c1.
      6d69ea33
    • Noah Misch's avatar
      Mirror struct Aggref field order in _copyAggref(). · 2195c5af
      Noah Misch authored
      This is cosmetic, and no supported release has the affected fields.
      2195c5af
  9. 30 May, 2016 1 commit