1. 28 Nov, 2016 1 commit
    • Tom Lane's avatar
      Code review for early drop of orphaned temp relations in autovacuum. · dafa0848
      Tom Lane authored
      Commit a734fd5d exposed some race conditions that existed previously
      in the autovac code, but were basically harmless because autovac would
      not try to delete orphaned relations immediately.  Specifically, the test
      for orphaned-ness was made on a pg_class tuple that might be dead by now,
      allowing autovac to try to remove a table that the owning backend had just
      finished deleting.  This resulted in a hard crash due to inadequate caution
      about accessing the table's catalog entries without any lock.  We must take
      a relation lock and then recheck whether the table is still present and
      still looks deletable before we do anything.
      
      Also, it seemed to me that deleting multiple tables per transaction, and
      trying to continue after errors, represented unjustifiable complexity.
      We do not expect this code path to be taken often in the field, nor even
      during testing, which means that prioritizing performance over correctness
      is a bad tradeoff.  Rip all that out in favor of just starting a new
      transaction after each successful temp table deletion.  If we're unlucky
      enough to get an error, which shouldn't happen anyway now that we're being
      more cautious, let the autovacuum worker fail as it normally would.
      
      In passing, improve the order of operations in the initial scan loop.
      Now that we don't care about whether a temp table is a wraparound hazard,
      there's no need to perform extract_autovac_opts, get_pgstat_tabentry_relid,
      or relation_needs_vacanalyze for temp tables.
      
      Also, if GetTempNamespaceBackendId returns InvalidBackendId (indicating
      it doesn't recognize the schema as temp), treat that as meaning it's NOT
      an orphaned temp table, not that it IS one, which is what happened before
      because BackendIdGetProc necessarily failed.  The case really shouldn't
      come up for a table that has RELPERSISTENCE_TEMP, but the consequences
      if it did seem undesirable.  (This might represent a back-patchable bug
      fix; not sure if it's worth the trouble.)
      
      Discussion: https://postgr.es/m/21299.1480272347@sss.pgh.pa.us
      dafa0848
  2. 27 Nov, 2016 1 commit
  3. 26 Nov, 2016 2 commits
    • Tom Lane's avatar
      Fix test about ignoring extension dependencies during extension scripts. · 182db070
      Tom Lane authored
      Commit 08dd23ce introduced an exception to the rule that extension member
      objects can only be dropped as part of dropping the whole extension,
      intending to allow such drops while running the extension's own creation or
      update scripts.  However, the exception was only applied at the outermost
      recursion level, because it was modeled on a pre-existing check to ignore
      dependencies on objects listed in pendingObjects.  Bug #14434 from Philippe
      Beaudoin shows that this is inadequate: in some cases we can reach an
      extension member object by recursion from another one.  (The bug concerns
      the serial-sequence case; I'm not sure if there are other cases, but there
      might well be.)
      
      To fix, revert 08dd23ce's changes to findDependentObjects() and instead
      apply the creating_extension exception regardless of stack level.
      
      Having seen this example, I'm a bit suspicious that the pendingObjects
      logic is also wrong and such cases should likewise be allowed at any
      recursion level.  However, changing that would interact in subtle ways
      with the recursion logic (at least it would need to be moved to after the
      recursing-from check).  Given that the code's been like that a long time,
      I'll refrain from touching it without a clear example showing it's wrong.
      
      Back-patch to all active branches.  In HEAD and 9.6, where suitable
      test infrastructure exists, add a regression test case based on the
      bug report.
      
      Report: <20161125151448.6529.33039@wrigleys.postgresql.org>
      Discussion: <13224.1480177514@sss.pgh.pa.us>
      182db070
    • Robert Haas's avatar
      Mark IsPostmasterEnvironment and IsBackgroundWorker as PGDLLIMPORT. · 27327059
      Robert Haas authored
      Per request from Craig Ringer.
      27327059
  4. 25 Nov, 2016 4 commits
    • Tom Lane's avatar
      Bring some clarity to the defaults for the xxx_flush_after parameters. · dbdfd114
      Tom Lane authored
      Instead of confusingly stating platform-dependent defaults for these
      parameters in the comments in postgresql.conf.sample (with the main
      entry being a lie on Linux), teach initdb to install the correct
      platform-dependent value in postgresql.conf, similarly to the way
      we handle other platform-dependent defaults.  This won't do anything
      for existing 9.6 installations, but since it's effectively only a
      documentation improvement, that seems OK.
      
      Since this requires initdb to have access to the default values,
      move the #define's for those to pg_config_manual.h; the original
      placement in bufmgr.h is unworkable because that file can't be
      included by frontend programs.
      
      Adjust the default value for wal_writer_flush_after so that it is 1MB
      regardless of XLOG_BLCKSZ, conforming to what is stated in both the
      SGML docs and postgresql.conf.  (We could alternatively make it scale
      with XLOG_BLCKSZ, but I'm not sure I see the point.)
      
      Copy-edit related SGML documentation.
      
      Fabien Coelho and Tom Lane, per a gripe from Tomas Vondra.
      
      Discussion: <30ebc6e3-8358-09cf-44a8-578252938424@2ndquadrant.com>
      dbdfd114
    • Tom Lane's avatar
      Mark a query's topmost Paths parallel-unsafe if they will have initPlans. · ab77a5a4
      Tom Lane authored
      Andreas Seltenreich found another case where we were being too optimistic
      about allowing a plan to be considered parallelizable despite it containing
      initPlans.  It seems like the real issue here is that if we know we are
      going to tack initPlans onto the topmost Plan node for a subquery, we
      had better mark that subquery's result Paths as not-parallel-safe.  That
      fixes this problem and allows reversion of a kluge (added in commit
      7b67a0a4 and extended in f24cf960) to not trust the parallel_safe flag
      at top level.
      
      Discussion: <874m2w4k5d.fsf@ex.ansel.ydns.eu>
      ab77a5a4
    • Tom Lane's avatar
      Check for pending trigger events on far end when dropping an FK constraint. · 4e026b32
      Tom Lane authored
      When dropping a foreign key constraint with ALTER TABLE DROP CONSTRAINT,
      we refuse the drop if there are any pending trigger events on the named
      table; this ensures that we won't remove the pg_trigger row that will be
      consulted by those events.  But we should make the same check for the
      referenced relation, else we might remove a due-to-be-referenced pg_trigger
      row for that relation too, resulting in "could not find trigger NNN" or
      "relation NNN has no triggers" errors at commit.  Per bug #14431 from
      Benjie Gillam.  Back-patch to all supported branches.
      
      Report: <20161124114911.6530.31200@wrigleys.postgresql.org>
      4e026b32
    • Magnus Hagander's avatar
      Fix typo in comment · 8afb8110
      Magnus Hagander authored
      Thomas Munro
      8afb8110
  5. 24 Nov, 2016 2 commits
  6. 23 Nov, 2016 3 commits
    • Tom Lane's avatar
      Avoid masking a function parameter name with a local variable name. · 6fa391be
      Tom Lane authored
      No actual bug here, but it might confuse readers, so change the name
      of the local variable.
      
      Ashutosh Bapat
      6fa391be
    • Tom Lane's avatar
      Make sure ALTER TABLE preserves index tablespaces. · bd673e8e
      Tom Lane authored
      When rebuilding an existing index, ALTER TABLE correctly kept the
      physical file in the same tablespace, but it messed up the pg_class
      entry if the index had been in the database's default tablespace
      and "default_tablespace" was set to some non-default tablespace.
      This led to an inaccessible index.
      
      Fix by fixing pg_get_indexdef_string() to always include a tablespace
      clause, whether or not the index is in the default tablespace.  The
      previous behavior was installed in commit 537e92e4, and I think it just
      wasn't thought through very clearly; certainly the possible effect of
      default_tablespace wasn't considered.  There's some risk in changing the
      behavior of this function, but there are no other call sites in the core
      code.  Even if it's being used by some third party extension, it's fairly
      hard to envision a usage that is okay with a tablespace clause being
      appended some of the time but can't handle it being appended all the time.
      
      Back-patch to all supported versions.
      
      Code fix by me, investigation and test cases by Michael Paquier.
      
      Discussion: <1479294998857-5930602.post@n3.nabble.com>
      bd673e8e
    • Robert Haas's avatar
      Remove barrier.h · e343dfa4
      Robert Haas authored
      A new thing also called a "barrier" is proposed, but whether we decide
      to take that patch or not, this file seems to have outlived its
      usefulness.
      
      Thomas Munro
      e343dfa4
  7. 22 Nov, 2016 7 commits
    • Tom Lane's avatar
      Doc: improve documentation about composite-value usage. · e1320266
      Tom Lane authored
      Create a section specifically for the syntactic rules around whole-row
      variable usage, such as expansion of "foo.*".  This was previously
      documented only haphazardly, with some critical info buried in
      unexpected places like xfunc-sql-composite-functions.  Per repeated
      questions in different mailing lists.
      
      Discussion: <16288.1479610770@sss.pgh.pa.us>
      e1320266
    • Robert Haas's avatar
      Code review for commit 274bb2b3. · 9a1d0af4
      Robert Haas authored
      Avoid memory leak in conninfo_uri_parse_options.  Use the current host
      rather than the comma-separated list of host names when the host name
      is needed for GSS, SSPI, or SSL authentication.  Document the way
      connect_timeout interacts with multiple host specifications.
      
      Takayuki Tsunakawa
      9a1d0af4
    • Tom Lane's avatar
      Improve handling of "UPDATE ... SET (column_list) = row_constructor". · 906bfcad
      Tom Lane authored
      Previously, the right-hand side of a multiple-column assignment, if it
      wasn't a sub-SELECT, had to be a simple parenthesized expression list,
      because gram.y was responsible for "bursting" the construct into
      independent column assignments.  This had the minor defect that you
      couldn't write ROW (though you should be able to, since the standard says
      this is a row constructor), and the rather larger defect that unlike other
      uses of row constructors, we would not expand a "foo.*" item into multiple
      columns.
      
      Fix that by changing the RHS to be just "a_expr" in the grammar, leaving
      it to transformMultiAssignRef to separate the elements of a RowExpr;
      which it will do only after performing standard transformation of the
      RowExpr, so that "foo.*" behaves as expected.
      
      The key reason we didn't do that before was the hard-wired handling of
      DEFAULT tokens (SetToDefault nodes).  This patch deals with that issue by
      allowing DEFAULT in any a_expr and having parse analysis throw an error
      if SetToDefault is found in an unexpected place.  That's an improvement
      anyway since the error can be more specific than just "syntax error".
      
      The SQL standard suggests that the RHS could be any a_expr yielding a
      suitable row value.  This patch doesn't really move the goal posts in that
      respect --- you're still limited to RowExpr or a sub-SELECT --- but it does
      fix the grammar restriction, so it provides some tangible progress towards
      a full implementation.  And the limitation is now documented by an explicit
      error message rather than an unhelpful "syntax error".
      
      Discussion: <8542.1479742008@sss.pgh.pa.us>
      906bfcad
    • Robert Haas's avatar
      Support condition variables. · e8ac886c
      Robert Haas authored
      Condition variables provide a flexible way to sleep until a
      cooperating process causes an arbitrary condition to become true.  In
      simple cases, this can be accomplished with a WaitLatch/ResetLatch
      loop; the cooperating process can call SetLatch after performing work
      that might cause the condition to be satisfied, and the waiting
      process can recheck the condition each time.  However, if the process
      performing the work doesn't have an easy way to identify which
      processes might be waiting, this doesn't work, because it can't
      identify which latches to set.  Condition variables solve that problem
      by internally maintaining a list of waiters; a process that may have
      caused some waiter's condition to be satisfied must "signal" or
      "broadcast" on the condition variable.
      
      Robert Haas and Thomas Munro
      e8ac886c
    • Tom Lane's avatar
      Doc: add a section in Part II concerning RETURNING. · 1c7861e8
      Tom Lane authored
      There are assorted references to RETURNING in Part II, but nothing
      that would qualify as an explanation of the feature, which seems
      like an oversight considering how useful it is.  Add something.
      
      Noted while looking for a place to point a cross-reference to ...
      1c7861e8
    • Tom Lane's avatar
      Make contrib/test_decoding regression tests safe for CZ locale. · e2a0ee69
      Tom Lane authored
      A little COLLATE "C" goes a long way.
      
      Pavel Stehule, per suggestion from Craig Ringer
      
      Discussion: <CAFj8pRA8nJZcozgxN=RMSqMmKuHVOkcGAAKPKdFeiMWGDSUDLA@mail.gmail.com>
      e2a0ee69
    • Tom Lane's avatar
      Fix uninitialized variable. · ae92a9a3
      Tom Lane authored
      Oversight in a734fd5d.
      
      Michael Paquier
      ae92a9a3
  8. 21 Nov, 2016 4 commits
    • Tom Lane's avatar
      Fix PGLC_localeconv() to handle errors better. · a4930e7c
      Tom Lane authored
      The code was intentionally not very careful about leaking strdup'd
      strings in case of an error.  That was forgivable probably, but it
      also failed to notice strdup() failures, which could lead to subsequent
      null-pointer-dereference crashes, since many callers unsurprisingly
      didn't check for null pointers in the struct lconv fields.  An even
      worse problem is that it could throw error while we were setlocale'd
      to a non-C locale, causing unwanted behavior in subsequent libc calls.
      
      Rewrite to ensure that we cannot throw elog(ERROR) until after we've
      restored the previous locale settings, or at least attempted to.
      (I'm sorely tempted to make restore failure be a FATAL error, but
      will refrain for the moment.)  Having done that, it's not much more
      work to ensure that we clean up strdup'd storage on the way out, too.
      
      This code is substantially the same in all supported branches, so
      back-patch all the way.
      
      Michael Paquier and Tom Lane
      
      Discussion: <CAB7nPqRMbGqa_mesopcn4MPyTs34eqtVEK7ELYxvvV=oqS00YA@mail.gmail.com>
      a4930e7c
    • Tom Lane's avatar
      Fix optimization for skipping searches for parallel-query hazards. · 4324ade9
      Tom Lane authored
      Fix thinko in commit da1c9163: even if the original query was free of
      parallel hazards, we might introduce such a hazard by adding PARAM_EXEC
      Param nodes.  Adjust is_parallel_safe() so that it will scan the given
      expression whenever any such nodes have been created.  Per report from
      Andreas Seltenreich.
      
      Discussion: <878tse6yvf.fsf@credativ.de>
      4324ade9
    • Robert Haas's avatar
      autovacuum: Drop orphan temp tables more quickly but with more caution. · a734fd5d
      Robert Haas authored
      Previously, we only dropped an orphan temp table when it became old
      enough to threaten wraparound; instead, doing it immediately.  The
      only value of waiting is that someone might be able to examine the
      contents of the orphan temp table for forensic purposes, but it's
      pretty difficult to actually do that and few users will wish to do so.
      On the flip side, not performing the drop immediately generates log
      spam and bloats pg_class.
      
      In addition, per a report from Grigory Smolkin, if a temporary schema
      contains a very large number of temporary tables, a backend attempting
      to clear the temporary schema might fail due to lock table exhaustion.
      It's helpful for autovacuum to clean up after such cases, and we don't
      want it to wait for wraparound to threaten before doing so.  To
      prevent autovacuum from failing in the same manner as a backend trying
      to drop an entire temp schema, remove orphan temp tables in batches of
      50, committing after each batch, so that we don't accumulate an
      unbounded number of locks.  If a drop fails, retry other orphan tables
      that need to be dropped up to 10 times before giving up.  With this
      system, if a backend does fail to clean a temporary schema due to
      lock table exhaustion, autovacuum should hopefully put things right
      the next time it processes the database.
      
      Discussion: CAB7nPqSbYT6dRwsXVgiKmBdL_ARemfDZMPA+RPeC_ge0GK70hA@mail.gmail.com
      
      Michael Paquier, with a bunch of comment changes by me.
      a734fd5d
    • Tom Lane's avatar
      Fix test for subplans in force-parallel mode. · f24cf960
      Tom Lane authored
      We mustn't force parallel mode if the query has any subplans, since
      ExecSerializePlan doesn't transmit them to workers.  Testing
      top_plan->initPlan is inadequate because (1) there might be initPlans
      attached to lower plan nodes, and (2) non-initPlan subplans don't
      work either.  There's certainly room for improvement in those
      restrictions, but for the moment that's what we've got.
      
      Amit Kapila, per report from Andreas Seltenreich
      
      Discussion: <8737im6pmh.fsf@credativ.de>
      f24cf960
  9. 20 Nov, 2016 1 commit
    • Tom Lane's avatar
      Prevent multicolumn expansion of "foo.*" in an UPDATE source expression. · c5f365f3
      Tom Lane authored
      Because we use transformTargetList() for UPDATE as well as SELECT
      tlists, the code accidentally tried to expand a "*" reference into
      several columns.  This is nonsensical, because the UPDATE syntax
      provides exactly one target column to put the value into.  The
      immediate result was that transformUpdateTargetList() got confused
      and reported "UPDATE target count mismatch --- internal error".
      It seems better to treat such a reference as a plain whole-row
      variable, as it would be in other contexts.  (This could produce
      useful results when the target column is of composite type.)
      
      Fix by tweaking transformTargetList() to perform *-expansion only
      conditionally, depending on its exprKind parameter.
      
      Back-patch to 9.3.  The problem exists further back, but a fix would be
      much more invasive before that, because transformTargetList() wasn't
      told what kind of list it was working on.  Doesn't seem worth the
      trouble given the lack of field reports.  (I only noticed it because
      I was checking the code while trying to improve the documentation about
      how we handle "foo.*".)
      
      Discussion: <4308.1479595330@sss.pgh.pa.us>
      c5f365f3
  10. 19 Nov, 2016 2 commits
    • Tom Lane's avatar
      Fix latent costing error in create_merge_append_path. · 0832f2db
      Tom Lane authored
      create_merge_append_path should use the path rowcount it just computed,
      not rel->tuples, for costing purposes.  Those numbers should always be
      the same at present, but if we ever support parameterized MergeAppend
      paths (a case this function is otherwise prepared for), the former would
      be right and the latter wrong.
      
      No need for back-patch since the problem is only latent.
      
      Ashutosh Bapat
      
      Discussion: <CAFjFpRek+cLCnTo24youuGtsq4zRphEB8EUUPjDxZjnL4n4HYQ@mail.gmail.com>
      0832f2db
    • Tom Lane's avatar
      Code review for GUC serialization/deserialization code. · 13671b4b
      Tom Lane authored
      The serialization code dumped core for a string-valued GUC whose value
      is NULL, which is a legal state.  The infrastructure isn't capable of
      transmitting that state exactly, but fortunately, transmitting an empty
      string instead should be close enough (compare, eg, commit e45e990e).
      
      The code potentially underestimated the space required to format a
      real-valued variable, both because it made an unwarranted assumption that
      %g output would never be longer than %e output, and because it didn't count
      right even for %e format.  In practice this would pretty much always be
      masked by overestimates for other variables, but it's still wrong.
      
      Also fix boundary-case error in read_gucstate, incorrect handling of the
      case where guc_sourcefile is non-NULL but zero length (not clear that can
      happen, but if it did, this code would get totally confused), and
      confusingly useless check for a NULL result from read_gucstate.
      
      Andreas Seltenreich discovered the core dump; other issues noted while
      reading nearby code.  Back-patch to 9.5 where this code was introduced.
      
      Michael Paquier and Tom Lane
      
      Discussion: <871sy78wno.fsf@credativ.de>
      13671b4b
  11. 18 Nov, 2016 2 commits
    • Peter Eisentraut's avatar
      Add pg_sequences view · 67dc4ccb
      Peter Eisentraut authored
      Like pg_tables, pg_views, and others, this view contains information
      about sequences in a way that is independent of the system catalog
      layout but more comprehensive than the information schema.
      
      To help implement the view, add a new internal function
      pg_sequence_last_value() to return the last value of a sequence.  This
      is kept separate from pg_sequence_parameters() to separate querying
      run-time state from catalog-like information.
      Reviewed-by: default avatarAndreas Karlsson <andreas@proxel.se>
      67dc4ccb
    • Stephen Frost's avatar
      Clean up pg_dump tests, re-enable BLOB testing · 8f91f323
      Stephen Frost authored
      Add a loop to check that each test covers all of the pg_dump runs.  We
      (I) had been a bit sloppy when adding new runs and not making sure to
      mark if they should be under like or unlike for each test, this loop
      makes sure that the test system will complain if any are forgotten in
      the future.
      
      The loop also correctly handles the 'catch all' cases, which are used to
      avoid running unnecessary specific checks when a single catch-all can be
      done (eg: a no-acl run should not have any GRANT commands).
      
      Also, re-enable the testing of blobs, but use lo_from_bytea() instead of
      trying to be cute and writing out to a file and then reading it back in
      with psql, which proved to be difficult for some buildfarm members.
      This allows us to add support for testing the --no-blobs option which
      will be getting added shortly, provided the buildfarm doesn't blow up on
      this.
      8f91f323
  12. 17 Nov, 2016 5 commits
    • Robert Haas's avatar
      Remove or reduce verbosity of some debug messages. · a43f1939
      Robert Haas authored
      The debug messages that merely print StartTransactionCommand,
      CommitTransactionCommand, ProcessUtilty, or ProcessQuery with no
      additional details seem to be useless.  Get rid of them.
      
      The transaction status messages produced by ShowTransactionState are
      occasionally useful, but they are extremely verbose, producing
      multiple lines of log output every time they fire, which can happens
      multiple times per transaction.  So, reduce the level to DEBUG5; avoid
      emitting an extra line just to explain which debug point is at issue;
      and tighten up the rest of the message so it doesn't use quite so much
      horizontal space.
      
      With these changes, it's possible to run a somewhat busy system with a
      log level even as high as DEBUG4, whereas previously anything above
      DEBUG2 would flood the log with output that probably wasn't really all
      that useful.
      a43f1939
    • Tom Lane's avatar
      Fix pg_dump's handling of circular dependencies in views. · d8c05aff
      Tom Lane authored
      pg_dump's traditional solution for breaking a circular dependency involving
      a view was to create the view with CREATE TABLE and then later issue CREATE
      RULE "_RETURN" ... to convert the table to a view, relying on the backend's
      very very ancient code that supports making views that way.  We've wanted
      to get rid of that kluge for a long time, but the thing that finally
      motivates doing something about it is the recognition that this method
      fails with the --clean option, because it leads to issuing DROP RULE
      "_RETURN" followed by DROP TABLE --- and the backend won't let you drop a
      view's _RETURN rule.
      
      Instead, let's break circular dependencies by initially creating the view
      using CREATE VIEW AS SELECT NULL::columntype AS columnname, ... (so that
      it has the right column names and types to support external references,
      but no dependencies beyond the column data types), and then later dumping
      the ON SELECT rule using the spelling CREATE OR REPLACE VIEW.  This method
      wasn't available when this code was originally written, but it's been
      possible since PG 7.3, so it seems fine to start relying on it now.
      
      To solve the --clean problem, make the dropStmt for an ON SELECT rule
      be CREATE OR REPLACE VIEW with the same dummy target list as above.
      In this way, during the DROP phase, we first reduce the view to have
      no extra dependencies, and then we can drop it entirely when we've
      gotten rid of whatever had a circular dependency on it.
      
      (Note: this should work adequately well with the --if-exists option, since
      the CREATE OR REPLACE VIEW will go through whether the view exists or not.
      It could fail if the view exists with a conflicting column set, but we
      don't really support --clean against a non-matching database anyway.)
      
      This allows cleaning up some other kluges inside pg_dump, notably that
      we don't need a notion of reloptions attached to a rule anymore.
      
      Although this is a bug fix, commit to HEAD only for now.  The problem's
      existed for a long time and we've had relatively few complaints, so it
      doesn't really seem worth taking risks to fix it in the back branches.
      We might revisit that choice if no problems emerge.
      
      Discussion: <19092.1479325184@sss.pgh.pa.us>
      d8c05aff
    • Tom Lane's avatar
      Improve pg_dump/pg_restore --create --if-exists logic. · ac888986
      Tom Lane authored
      Teach it not to complain if the dropStmt attached to an archive entry
      is actually spelled CREATE OR REPLACE VIEW, since that will happen due to
      an upcoming bug fix.  Also, if it doesn't recognize a dropStmt, have it
      print a WARNING and then emit the dropStmt unmodified.  That seems like a
      much saner behavior than Assert'ing or dumping core due to a null-pointer
      dereference, which is what would happen before :-(.
      
      Back-patch to 9.4 where this option was introduced.
      
      Discussion: <19092.1479325184@sss.pgh.pa.us>
      ac888986
    • Tom Lane's avatar
      Re-pgindent src/bin/pg_dump/* · fcf70e0d
      Tom Lane authored
      Cleanup for recent patches --- it's not much change, but I got annoyed
      while re-indenting the view-rule fix I'm working on.
      fcf70e0d
    • Alvaro Herrera's avatar
      Avoid pin scan for replay of XLOG_BTREE_VACUUM in all cases · f65b94f6
      Alvaro Herrera authored
      Replay of XLOG_BTREE_VACUUM during Hot Standby was previously thought to
      require complex interlocking that matched the requirements on the
      master. This required an O(N) operation that became a significant
      problem with large indexes, causing replication delays of seconds or in
      some cases minutes while the XLOG_BTREE_VACUUM was replayed.
      
      This commit skips the “pin scan” that was previously required, by
      observing in detail when and how it is safe to do so, with full
      documentation. The pin scan is skipped only in replay; the VACUUM code
      path on master is not touched here.
      
      No tests included. Manual tests using an additional patch to view WAL records
      and their timing have shown the change in WAL records and their handling has
      successfully reduced replication delay.
      
      This is a back-patch of commits 687f2cd7, 3e4b7d87, b6028426
      by Simon Riggs, to branches 9.4 and 9.5.  No further backpatch is
      possible because this depends on catalog scans being MVCC.  I (Álvaro)
      additionally updated a slight problem in the README, which explains why
      this touches the 9.6 and master branches.
      f65b94f6
  13. 16 Nov, 2016 2 commits
  14. 15 Nov, 2016 4 commits
    • Tom Lane's avatar
      Check that result tupdesc has exactly 1 column in return_next scalar case. · 4ecd1974
      Tom Lane authored
      This should always be true, but since we're relying on a tuple descriptor
      passed from outside pltcl itself, let's check.  Per a gripe from Coverity.
      4ecd1974
    • Robert Haas's avatar
      Reserve zero as an invalid DSM handle. · b40b4dd9
      Robert Haas authored
      Previously, the handle for the control segment could not be zero, but
      some other DSM segment could potentially have a handle value of zero.
      However, that means that if someone wanted to store a dsm_handle that
      might or might not be valid, they would need a separate boolean to
      keep track of whether the associated value is legal.  That's annoying,
      so change things so that no DSM segment can ever have a handle of 0 -
      or as we call it here, DSM_HANDLE_INVALID.
      
      Thomas Munro.  This was submitted as part of a much larger patch to
      add an malloc-like allocator for dynamic shared memory, but this part
      seems like a good idea independently of the rest of the patch.
      b40b4dd9
    • Tom Lane's avatar
      Allow DOS-style line endings in ~/.pgpass files. · 0a748193
      Tom Lane authored
      On Windows, libc will mask \r\n line endings for us, since we read the
      password file in text mode.  But that doesn't happen on Unix.  People
      who share password files across both systems might have \r\n line endings
      in a file they use on Unix, so as a convenience, ignore trailing \r.
      Per gripe from Josh Berkus.
      
      In passing, put the existing check for empty line somewhere where it's
      actually useful, ie after stripping the newline not before.
      
      Vik Fearing, adjusted a bit by me
      
      Discussion: <0de37763-5843-b2cc-855e-5d0e5df25807@agliodbs.com>
      0a748193
    • Tom Lane's avatar
      Account for catalog snapshot in PGXACT->xmin updates. · ffaa44cb
      Tom Lane authored
      The CatalogSnapshot was not plugged into SnapshotResetXmin()'s accounting
      for whether MyPgXact->xmin could be cleared or advanced.  In normal
      transactions this was masked by the fact that the transaction snapshot
      would be older, but during backend startup and certain utility commands
      it was possible to re-use the CatalogSnapshot after MyPgXact->xmin had
      been cleared, meaning that recently-deleted rows could be pruned even
      though this snapshot could still see them, causing unexpected catalog
      lookup failures.  This effect appears to be the explanation for a recent
      failure on buildfarm member piculet.
      
      To fix, add the CatalogSnapshot to the RegisteredSnapshots heap whenever
      it is valid.
      
      In the previous logic, it was possible for the CatalogSnapshot to remain
      valid across waits for client input, but with this change that would mean
      it delays advance of global xmin in cases where it did not before.  To
      avoid possibly causing new table-bloat problems with clients that sit idle
      for long intervals, add code to invalidate the CatalogSnapshot before
      waiting for client input.  (When the backend is busy, it's unlikely that
      the CatalogSnapshot would be the oldest snap for very long, so we don't
      worry about forcing early invalidation of it otherwise.)
      
      In passing, remove the CatalogSnapshotStale flag in favor of using
      "CatalogSnapshot != NULL" to represent validity, as we do for the other
      special snapshots in snapmgr.c.  And improve some obsolete comments.
      
      No regression test because I don't know a deterministic way to cause this
      failure.  But the stress test shown in the original discussion provokes
      "cache lookup failed for relation 1255" within a few dozen seconds for me.
      
      Back-patch to 9.4 where MVCC catalog scans were introduced.  (Note: it's
      quite easy to produce similar failures with the same test case in branches
      before 9.4.  But MVCC catalog scans were supposed to fix that.)
      
      Discussion: <16447.1478818294@sss.pgh.pa.us>
      ffaa44cb