1. 11 Mar, 2021 10 commits
    • Peter Geoghegan's avatar
      Add back vacuum_cleanup_index_scale_factor parameter. · effdd3f3
      Peter Geoghegan authored
      Commit 9f3665fb removed the vacuum_cleanup_index_scale_factor storage
      parameter.  However, that creates dump/reload hazards when moving across
      major versions.
      
      Add back the vacuum_cleanup_index_scale_factor parameter (though not the
      GUC of the same name) purely to avoid problems when using tools like
      pg_upgrade.  The parameter remains disabled and undocumented.
      
      No backpatch to Postgres 13, since vacuum_cleanup_index_scale_factor was
      only disabled by REL_13_STABLE's version of master branch commit
      9f3665fb in the first place -- the parameter already looks like this on
      REL_13_STABLE.
      
      Discussion: https://postgr.es/m/YEm/a3Ko3nKnBuVq@paquier.xyz
      effdd3f3
    • Robert Haas's avatar
      Be clear about whether a recovery pause has taken effect. · 32fd2b57
      Robert Haas authored
      Previously, the code and documentation seem to have essentially
      assumed than a call to pg_wal_replay_pause() would take place
      immediately, but that's not the case, because we only check for a
      pause in certain places. This means that a tool that uses this
      function and then wants to do something else afterward that is
      dependent on the pause having taken effect doesn't know how long it
      needs to wait to be sure that no more WAL is going to be replayed.
      
      To avoid that, add a new function pg_get_wal_replay_pause_state()
      which returns either 'not paused', 'paused requested', or 'paused'.
      After calling pg_wal_replay_pause() the status will immediate change
      from 'not paused' to 'pause requested'; when the startup process
      has noticed this, the status will change to 'pause'.  For backward
      compatibility, pg_is_wal_replay_paused() still exists and returns
      the same thing as before: true if a pause has been requested,
      whether or not it has taken effect yet; and false if not.
      The documentation is updated to clarify.
      
      To improve the changes that a pause request is quickly confirmed
      effective, adjust things so that WaitForWALToBecomeAvailable will
      swiftly reach a call to recoveryPausesHere() when a pause request
      is made.
      
      Dilip Kumar, reviewed by Simon Riggs, Kyotaro Horiguchi, Yugo Nagata,
      Masahiko Sawada, and Bharath Rupireddy.
      
      Discussion: http://postgr.es/m/CAFiTN-vcLLWEm8Zr%3DYK83rgYrT9pbC8VJCfa1kY9vL3AUPfu6g%40mail.gmail.com
      32fd2b57
    • Tom Lane's avatar
      Re-simplify management of inStart in pqParseInput3's subroutines. · 51c54bb6
      Tom Lane authored
      Commit 92785dac copied some logic related to advancement of inStart
      from pqParseInput3 into getRowDescriptions and getAnotherTuple,
      because it wanted to allow user-defined row processor callbacks to
      potentially longjmp out of the library, and inStart would have to be
      updated before that happened to avoid an infinite loop.  We later
      decided that that API was impossibly fragile and reverted it, but
      we didn't undo all of the related code changes, and this bit of
      messiness survived.  Undo it now so that there's just one place in
      pqParseInput3's processing where inStart is advanced; this will
      simplify addition of better tracing support.
      
      getParamDescriptions had grown similar processing somewhere along
      the way (not in 92785dac; I didn't track down just when), but it's
      actually buggy because its handling of corrupt-message cases seems to
      have been copied from the v2 logic where we lacked a known message
      length.  The cases where we "goto not_enough_data" should not simply
      return EOF, because then we won't consume the message, potentially
      creating an infinite loop.  That situation now represents a
      definitively corrupt message, and we should report it as such.
      
      Although no field reports of getParamDescriptions getting stuck in
      a loop have been seen, it seems appropriate to back-patch that fix.
      I chose to back-patch all of this to keep the logic looking more alike
      in supported branches.
      
      Discussion: https://postgr.es/m/2217283.1615411989@sss.pgh.pa.us
      51c54bb6
    • Robert Haas's avatar
      Refactor and generalize the ParallelSlot machinery. · f71519e5
      Robert Haas authored
      Create a wrapper object, ParallelSlotArray, to encapsulate the
      number of slots and the slot array itself, plus some other relevant
      bits of information. This reduces the number of parameters we have
      to pass around all over the place.
      
      Allow for a ParallelSlotArray to contain slots connected to
      different databases within a single cluster. The current clients
      of this mechanism don't need this, but it is expected to be used
      by future patches.
      
      Defer connecting to databases until we actually need the connection
      for something. This is a slight behavior change for vacuumdb and
      reindexdb. If you specify a number of jobs that is larger than the
      number of objects, the extra connections will now not be used.
      But, on the other hand, if you specify a number of jobs that is
      so large that it's going to fail, the failure would previously have
      happened before any operations were actually started, and now it
      won't.
      
      Mark Dilger, reviewed by me.
      
      Discussion: http://postgr.es/m/12ED3DA8-25F0-4B68-937D-D907CFBF08E7@enterprisedb.com
      Discussion: http://postgr.es/m/BA592F2D-F928-46FF-9516-2B827F067F57@enterprisedb.com
      f71519e5
    • Michael Paquier's avatar
      Set libcrypto callbacks for all connection threads in libpq · 2c0cefcd
      Michael Paquier authored
      Based on an analysis of the OpenSSL code with Jacob, moving to EVP for
      the cryptohash computations makes necessary the setup of the libcrypto
      callbacks that were getting set only for SSL connections, but not for
      connections without SSL.  Not setting the callbacks makes the use of
      threads potentially unsafe for connections calling cryptohashes during
      authentication, like MD5 or SCRAM, if a failure happens during a
      cryptohash computation.  The logic setting the libssl and libcrypto
      states is then split into two parts, both using the same locking, with
      libcrypto being set up for SSL and non-SSL connections, while SSL
      connections set any libssl state afterwards as needed.
      
      Prior to this commit, only SSL connections would have set libcrypto
      callbacks that are necessary to ensure a proper thread locking when
      using multiple concurrent threads in libpq (ENABLE_THREAD_SAFETY).  Note
      that this is only required for OpenSSL 1.0.2 and 1.0.1 (oldest version
      supported on HEAD), as 1.1.0 has its own internal locking and it has
      dropped support for CRYPTO_set_locking_callback().
      
      Tests with up to 300 threads with OpenSSL 1.0.1 and 1.0.2, mixing SSL
      and non-SSL connection threads did not show any performance impact after
      some micro-benchmarking.  pgbench can be used here with -C and a
      mostly-empty script (with one \set meta-command for example) to stress
      authentication requests, and we have mixed that with some custom
      programs for testing.
      
      Reported-by: Jacob Champion
      Author: Michael Paquier
      Reviewed-by: Jacob Champion
      Discussion: https://postgr.es/m/fd3ba610085f1ff54623478cf2f7adf5af193cbb.camel@vmware.com
      2c0cefcd
    • Peter Geoghegan's avatar
      Doc: B-Tree only has one additional parameter. · 3f0daeb0
      Peter Geoghegan authored
      Oversight in commit 9f3665fb.
      
      Backpatch: 13-, just like commit 9f3665fb.
      3f0daeb0
    • Thomas Munro's avatar
      Improve comment for struct BufferDesc. · 049d9b87
      Thomas Munro authored
      Add a note that per-buffer I/O condition variables currently live
      outside the BufferDesc struct.  Follow-up for commit d8725104.
      Reported-by: default avatarJulien Rouhaud <rjuju123@gmail.com>
      Discussion: https://postgr.es/m/20210311031118.hucytmrgwlktjxgq%40nol
      049d9b87
    • Bruce Momjian's avatar
      tutorial: land height is "elevation", not "altitude" · 2950ff32
      Bruce Momjian authored
      This is a follow-on patch to 92c12e46.  In that patch, we renamed
      "altitude" to "elevation" in the docs, based on these details:
      
         https://mapscaping.com/blogs/geo-candy/what-is-the-difference-between-elevation-relief-and-altitude
      
      This renames the tutorial SQL files to match the documentation.
      
      Reported-by: max1@inbox.ru
      
      Discussion: https://postgr.es/m/161512392887.1046.3137472627109459518@wrigleys.postgresql.org
      
      Backpatch-through: 9.6
      2950ff32
    • Peter Geoghegan's avatar
      VACUUM ANALYZE: Always update pg_class.reltuples. · 5f8727f5
      Peter Geoghegan authored
      vacuumlazy.c sometimes fails to update pg_class entries for each index
      (to ensure that pg_class.reltuples is current), even though analyze.c
      assumed that that must have happened during VACUUM ANALYZE.  There are
      at least a couple of reasons for this.  For example, vacuumlazy.c could
      fail to update pg_class when the index AM indicated that its statistics
      are merely an estimate, per the contract for amvacuumcleanup() routines
      established by commit e5734597 back in 2006.
      
      Stop assuming that pg_class must have been updated with accurate
      statistics within VACUUM ANALYZE -- update pg_class for indexes at the
      same time as the table relation in all cases.  That way VACUUM ANALYZE
      will never fail to keep pg_class.reltuples reasonably accurate.
      
      The only downside of this approach (compared to the old approach) is
      that it might inaccurately set pg_class.reltuples for indexes whose heap
      relation ends up with the same inaccurate value anyway.  This doesn't
      seem too bad.  We already consistently called vac_update_relstats() (to
      update pg_class) for the heap/table relation twice during any VACUUM
      ANALYZE -- once in vacuumlazy.c, and once in analyze.c.  We now make
      sure that we call vac_update_relstats() at least once (though often
      twice) for each index.
      
      This is follow up work to commit 9f3665fb, which dealt with issues in
      btvacuumcleanup().  Technically this fixes an unrelated issue, though.
      btvacuumcleanup() no longer provides an accurate num_index_tuples value
      following commit 9f3665fb (when there was no btbulkdelete() call during
      the VACUUM operation in question), but hashvacuumcleanup() has worked in
      the same way for many years now.
      
      Author: Peter Geoghegan <pg@bowt.ie>
      Reviewed-By: default avatarMasahiko Sawada <sawada.mshk@gmail.com>
      Discussion: https://postgr.es/m/CAH2-WzknxdComjhqo4SUxVFk_Q1171GJO2ZgHZ1Y6pion6u8rA@mail.gmail.com
      Backpatch: 13-, just like commit 9f3665fb.
      5f8727f5
    • Peter Geoghegan's avatar
      Don't consider newly inserted tuples in nbtree VACUUM. · 9f3665fb
      Peter Geoghegan authored
      Remove the entire idea of "stale stats" within nbtree VACUUM (stop
      caring about stats involving the number of inserted tuples).  Also
      remove the vacuum_cleanup_index_scale_factor GUC/param on the master
      branch (though just disable them on postgres 13).
      
      The vacuum_cleanup_index_scale_factor/stats interface made the nbtree AM
      partially responsible for deciding when pg_class.reltuples stats needed
      to be updated.  This seems contrary to the spirit of the index AM API,
      though -- it is not actually necessary for an index AM's bulk delete and
      cleanup callbacks to provide accurate stats when it happens to be
      inconvenient.  The core code owns that.  (Index AMs have the authority
      to perform or not perform certain kinds of deferred cleanup based on
      their own considerations, such as page deletion and recycling, but that
      has little to do with pg_class.reltuples/num_index_tuples.)
      
      This issue was fairly harmless until the introduction of the
      autovacuum_vacuum_insert_threshold feature by commit b07642db, which had
      an undesirable interaction with the vacuum_cleanup_index_scale_factor
      mechanism: it made insert-driven autovacuums perform full index scans,
      even though there is no real benefit to doing so.  This has been tied to
      a regression with an append-only insert benchmark [1].
      
      Also have remaining cases that perform a full scan of an index during a
      cleanup-only nbtree VACUUM indicate that the final tuple count is only
      an estimate.  This prevents vacuumlazy.c from setting the index's
      pg_class.reltuples in those cases (it will now only update pg_class when
      vacuumlazy.c had TIDs for nbtree to bulk delete).  This arguably fixes
      an oversight in deduplication-related bugfix commit 48e12913.
      
      [1] https://smalldatum.blogspot.com/2021/01/insert-benchmark-postgres-is-still.html
      
      Author: Peter Geoghegan <pg@bowt.ie>
      Reviewed-By: default avatarMasahiko Sawada <sawada.mshk@gmail.com>
      Discussion: https://postgr.es/m/CAD21AoA4WHthN5uU6+WScZ7+J_RcEjmcuH94qcoUPuB42ShXzg@mail.gmail.com
      Backpatch: 13-, where autovacuum_vacuum_insert_threshold was added.
      9f3665fb
  2. 10 Mar, 2021 18 commits
  3. 09 Mar, 2021 6 commits
    • Alexander Korotkov's avatar
      Fix vague comment in jsonb documentation · 6540cc51
      Alexander Korotkov authored
      The sample query fails because of an attempt to update the key of a numeric.
      But the comment says it's just because of the missing object key.  That's not
      correct because jsonb subscription automatically adds missing keys.
      
      Reported-by: Nikita Konev
      6540cc51
    • Peter Eisentraut's avatar
      libpq: Remove deprecated connection parameters authtype and tty · 14d9b376
      Peter Eisentraut authored
      The authtype parameter was deprecated and made inactive in commit
      d5bbe2ac, but the environment variable was left defined and thus
      tested with a getenv call even though the value is of no use.  Also,
      if it would exist it would be copied but never freed as the cleanup
      code had been removed.
      
      tty was deprecated in commit cb7fb3ca but most of the
      infrastructure around it remained in place.
      
      Author: Daniel Gustafsson <daniel@yesql.se>
      Discussion: https://postgr.es/m/DDDF36F3-582A-4C02-8598-9B464CC42B34@yesql.se
      14d9b376
    • Michael Paquier's avatar
      Switch back sslcompression to be a normal input field in libpq · 096bbf7c
      Michael Paquier authored
      Per buildfarm member crake, any servers including a postgres_fdw server
      with this option set would fail to do a pg_upgrade properly as the
      option got hidden in f9264d15 by becoming a debug option, making the
      restore of the FDW server fail.
      
      This changes back the option in libpq to be visible, but still inactive
      to fix this upgrade issue.
      
      Discussion: https://postgr.es/m/YEbq15JKJwIX+S6m@paquier.xyz
      096bbf7c
    • Fujii Masao's avatar
      Track total amounts of times spent writing and syncing WAL data to disk. · ff99918c
      Fujii Masao authored
      This commit adds new GUC track_wal_io_timing. When this is enabled,
      the total amounts of time XLogWrite writes and issue_xlog_fsync syncs
      WAL data to disk are counted in pg_stat_wal. This information would be
      useful to check how much WAL write and sync affect the performance.
      
      Enabling track_wal_io_timing will make the server query the operating
      system for the current time every time WAL is written or synced,
      which may cause significant overhead on some platforms. To avoid such
      additional overhead in the server with track_io_timing enabled,
      this commit introduces track_wal_io_timing as a separate parameter from
      track_io_timing.
      
      Note that WAL write and sync activity by walreceiver has not been tracked yet.
      
      This commit makes the server also track the numbers of times XLogWrite
      writes and issue_xlog_fsync syncs WAL data to disk, in pg_stat_wal,
      regardless of the setting of track_wal_io_timing. This counters can be
      used to calculate the WAL write and sync time per request, for example.
      
      Bump PGSTAT_FILE_FORMAT_ID.
      
      Bump catalog version.
      
      Author: Masahiro Ikeda
      Reviewed-By: Japin Li, Hayato Kuroda, Masahiko Sawada, David Johnston, Fujii Masao
      Discussion: https://postgr.es/m/0509ad67b585a5b86a83d445dfa75392@oss.nttdata.com
      ff99918c
    • Michael Paquier's avatar
      Add support for more progress reporting in COPY · 9d2d4570
      Michael Paquier authored
      The command (TO or FROM), its type (file, pipe, program or callback),
      and the number of tuples excluded by a WHERE clause in COPY FROM are
      added to the progress reporting already available.
      
      The column "lines_processed" is renamed to "tuples_processed" to
      disambiguate the meaning of this column in the cases of CSV and BINARY
      COPY and to be more consistent with the other catalog progress views.
      
      Bump catalog version, again.
      
      Author: Matthias van de Meent
      Reviewed-by: Michael Paquier, Justin Pryzby, Bharath Rupireddy, Josef
      Šimánek, Tomas Vondra
      Discussion: https://postgr.es/m/CAEze2WiOcgdH4aQA8NtZq-4dgvnJzp8PohdeKchPkhMY-jWZXA@mail.gmail.com
      9d2d4570
    • Michael Paquier's avatar
      Remove support for SSL compression · f9264d15
      Michael Paquier authored
      PostgreSQL disabled compression as of e3bdb2d9 and the documentation
      recommends against using it since.  Additionally, SSL compression has
      been disabled in OpenSSL since version 1.1.0, and was disabled in many
      distributions long before that.  The most recent TLS version, TLSv1.3,
      disallows compression at the protocol level.
      
      This commit removes the feature itself, removing support for the libpq
      parameter sslcompression (parameter still listed for compatibility
      reasons with existing connection strings, just ignored), and removes
      the equivalent field in pg_stat_ssl and de facto PgBackendSSLStatus.
      
      Note that, on top of removing the ability to activate compression by
      configuration, compression is actively disabled in both frontend and
      backend to avoid overrides from local configurations.
      
      A TAP test is added for deprecated SSL parameters to check after
      backwards compatibility.
      
      Bump catalog version.
      
      Author: Daniel Gustafsson
      Reviewed-by: Peter Eisentraut, Magnus Hagander, Michael Paquier
      Discussion:  https://postgr.es/m/7E384D48-11C5-441B-9EC3-F7DB1F8518F6@yesql.se
      f9264d15
  4. 08 Mar, 2021 5 commits
    • Tom Lane's avatar
      Complain if a function-in-FROM returns a set when it shouldn't. · d4545dc1
      Tom Lane authored
      Throw a "function protocol violation" error if a function in FROM
      tries to return a set though it wasn't marked proretset.  Although
      such cases work at the moment, it doesn't seem like something we
      want to guarantee will keep working.  Besides, there are other
      negative consequences of not setting the proretset flag, such as
      potentially bad plans.
      
      No back-patch, since if there is any third-party code violating
      this expectation, people wouldn't appreciate us breaking it in
      a minor release.
      
      Discussion: https://postgr.es/m/1636062.1615141782@sss.pgh.pa.us
      d4545dc1
    • Tom Lane's avatar
      Properly mark pg_stat_get_subscription() as returning a set. · fed10d4e
      Tom Lane authored
      The initial catalog data for this function failed to set proretset
      or provide a prorows estimate.  It accidentally worked anyway when
      invoked in the FROM clause, because the executor isn't too picky
      about this; but the planner didn't expect the function to return
      multiple rows, which could lead to bad plans.  Also the function
      would fail if invoked in the SELECT list.
      
      We can't easily back-patch this fix, but fortunately the bug's
      consequences aren't awful in most cases.  Getting this right is
      mainly an exercise in future-proofing.
      
      Discussion: https://postgr.es/m/1636062.1615141782@sss.pgh.pa.us
      fed10d4e
    • Tom Lane's avatar
      Validate the OID argument of pg_import_system_collations(). · 5c06abb9
      Tom Lane authored
      "SELECT pg_import_system_collations(0)" caused an assertion failure.
      With a random nonzero argument --- or indeed with zero, in non-assert
      builds --- it would happily make pg_collation entries with garbage
      values of collnamespace.  These are harmless as far as I can tell
      (unless maybe the OID happens to become used for a schema, later on?).
      In any case this isn't a security issue, since the function is
      superuser-only.  But it seems like a gotcha for unwary DBAs, so let's
      add a check that the given OID belongs to some schema.
      
      Back-patch to v10 where this function was introduced.
      5c06abb9
    • Tom Lane's avatar
      Further tweak memory management for regex DFAs. · 6c20bdb2
      Tom Lane authored
      Coverity is still unhappy after commit 190c7988, and after looking
      closer I think it might be onto something.  The callers of newdfa()
      typically drop out if v->err has been set nonzero, which newdfa()
      is faithfully doing if it fails.  However, what if v->err was already
      nonzero before we entered newdfa()?  Then newdfa() could succeed and
      the caller would promptly leak its result.
      
      I don't think this scenario can actually happen, but the predicate
      "v->err is always zero when newdfa() is called" seems difficult to be
      entirely sure of; there's a good deal of code that potentially could
      get that wrong.
      
      It seems better to adjust the callers to directly check for a null
      result instead of relying on ISERR() tests.  This is slightly cheaper
      than the previous coding anyway.
      
      Lacking evidence that there's any real bug, no back-patch.
      6c20bdb2
    • Amit Kapila's avatar
      Track replication origin progress for rollbacks. · 8a812e51
      Amit Kapila authored
      Commit 1eb6d652 allowed to track replica origin replay progress for 2PC
      but it was not complete. It misses to properly track the progress for
      rollback prepared especially it missed updating the code for recovery.
      Additionally, we need to allow tracking it on subscriber nodes where
      wal_level might not be logical.
      
      It is required to track decoding of 2PC which is committed in PG14
      (a271a1b5) and also nobody complained about this till now so not
      backpatching it.
      
      Author: Amit Kapila
      Reviewed-by: Michael Paquier and Ajin Cherian
      Discussion: https://postgr.es/m/CAA4eK1L-kHmMnSdrRW6UhRbCjR7cgh04c+6psY15qzT6ktcd+g@mail.gmail.com
      8a812e51
  5. 06 Mar, 2021 1 commit