1. 18 Oct, 2021 1 commit
    • Michael Paquier's avatar
      Reset properly snapshot export state during transaction abort · 5b353aaf
      Michael Paquier authored
      During a replication slot creation, an ERROR generated in the same
      transaction as the one creating a to-be-exported snapshot would have
      left the backend in an inconsistent state, as the associated static
      export snapshot state was not being reset on transaction abort, but only
      on the follow-up command received by the WAL sender that created this
      snapshot on replication slot creation.  This would trigger inconsistency
      failures if this session tried to export again a snapshot, like during
      the creation of a replication slot.
      
      Note that a snapshot export cannot happen in a transaction block, so
      there is no need to worry resetting this state for subtransaction
      aborts.  Also, this inconsistent state would very unlikely show up to
      users.  For example, one case where this could happen is an
      out-of-memory error when building the initial snapshot to-be-exported.
      Dilip found this problem while poking at a different patch, that caused
      an error in this code path for reasons unrelated to HEAD.
      
      Author: Dilip Kumar
      Reviewed-by: Michael Paquier, Zhihong Yu
      Discussion: https://postgr.es/m/CAFiTN-s0zA1Kj0ozGHwkYkHwa5U0zUE94RSc_g81WrpcETB5=w@mail.gmail.com
      Backpatch-through: 9.6
      5b353aaf
  2. 16 Oct, 2021 2 commits
    • Tom Lane's avatar
      Avoid core dump in pg_dump when dumping from pre-8.3 server. · 3e4c8db9
      Tom Lane authored
      Commit f0e21f2f6 missed adding a tgisinternal output column
      to getTriggers' query for pre-8.3 servers.  Back-patch to v11,
      like that commit.
      3e4c8db9
    • Tom Lane's avatar
      Make pg_dump acquire lock on partitioned tables that are to be dumped. · b5152e3b
      Tom Lane authored
      It was clearly the intent to do so all along, but the original coding
      fat-fingered this by checking the wrong array element.  We fixed it
      in passing in 403a3d91, but that later got reverted, and we forgot
      to keep this bug fix.
      
      Most of the time this'd be relatively harmless, since once we lock
      any of the partitioned table's leaf partitions, that would suffice
      to prevent major DDL on the partitioned table itself.  However, a
      childless partitioned table would get dumped with no relevant lock
      whatsoever, possibly allowing dump failure or inconsistent output.
      
      Unlike 403a3d91, there are no versioning concerns, since every server
      version that has partitioned tables will allow you to lock one.
      
      Back-patch to v10 where partitioned tables were introduced.
      
      Discussion: https://postgr.es/m/1018205.1634346327@sss.pgh.pa.us
      b5152e3b
  3. 15 Oct, 2021 1 commit
  4. 14 Oct, 2021 3 commits
  5. 13 Oct, 2021 5 commits
    • Alvaro Herrera's avatar
      Change recently added test code for stability · 79c7fe1a
      Alvaro Herrera authored
      The test code added with ff9f111bce24 fails under valgrind, and probably
      other slow cases too, because if (say) autovacuum runs in between and
      produces WAL of its own, the large INSERT fails to account for that in
      the LSN calculations.  Rewrite to use a DO loop.
      
      Per complaint from Andres Freund
      
      Backpatch to all branches.
      
      Discussion: https://postgr.es/m/20211013180338.5guyqzpkcisqugrl@alap3.anarazel.de
      79c7fe1a
    • Peter Geoghegan's avatar
      pg_amcheck: avoid unhelpful verification attempts. · dd58194c
      Peter Geoghegan authored
      Avoid calling contrib/amcheck functions with relations that are
      unsuitable for checking.  Specifically, don't attempt verification of
      temporary relations, or indexes whose pg_index entry indicates that the
      index is invalid, or not ready.
      
      These relations are not supported by any of the contrib/amcheck
      functions, for reasons that are pretty fundamental.  For example, the
      implementation of REINDEX CONCURRENTLY can add its own "transient"
      pg_index entries, which has rather unclear implications for the B-Tree
      verification functions, at least in the general case -- so they just
      treat it as an error.  It falls to the amcheck caller (in this case
      pg_amcheck) to deal with the situation at a higher level.
      
      pg_amcheck now simply treats these conditions as additional "visibility
      concerns" when it queries system catalogs.  This is a little arbitrary.
      It seems to have the least problems among any of the available
      alternatives.
      
      Author: Mark Dilger <mark.dilger@enterprisedb.com>
      Reported-By: default avatarAlexander Lakhin <exclusion@gmail.com>
      Reviewed-By: default avatarPeter Geoghegan <pg@bowt.ie>
      Reviewed-By: default avatarRobert Haas <robertmhaas@gmail.com>
      Bug: #17212
      Discussion: https://postgr.es/m/17212-34dd4a1d6bba98bf@postgresql.org
      Backpatch: 14-, where pg_amcheck was introduced.
      dd58194c
    • Etsuro Fujita's avatar
      postgres_fdw: Move comments about elog level in (sub)abort cleanup. · 419d27b1
      Etsuro Fujita authored
      The comments were misplaced when adding postgres_fdw.  Fix that by
      moving the comments to more appropriate functions.
      
      Author: Etsuro Fujita
      Backpatch-through: 9.6
      Discussion: https://postgr.es/m/CAPmGK164sAXQtC46mDFyu6d-T25Mzvh5qaRNkit06VMmecYnOA%40mail.gmail.com
      419d27b1
    • Michael Paquier's avatar
      Fix use-after-free with multirange types in CREATE TYPE · 922e15c4
      Michael Paquier authored
      The code was freeing the name of the multirange type function stored in
      the parse tree but it should not do that.  Event triggers could for
      example look at such a corrupted parsed tree with a ddl_command_end
      event.
      
      Author: Alex Kozhemyakin, Sergey Shinderuk
      Reviewed-by: Peter Eisentraut, Michael Paquier
      Discussion: https://postgr.es/m/d5042d46-b9cd-6efb-219a-71ed0cf45bc8@postgrespro.ru
      Backpatch-through: 14
      922e15c4
    • Michael Paquier's avatar
      Fix tests of pg_upgrade across different major versions · f4e1c889
      Michael Paquier authored
      This fixes a set of issues that cause different breakages or annoyances
      when using pg_upgrade's test.sh to do upgrades across different major
      versions:
      - test.sh is completely broken when using v14 as new version because of
      the removal of testtablespace/ as Makefile rule.  Older versions of
      pg_regress don't support --make-tablespacedir, blocking the creation of
      the tablespace.  In order to fix that, it is simple enough to create
      those directories in the script itself, but only do that when an old
      version is involved.  This fix is needed on HEAD and REL_14_STABLE.
      - The script would fail when using PG <= v11 as old version because of
      WITH OIDS relations not supported in v12.  In order to fix this, this
      steals a method from the buildfarm that uses a DO block to change all
      the relations marked as WITH OIDS, allowing pg_upgrade to pass.  This is
      more portable than using ALTER TABLE queries on the relations causing
      issues.  This is fixed down to v12, and authored originally by Andrew
      Dunstan.
      - Not using --extra-float-digits=0 with v11 as old version causes
      a lot of diffs in the dumps, making the whole unreadable.  This gets
      only done when using v11 as old version.  This is fixed down to v12.
      The buildfarm code uses that already.
      
      Note that the addition of --wal-segsize and --allow-group-access breaks
      the script when using v10 or older at initdb time as these got added in
      11.  10 would be EOL'd next year and nobody has complained about those
      problems yet, so nothing is done about that.  This means that this
      commit fixes upgrade tests using test.sh with v11 as minimum older
      version, up to HEAD, and that it is enough to apply this change down to
      12.  The old and new dumps still generate diffs, still require manual
      checks, and more could be done to reduce the noise, but this allows the
      tests to run with a rather minimal amount of them.
      
      I have tested this commit and test.sh with v11 as minimum across all the
      branches where this is applied.  Note that this commit has no impact on
      the normal pg_upgrade test run with a simple "make check".
      
      Author:  Justin Pryzby, Andrew Dunstan, Michael Paquier
      Discussion: https://postgr.es/m/20201206180248.GI24052@telsasoft.com
      Backpatch-through: 12
      f4e1c889
  6. 12 Oct, 2021 4 commits
    • Peter Geoghegan's avatar
      Doc: normalize vacuum_multixact_failsafe_age ID. · 070c402b
      Peter Geoghegan authored
      Author: Pavel Luzanov <p.luzanov@postgrespro.ru>
      Discussion: https://postgr.es/m/c71a3cfc-a267-3d9f-1b44-fbd668d0ab10@postgrespro.ru
      Backpatch: 14-, where the failsafe was introduced.
      070c402b
    • Michael Paquier's avatar
      Add more $Test::Builder::Level in the TAP tests · d834ebcf
      Michael Paquier authored
      Incrementing the level of the call stack reported is useful for
      debugging purposes as it allows to control which part of the test is
      exactly failing, especially if a test is structured with subroutines
      that call routines from Test::More.
      
      This adds more incrementations of $Test::Builder::Level where debugging
      gets improved (for example it does not make sense for some paths like
      pg_rewind where long subroutines are used).
      
      A note is added to src/test/perl/README about that, based on a
      suggestion from Andrew Dunstan and a wording coming from both of us.
      
      Usage of Test::Builder::Level has spread in 12, so a backpatch down to
      this version is done.
      
      Reviewed-by: Andrew Dunstan, Peter Eisentraut, Daniel Gustafsson
      Discussion: https://postgr.es/m/YV1CCFwgM1RV1LeS@paquier.xyz
      Backpatch-through: 12
      d834ebcf
    • Fujii Masao's avatar
      Make autovacuum launcher more responsive to pg_log_backend_memory_contexts(). · 62e821ad
      Fujii Masao authored
      Previously when pg_log_backend_memory_contexts() sent the request to
      the autovacuum launcher, it could take more than several seconds to
      log its memory contexts. Because the function (HandleAutoVacLauncherInterrupts)
      to process any new interrupts that autovacuum launcher received
      didn't handle the request for logging of memory contexts. This commit changes
      the function so that it handles the request, to make autovacuum launcher
      more responsitve to pg_log_backend_memory_contexts().
      
      Back-patch to v14 where pg_log_backend_memory_contexts() was added.
      
      Author: Koyu Tanigawa
      Reviewed-by: Bharath Rupireddy, Atsushi Torikoshi
      Discussion: https://postgr.es/m/0aae3e074face409b35153451be5cc11@oss.nttdata.com
      62e821ad
    • Peter Geoghegan's avatar
      amcheck: Skip unlogged relations in Hot Standby. · e7712155
      Peter Geoghegan authored
      Have verify_heapam.c treat unlogged relations as if they were simply
      empty when in Hot Standby mode.  This brings it in line with
      verify_nbtree.c, which has handled unlogged relations in the same way
      since bugfix commit 6754fe65.  This was an oversight in commit
      866e24d4, which extended contrib/amcheck to check heap relations.
      
      In passing, lower the verbosity used when reporting that a relation has
      been skipped like this, from NOTICE to DEBUG1.  This is appropriate
      because the skipping behavior is only an implementation detail, needed
      to work around the fact that unlogged tables don't have smgr-level
      storage for their main fork when in Hot Standby mode.
      
      Affected unlogged relations should be considered "trivially verified",
      not skipped over.  They are verified in the same sense that a totally
      empty relation can be verified.  This behavior seems least surprising
      overall, since unlogged relations on a replica will initially be empty
      if and when the replica is promoted and Hot Standby ends.
      
      Author: Mark Dilger <mark.dilger@enterprisedb.com>
      Reviewed-By: default avatarPeter Geoghegan <pg@bowt.ie>
      Discussion: https://postgr.es/m/CAH2-Wzk_pukOFY7JmdiFLsrz+Pd3V8OwgC1TH2Vd5BH5ZgK4bA@mail.gmail.com
      Backpatch: 14-, where heapam verification was introduced.
      e7712155
  7. 11 Oct, 2021 1 commit
    • Tom Lane's avatar
      Fix EXPLAIN of SEARCH BREADTH FIRST queries some more. · 2c25db32
      Tom Lane authored
      Commit 3f50b8263 had an oversight: formerly, to deparse expressions
      attached to a plan node, it was only necessary to update the
      deparse_namespace ancestors list alongside calling set_deparse_plan.
      Now it's necessary to update the ancestors list *first*, because
      set_deparse_plan consults it, and one call site got that wrong.
      
      This error was masked in most cases because explain.c uses just one
      List object for the ancestors list, updating it in-place as the plan
      is scanned, so that we accidentally had the right List assigned to
      dpns->ancestors before it was needed.  It would fail only if a
      WorkTableScan node were the first one that we tried to deparse a
      subexpression of.
      
      Per report from Markus Winand.  Like the previous patch,
      back-patch to v14.
      
      Discussion: https://postgr.es/m/648B0505-AA57-42C2-A2DA-E551DE46FA15@winand.at
      2c25db32
  8. 07 Oct, 2021 2 commits
  9. 06 Oct, 2021 3 commits
  10. 05 Oct, 2021 1 commit
  11. 04 Oct, 2021 5 commits
    • Bruce Momjian's avatar
      doc: remove URL for ICU explorer/locexp · 1f00a290
      Bruce Momjian authored
      The old URL was HTTP 404 and the git link didn't build.  Also update two
      other ICU links.  If we ever get a good link we will add it back.
      
      Reported-by: Anton Voloshin
      
      Author: Laurenz Albe
      
      Backpatch-through: 10
      1f00a290
    • Andres Freund's avatar
      Fix TestLib::slurp_file() with offset on windows. · c4465cd0
      Andres Freund authored
      3c5b0685 used setFilePointer() to set the position of the filehandle, but
      passed the wrong filehandle, always leaving the position at 0. Instead of just
      fixing that, remove use of setFilePointer(), we have a perl fd at this point,
      so we can just use perl's seek().
      
      Additionally, the perl filehandle wasn't closed, just the windows filehandle.
      Reviewed-By: default avatarAndrew Dunstan <andrew@dunslane.net>
      Author: Andres Freund <andres@anarazel.de>
      Discussion: https://postgr.es/m/20211003173038.64mmhgxctfqn7wl6@alap3.anarazel.de
      Backpatch: 9.6-, like 3c5b0685
      c4465cd0
    • Tom Lane's avatar
      Update our mapping of Windows time zone names some more. · 919c08d9
      Tom Lane authored
      Per discussion, let's just follow CLDR's default zone mappings
      faithfully.  There are two changes here that are clear improvements:
      
      * Mapping "Greenwich Standard Time" to Atlantic/Reykjavik is actually
      a better fit than using London, because Iceland hasn't observed DST
      since 1968, so this is more nearly what people might expect.
      
      * Since the "Samoa" zone is specified to be UTC+13:00, we must map
      it to Pacific/Apia not Pacific/Samoa; the latter refers to American
      Samoa which is now on the other side of the date line.
      
      The rest of these changes look like they're choosing the most populous
      IANA zone as representative.  Whatever the details, we're just going
      to say "if you don't like this mapping, complain to CLDR".
      
      Discussion: https://postgr.es/m/3266414.1633045628@sss.pgh.pa.us
      919c08d9
    • Tom Lane's avatar
      Doc: fix minor issues in GiST support function documentation. · 5f461800
      Tom Lane authored
      gist.sgml and xindex.sgml hadn't been fully updated for the
      addition of a sortsupport support function (commit 16fa9b2b).
      xindex.sgml also missed that the compress and decompress support
      functions are optional, an apparently far older oversight.
      
      In passing, fix gratuitous inconsistencies in wording and
      capitalization.
      
      Noted by E. Rogov.  Back-patch to v14; the residual issues
      before that aren't significant enough to bother with.
      
      Discussion: https://postgr.es/m/163335322905.12519.5711557029494638051@wrigleys.postgresql.org
      5f461800
    • Michael Paquier's avatar
      Fix snapshot builds during promotion of hot standby node with 2PC · 828f7f00
      Michael Paquier authored
      Some specific logic is done at the end of recovery when involving 2PC
      transactions:
      1) Call RecoverPreparedTransactions(), to recover the state of 2PC
      transactions into memory (re-acquire locks, etc.).
      2) ShutdownRecoveryTransactionEnvironment(), to move back to normal
      operations, mainly cleaning up recovery locks and KnownAssignedXids
      (including any 2PC transaction tracked previously).
      3) Switch XLogCtl->SharedRecoveryState to RECOVERY_STATE_DONE, which is
      the tipping point for any process calling RecoveryInProgress() to check
      if the cluster is still in recovery or not.
      
      Any snapshot taken between steps 2) and 3) would be empty, causing any
      transaction relying on a snapshot at this point to potentially corrupt
      data as there could still be some 2PC transactions to track, with
      RecentXmin moving backwards on successive calls to GetSnapshotData() in
      the same transaction.
      
      As SharedRecoveryState is the point to take into account to know if it
      is safe to discard KnownAssignedXids, this commit moves step 2) after
      step 3), so as we can never finish with empty snapshots.
      
      This exists since the introduction of hot standby, so backpatch all the
      way down.  The window with incorrect snapshots is extremely small, but I
      have seen it when running 023_pitr_prepared_xact.pl, as did buildfarm
      member fairywren.  Thomas Munro also found it independently.  Special
      thanks to Andres Freund for taking the time to analyze this issue.
      
      Reported-by: Thomas Munro, Michael Paquier
      Analyzed-by: Andres Freund
      Discussion: https://postgr.es/m/20210422203603.fdnh3fu2mmfp2iov@alap3.anarazel.de
      Backpatch-through: 9.6
      828f7f00
  12. 03 Oct, 2021 1 commit
    • Tom Lane's avatar
      Fix checking of query type in plpgsql's RETURN QUERY command. · e0eba586
      Tom Lane authored
      Prior to v14, we insisted that the query in RETURN QUERY be of a type
      that returns tuples.  (For instance, INSERT RETURNING was allowed,
      but not plain INSERT.)  That happened indirectly because we opened a
      cursor for the query, so spi.c checked SPI_is_cursor_plan().  As a
      consequence, the error message wasn't terribly on-point, but at least
      it was there.
      
      Commit 2f48ede0 lost this detail.  Instead, plain RETURN QUERY
      insisted that the query be a SELECT (by checking for SPI_OK_SELECT)
      while RETURN QUERY EXECUTE failed to check the query type at all.
      Neither of these changes was intended.
      
      The only convenient place to check this in the EXECUTE case is inside
      _SPI_execute_plan, because we haven't done parse analysis until then.
      So we need to pass down a flag saying whether to enforce that the
      query returns tuples.  Fortunately, we can squeeze another boolean
      into struct SPIExecuteOptions without an ABI break, since there's
      padding space there.  (It's unlikely that any extensions would
      already be using this new struct, but preserving ABI in v14 seems
      like a smart idea anyway.)
      
      Within spi.c, it seemed like _SPI_execute_plan's parameter list
      was already ridiculously long, and I didn't want to make it longer.
      So I thought of passing SPIExecuteOptions down as-is, allowing that
      parameter list to become much shorter.  This makes the patch a bit
      more invasive than it might otherwise be, but it's all internal to
      spi.c, so that seems fine.
      
      Per report from Marc Bachmann.  Back-patch to v14 where the
      faulty code came in.
      
      Discussion: https://postgr.es/m/1F2F75F0-27DF-406F-848D-8B50C7EEF06A@gmail.com
      e0eba586
  13. 02 Oct, 2021 2 commits
    • Tom Lane's avatar
      Update our mapping of Windows time zone names using CLDR info. · fa8db487
      Tom Lane authored
      This corrects a bunch of entries in win32_tzmap[], and adds a few
      new ones, based on the CLDR project's windowsZones.xml file.
      Non-cosmetic changes fall into four main categories:
      
      * Flat-out errors:
      
      US/Aleutan doesn't exist
      America/Salvador doesn't exist
      Asia/Baku is wrong for Yerevan
      Asia/Dhaka (Bangladesh) is wrong for Astana (Kazakhstan)
      Europe/Bucharest is wrong for Chisinau
      America/Mexico_City is wrong for Chetumal
      America/Buenos_Aires is wrong for Cayenne
      America/Caracas has its own zone, so poor fit for La Paz
      US/Eastern is wrong for Haiti
      US/Eastern is wrong for Indiana (East)
      Asia/Karachi is wrong for Tashkent
      Etc/UTC+12 doesn't exist
      Signs of Etc/GMT zones were backwards
      
      * Judgment calls:
      
      (These changes follow CLDR's choices, except for the first one)
      
      Use Europe/London for "Greenwich Standard Time", since that seems much
      more likely than Africa/Casablanca to be what people will think that
      zone name means.  CLDR has Atlantic/Reykjavik here, but that's no better.
      
      Asia/Shanghai seems a better fit than Hong Kong for "China Standard
      Time".
      
      Europe/Sarajevo is now a link to Belgrade, ie "Central Europe Standard
      Time"; so use Warsaw for "Central European Standard Time".
      
      America/Sao_Paulo seems more representative than Araguaina for
      "E. South America Standard Time".
      
      Africa/Johannesburg seems more representative than Harare for
      "South Africa Standard Time".
      
      * New Windows zone names:
      
      "Israel Standard Time"
      "Kaliningrad Standard Time"
      "Russia Time Zone N" for various N
      "Singapore Standard Time"
      "South Sudan Standard Time"
      "W. Central Africa Standard Time"
      "West Bank Standard Time"
      "Yukon Standard Time"
      
      Some of these replace older spellings, but I kept the older spellings
      too in case our code runs on a machine with the older data.
      
      * Replace aliases (tzdb Links) with underlying city-named zones:
      
      (This tracks tzdb's longstanding practice, and reduces inconsistency
      with the rest of the entries, as well as with CLDR.)
      
      US/Alaska
      Asia/Kuwait
      Asia/Muscat
      Canada/Atlantic
      Australia/Canberra
      Canada/Saskatchewan
      US/Central
      US/Eastern
      US/Hawaii
      US/Mountain
      Canada/Newfoundland
      US/Pacific
      
      Back-patch to all supported branches, as is our usual practice for
      time zone data updates.
      
      Discussion: https://postgr.es/m/3266414.1633045628@sss.pgh.pa.us
      fa8db487
    • Tom Lane's avatar
      Re-alphabetize the win32_tzmap[] array. · 81464999
      Tom Lane authored
      The original intent seems to have been to sort case-insensitively
      by the Windows zone name, but various changes over the years did
      not get that memo.  This commit just moves a few entries to
      restore exact alphabetic order, to ease comparison to the outputs
      of processing scripts.
      
      Back-patch to all supported branches, as is our usual practice for
      time zone data updates.
      
      Discussion: https://postgr.es/m/3266414.1633045628@sss.pgh.pa.us
      81464999
  14. 01 Oct, 2021 7 commits
  15. 30 Sep, 2021 2 commits
    • Tom Lane's avatar
      Remove gratuitous environment dependency in 002_types.pl test. · afc6081f
      Tom Lane authored
      Computing related timestamps by subtracting "N days" is sensitive
      to the prevailing timezone, since we interpret that as "same local
      time on the N'th prior day".  Even though the intervals in question
      are only two to four days, through remarkable bad luck they managed
      to cross the end of Ramadan in 2014, causing the test's output to
      change if timezone is set to Africa/Casablanca.  (Maybe in other
      Muslim areas as well; I didn't check.)  There's absolutely no reason
      for this test to exercise interval subtraction, so just get rid of
      that and use plain timestamptz constants representing the intended
      values.
      
      Per report from Andres Freund.  Back-patch to v10 where this test
      script came in.
      
      Discussion: https://postgr.es/m/20210930183641.7lh4jhvpipvromca@alap3.anarazel.de
      afc6081f
    • Alvaro Herrera's avatar
      Repair two portability oversights of new test · e3731bac
      Alvaro Herrera authored
      First, as pointed out by Tom Lane and Michael Paquier, I failed to
      realize that Windows' PostgresNode needs an extra pg_hba.conf line
      (added by PostgresNode->set_replication_conf, called internally by
      ->init() when 'allows_streaming=>1' is given -- but I purposefully
      omitted that).  I think a good fix should be to have nodes with only
      'has_archiving=>1' set up for replication too, but that's a bigger
      discussion.  Fix it by calling ->set_replication_conf, which is not
      unprecedented, as pointed out by Andrew Dunstan.
      
      I also forgot to uncomment a ->finish() call for a pumpable IPC::Run
      file descriptor.  Apparently this is innocuous in almost all platforms.
      
      Backpatch to 14.  The older branches were added this file too, but not
      this particular part of the test.
      
      Discussion: https://postgr.es/m/3000074.1632947632@sss.pgh.pa.us
      Discussion: https://postgr.es/m/YVT7qwhR8JmC2kfz@paquier.xyz
      e3731bac