1. 07 Apr, 2019 1 commit
  2. 06 Apr, 2019 3 commits
  3. 05 Apr, 2019 15 commits
    • Tom Lane's avatar
      Use Append rather than MergeAppend for scanning ordered partitions. · 959d00e9
      Tom Lane authored
      If we need ordered output from a scan of a partitioned table, but
      the ordering matches the partition ordering, then we don't need to
      use a MergeAppend to combine the pre-ordered per-partition scan
      results: a plain Append will produce the same results.  This
      both saves useless comparison work inside the MergeAppend proper,
      and allows us to start returning tuples after istarting up just
      the first child node not all of them.
      
      However, all is not peaches and cream, because if some of the
      child nodes have high startup costs then there will be big
      discontinuities in the tuples-returned-versus-elapsed-time curve.
      The planner's cost model cannot handle that (yet, anyway).
      If we model the Append's startup cost as being just the first
      child's startup cost, we may drastically underestimate the cost
      of fetching slightly more tuples than are available from the first
      child.  Since we've had bad experiences with over-optimistic choices
      of "fast start" plans for ORDER BY LIMIT queries, that seems scary.
      As a klugy workaround, set the startup cost estimate for an ordered
      Append to be the sum of its children's startup costs (as MergeAppend
      would).  This doesn't really describe reality, but it's less likely
      to cause a bad plan choice than an underestimated startup cost would.
      In practice, the cases where we really care about this optimization
      will have child plans that are IndexScans with zero startup cost,
      so that the overly conservative estimate is still just zero.
      
      David Rowley, reviewed by Julien Rouhaud and Antonin Houska
      
      Discussion: https://postgr.es/m/CAKJS1f-hAqhPLRk_RaSFTgYxd=Tz5hA7kQ2h4-DhJufQk8TGuw@mail.gmail.com
      959d00e9
    • Alvaro Herrera's avatar
      Add facility to copy replication slots · 9f06d79e
      Alvaro Herrera authored
      This allows the user to create duplicates of existing replication slots,
      either logical or physical, and even changing properties such as whether
      they are temporary or the output plugin used.
      
      There are multiple uses for this, such as initializing multiple replicas
      using the slot for one base backup; when doing investigation of logical
      replication issues; and to select a different output plugins.
      
      Author: Masahiko Sawada
      Reviewed-by: Michael Paquier, Andres Freund, Petr Jelinek
      Discussion: https://postgr.es/m/CAD21AoAm7XX8y_tOPP6j4Nzzch12FvA1wPqiO690RCk+uYVstg@mail.gmail.com
      9f06d79e
    • Thomas Munro's avatar
      Wake up interested backends when a checkpoint fails. · de2b3841
      Thomas Munro authored
      Commit c6c9474a switched to condition variables instead of sleep
      loops to notify backends of checkpoint start and stop, but forgot
      to broadcast in case of checkpoint failure.
      
      Author: Thomas Munro
      Discussion: https://postgr.es/m/CA%2BhUKGJKbCd%2B_K%2BSEBsbHxVT60SG0ivWHHAdvL0bLTUt2xpA2w%40mail.gmail.com
      de2b3841
    • Robert Haas's avatar
      Fix missing word. · 6665305e
      Robert Haas authored
      Nathan Bossart
      
      Discussion: http://postgr.es/m/2C63765B-AD31-4F6C-8DA7-C8544634C714@amazon.com
      6665305e
    • Tom Lane's avatar
      Ensure consistent name matching behavior in processSQLNamePattern(). · 478cacb5
      Tom Lane authored
      Prior to v12, if you used a collation-sensitive regex feature in a
      pattern handled by processSQLNamePattern() (for instance, \d '\\w+'
      in psql), the behavior you got matched the database's default collation.
      Since commit 586b98fd you'd usually get C-collation behavior, because
      the catalog "name"-type columns are now marked as COLLATE "C".  Add
      explicit COLLATE specifications to restore the prior behavior.
      
      (Note for whoever writes the v12 release notes: the need for this shows
      that while 586b98fd preserved pre-v12 behavior of "name" columns for
      simple comparison operators, it changed the behavior of regex operators
      on those columns.  Although this patch fixes it for pattern matches
      generated by our own tools, user-written queries will still be affected.
      So we'd better mention this issue as a compatibility item.)
      
      Daniel Vérité
      
      Discussion: https://postgr.es/m/701e51f0-0ec0-4e70-a365-1958d66dd8d2@manitou-mail.org
      478cacb5
    • Andres Freund's avatar
      table: docs: fix typos and grammar. · 86cc06d1
      Andres Freund authored
      Author: Justin Pryzby
      Discussion: https://postgr.es/m/20190404055138.GA24864@telsasoft.com
      86cc06d1
    • Etsuro Fujita's avatar
      Doc: Update documentation on partitioning vs. foreign tables. · 3e6b0c47
      Etsuro Fujita authored
      The limitations that it is not allowed to create/attach a foreign table
      as a partition of an indexed partitioned table were not documented.
      
      Reported-By: Stepan Yankevych
      Author: Etsuro Fujita
      Reviewed-By: Amit Langote
      Backpatch-through: 11 where partitioned index was introduced
      Discussion: https://postgr.es/m/1553869152.858391073.5f8m3n0x@frv53.fwdcdn.com
      3e6b0c47
    • Peter Eisentraut's avatar
      Fix compiler warning · edda32ee
      Peter Eisentraut authored
      Rewrite get_attgenerated() to avoid compiler warning if the compiler
      does not recognize that elog(ERROR) does not return.
      Reported-by: default avatarDavid Rowley <david.rowley@2ndquadrant.com>
      edda32ee
    • Noah Misch's avatar
      Revert "Consistently test for in-use shared memory." · 82150a05
      Noah Misch authored
      This reverts commits 2f932f71,
      16ee6eaf and
      6f0e1900.  The buildfarm has revealed
      several bugs.  Back-patch like the original commits.
      
      Discussion: https://postgr.es/m/20190404145319.GA1720877@rfd.leadboat.com
      82150a05
    • Thomas Munro's avatar
      Fix bugs in mdsyncfiletag(). · 794c543b
      Thomas Munro authored
      Commit 3eb77eba moved a _mdfd_getseg() call from mdsync() into a new
      callback function mdsyncfiletag(), but didn't get the arguments quite
      right.  Without the EXTENSION_DONT_CHECK_SIZE flag we fail to open a
      segment if lower-numbered segments have been truncated, and it wants
      a block number rather than a segment number.
      
      While comparing with the older coding, also remove an unnecessary
      clobbering of errno, and adjust the code in mdunlinkfiletag() to
      ressemble the original code from mdpostckpt() more closely instead
      of using an unnecessary call to smgropen().
      
      Author: Thomas Munro
      Discussion: https://postgr.es/m/CA%2BhUKGL%2BYLUOA0eYiBXBfwW%2BbH5kFgh94%3DgQH0jHEJ-t5Y91wQ%40mail.gmail.com
      794c543b
    • Stephen Frost's avatar
      Handle errors during GSSAPI startup better · c46c85d4
      Stephen Frost authored
      There was some confusion over the format of the error message returned
      from the server during GSSAPI startup; specifically, it was expected
      that a length would be returned when, in reality, at this early stage in
      the startup sequence, no length is returned from the server as part of
      an error message.
      
      Correct the client-side code for dealing with error messages sent by the
      server during startup by simply reading what's available into our
      buffer, after we've discovered it's an error message, and then reporting
      back what was returned.
      
      In passing, also add in documentation of the environment variable
      PGGSSENCMODE which was missed previously, and adjust the code to look
      for the PGGSSENCMODE variable (the environment variable change was
      missed in the prior GSSMODE -> GSSENCMODE commit).
      
      Error-handling issue discovered by Peter Eisentraut, the rest were items
      discovered during testing of the error handling.
      c46c85d4
    • Michael Paquier's avatar
      Fix some documentation in pg_rewind · bfc80683
      Michael Paquier authored
      Since 11, it is possible to use a non-superuser role when using an
      online source cluster with pg_rewind as long as the role has proper
      permissions to execute on the source all the functions used by
      pg_rewind, and the documentation stated that a superuser is necessary.
      Let's add at the same time all the details needed to create such a
      role.
      
      A second confusion which comes a lot from users is that it is necessary
      to issue a checkpoint on a freshly-promoted standby so as its control
      file has up-to-date timeline information which is used by pg_rewind to
      validate the operation.  Let's document that properly.  This is
      back-patched down to 9.5 where pg_rewind has been introduced.
      
      Author: Michael Paquier
      Reviewed-by: Magnus Hagander
      Discussion: https://postgr.es/m/CABUevEz5bpvbwVsYCaSMV80CBZ5-82nkMzbb+Bu=h1m=rLdn=g@mail.gmail.com
      Backpatch-through: 9.5
      bfc80683
    • Andres Freund's avatar
    • Andres Freund's avatar
      Harden tableam against nonexistant / wrong kind of AMs. · ea97e440
      Andres Freund authored
      Previously it was allowed to set default_table_access_method to an
      empty string. That makes sense for default_tablespace, where that was
      copied from, as it signals falling back to the database's default
      tablespace. As there is no equivalent for table AMs, forbid that.
      
      Also make sure to throw a usable error when creating a table using an
      index AM, by using get_am_type_oid() to implement get_table_am_oid()
      instead of a separate copy. Previously we'd error out only later, in
      GetTableAmRoutine().
      
      Thirdly remove GetTableAmRoutineByAmId() - it was only used in an
      earlier version of 8586bf7e.
      
      Add tests for the above (some for index AMs as well).
      ea97e440
    • Peter Geoghegan's avatar
      Add test coverage for rootdescend verification. · 344b7e11
      Peter Geoghegan authored
      Commit c1afd175, which added support for rootdescend verification to
      amcheck, added only minimal regression test coverage.  Address this by
      making sure that rootdescend verification is run on a multi-level index.
      In passing, simplify some of the regression tests that exercise
      multi-level nbtree page deletion.
      
      Both issues spotted while rereviewing coverage of the nbtree patch
      series using gcov.
      344b7e11
  4. 04 Apr, 2019 16 commits
    • Andres Freund's avatar
      tableam: Add table_multi_insert() and revamp/speed-up COPY FROM buffering. · 86b85044
      Andres Freund authored
      This adds table_multi_insert(), and converts COPY FROM, the only user
      of heap_multi_insert, to it.
      
      A simple conversion of COPY FROM use slots would have yielded a
      slowdown when inserting into a partitioned table for some
      workloads. Different partitions might need different slots (both slot
      types and their descriptors), and dropping / creating slots when
      there's constant partition changes is measurable.
      
      Thus instead revamp the COPY FROM buffering for partitioned tables to
      allow to buffer inserts into multiple tables, flushing only when
      limits are reached across all partition buffers. By only dropping
      slots when there've been inserts into too many different partitions,
      the aforementioned overhead is gone. By allowing larger batches, even
      when there are frequent partition changes, we actuall speed such cases
      up significantly.
      
      By using slots COPY of very narrow rows into unlogged / temporary
      might slow down very slightly (due to the indirect function calls).
      
      Author: David Rowley, Andres Freund, Haribabu Kommi
      Discussion:
          https://postgr.es/m/20180703070645.wchpu5muyto5n647@alap3.anarazel.de
          https://postgr.es/m/20190327054923.t3epfuewxfqdt22e@alap3.anarazel.de
      86b85044
    • Tom Lane's avatar
      Add a "SQLSTATE-only" error verbosity option to libpq and psql. · 7bac3aca
      Tom Lane authored
      This is intended for use mostly in test scripts for external tools,
      which could do without cross-PG-version variations in error message
      wording.  Of course, the SQLSTATE isn't guaranteed stable either, but
      it should be more so than the error message text.
      
      Note: there's a bit of an ABI change for libpq here, but it seems
      OK because if somebody compiles against a newer version of libpq-fe.h,
      and then tries to pass PQERRORS_SQLSTATE to PQsetErrorVerbosity()
      of an older libpq library, it will be accepted and then act like
      PQERRORS_DEFAULT, thanks to the way the tests in pqBuildErrorMessage3
      have historically been phrased.  That seems acceptable.
      
      Didier Gautheron, reviewed by Dagfinn Ilmari Mannsåker
      
      Discussion: https://postgr.es/m/CAJRYxuKyj4zA+JGVrtx8OWAuBfE-_wN4sUMK4H49EuPed=mOBw@mail.gmail.com
      7bac3aca
    • Alvaro Herrera's avatar
      pg_restore: Require "-f -" to mean stdout · 413ccaa7
      Alvaro Herrera authored
      The previous convention that stdout was selected by default when nothing
      is specified was just too error-prone.
      
      After a suggestion from Andrew Gierth.
      Author: Euler Taveira
      Reviewed-by: Yoshikazu Imai, José Arthur Benetasso Villanova
      Discussion: https://postgr.es/m/87sgwrmhdv.fsf@news-spur.riddles.org.uk
      413ccaa7
    • Tom Lane's avatar
      Make queries' locking of indexes more consistent. · 9c703c16
      Tom Lane authored
      The assertions added by commit b04aeb0a exposed that there are some
      code paths wherein the executor will try to open an index without
      holding any lock on it.  We do have some lock on the index's table,
      so it seems likely that there's no fatal problem with this (for
      instance, the index couldn't get dropped from under us).  Still,
      it's bad practice and we should fix it.
      
      To do so, remove the optimizations in ExecInitIndexScan and friends
      that tried to avoid taking a lock on an index belonging to a target
      relation, and just take the lock always.  In non-bug cases, this
      will result in no additional shared-memory access, since we'll find
      in the local lock table that we already have a lock of the desired
      type; hence, no significant performance degradation should occur.
      
      Also, adjust the planner and executor so that the type of lock taken
      on an index is always identical to the type of lock taken for its table,
      by relying on the recently added RangeTblEntry.rellockmode field.
      This avoids some corner cases where that might not have been true
      before (possibly resulting in extra locking overhead), and prevents
      future maintenance issues from having multiple bits of logic that
      all needed to be in sync.  In addition, this change removes all core
      calls to ExecRelationIsTargetRelation, which avoids a possible O(N^2)
      startup penalty for queries with large numbers of target relations.
      (We'd probably remove that function altogether, were it not that we
      advertise it as something that FDWs might want to use.)
      
      Also adjust some places in selfuncs.c to not take any lock on indexes
      they are transiently opening, since we can assume that plancat.c
      did that already.
      
      In passing, change gin_clean_pending_list() to take RowExclusiveLock
      not AccessShareLock on its target index.  Although it's not clear that
      that's actually a bug, it seemed very strange for a function that's
      explicitly going to modify the index to use only AccessShareLock.
      
      David Rowley, reviewed by Julien Rouhaud and Amit Langote,
      a bit of further tweaking by me
      
      Discussion: https://postgr.es/m/19465.1541636036@sss.pgh.pa.us
      9c703c16
    • Robert Haas's avatar
      Allow VACUUM to be run with index cleanup disabled. · a96c41fe
      Robert Haas authored
      This commit adds a new reloption, vacuum_index_cleanup, which
      controls whether index cleanup is performed for a particular
      relation by default.  It also adds a new option to the VACUUM
      command, INDEX_CLEANUP, which can be used to override the
      reloption.  If neither the reloption nor the VACUUM option is
      used, the default is true, as before.
      
      Masahiko Sawada, reviewed and tested by Nathan Bossart, Alvaro
      Herrera, Kyotaro Horiguchi, Darafei Praliaskouski, and me.
      The wording of the documentation is mostly due to me.
      
      Discussion: http://postgr.es/m/CAD21AoAt5R3DNUZSjOoXDUY=naYPUOuffVsRzuTYMz29yLzQCA@mail.gmail.com
      a96c41fe
    • Peter Geoghegan's avatar
      Invalidate binary search bounds consistently. · 74eb2176
      Peter Geoghegan authored
      _bt_check_unique() failed to invalidate binary search bounds in the
      event of a live conflict following commit e5adcb78.  This resulted in
      problems after waiting for the conflicting xact to commit or abort.  The
      subsequent call to _bt_check_unique() would restore the initial binary
      search bounds, rather than starting a new search.  Fix by explicitly
      invalidating bounds when it becomes clear that there is a live conflict
      that insertion will have to wait to resolve.
      
      Ashutosh Sharma, with a few additional tweaks by me.
      
      Author: Ashutosh Sharma
      Reported-By: Ashutosh Sharma
      Diagnosed-By: Ashutosh Sharma
      Discussion: https://postgr.es/m/CAE9k0PnQp-qr-UYKMSCzdC2FBzdE4wKP41hZrZvvP26dKLonLg@mail.gmail.com
      74eb2176
    • Stephen Frost's avatar
      Move the be_gssapi_get_* prototypes · 87e16db5
      Stephen Frost authored
      The be_gssapi_get_* prototypes were put close to similar ones for SSL-
      but a bit too close since that meant they ended up only being included
      for SSL-enabled builds.  Move those to be under ENABLE_GSS instead.
      
      Pointed out by Tom.
      87e16db5
    • Thomas Munro's avatar
      Refactor the fsync queue for wider use. · 3eb77eba
      Thomas Munro authored
      Previously, md.c and checkpointer.c were tightly integrated so that
      fsync calls could be handed off and processed in the background.
      Introduce a system of callbacks and file tags, so that other modules
      can hand off fsync work in the same way.
      
      For now only md.c uses the new interface, but other users are being
      proposed.  Since there may be use cases that are not strictly SMGR
      implementations, use a new function table for sync handlers rather
      than extending the traditional SMGR one.
      
      Instead of using a bitmapset of segment numbers for each RelFileNode
      in the checkpointer's hash table, make the segment number part of the
      key.  This requires sending explicit "forget" requests for every
      segment individually when relations are dropped, but suits the file
      layout schemes of proposed future users better (ie sparse or high
      segment numbers).
      
      Author: Shawn Debnath and Thomas Munro
      Reviewed-by: Thomas Munro, Andres Freund
      Discussion: https://postgr.es/m/CAEepm=2gTANm=e3ARnJT=n0h8hf88wqmaZxk0JYkxw+b21fNrw@mail.gmail.com
      3eb77eba
    • Peter Eisentraut's avatar
      file_fdw: Fix for generated columns · 33215d11
      Peter Eisentraut authored
      Since file_fdw uses COPY internally, but COPY doesn't allow listing
      generated columns in its column list, we need to make sure that we
      don't add generated columns to the column lists internally generated
      by file_fdw.
      Reported-by: default avatarErik Rijkers <er@xs4all.nl>
      33215d11
    • Noah Misch's avatar
      Silence -Wimplicit-fallthrough in sysv_shmem.c. · 6f0e1900
      Noah Misch authored
      Commit 2f932f71 added code that elicits
      a warning on buildfarm member flaviventris.  Back-patch to 9.4, like
      that commit.
      
      Reported by Andres Freund.
      
      Discussion: https://postgr.es/m/20190404020057.galelv7by75ekqrh@alap3.anarazel.de
      6f0e1900
    • Noah Misch's avatar
      Make src/test/recovery/t/017_shm.pl safe for concurrent execution. · 16ee6eaf
      Noah Misch authored
      Buildfarm members idiacanthus and komodoensis, which share a host, both
      executed this test in the same second.  That failed.  Back-patch to 9.6,
      where the test first appeared.
      
      Discussion: https://postgr.es/m/20190404020543.GA1319573@rfd.leadboat.com
      16ee6eaf
    • Michael Paquier's avatar
      Improve readability of some tests in strings.sql · 92c76021
      Michael Paquier authored
      c2513365 has added some tests to check if a toast relation should be
      empty or not, hardcoding the toast relation name when calling
      pg_relation_size().  pg_class.reltoastrelid offers the same information,
      so simplify the tests to use that.
      
      Reviewed-by: Daniel Gustafsson
      Discussion: https://postgr.es/m/20190403065949.GH3298@paquier.xyz
      92c76021
    • Andres Freund's avatar
      tableam: basic documentation. · b73c3a11
      Andres Freund authored
      This adds documentation about the user oriented parts of table access
      methods (i.e. the default_table_access_method GUC and the USING clause
      for CREATE TABLE etc), adds a basic chapter about the table access
      method interface, and adds a note to storage.sgml that it's contents
      don't necessarily apply for non-builtin AMs.
      
      Author: Haribabu Kommi and Andres Freund
      Discussion: https://postgr.es/m/20180703070645.wchpu5muyto5n647@alap3.anarazel.de
      b73c3a11
    • Noah Misch's avatar
      Assert that pgwin32_signal_initialize() has been called early enough. · ab9ed9be
      Noah Misch authored
      Before the pgwin32_signal_initialize() call, the backend version of
      pg_usleep() has no effect.  No in-tree code falls afoul of that today,
      but temporary commit 23078689 did so.
      
      Discussion: https://postgr.es/m/20190402135442.GA1173872@rfd.leadboat.com
      ab9ed9be
    • Noah Misch's avatar
      Handle USE_MODULE_DB for all tests able to use an installed postmaster. · f433394e
      Noah Misch authored
      When $(MODULES) and $(MODULE_big) are empty, derive the database name
      from the first element of $(REGRESS) instead of using a constant string.
      When deriving the database name from $(MODULES), use its first element
      instead of the entire list; the earlier approach would fail if any
      multi-module directory had $(REGRESS) tests.  Treat isolation suites and
      src/pl correspondingly.  Under USE_MODULE_DB=1, installcheck-world and
      check-world no longer reuse any database name in a given postmaster.
      Buildfarm members axolotl, mandrill and frogfish saw spurious "is being
      accessed by other users" failures that would not have happened without
      database name reuse.  (The CountOtherDBBackends() 5s deadline expired
      during DROP DATABASE; a backend for an earlier test suite had used the
      same database name and had not yet exited.)  Back-patch to 9.4 (all
      supported versions), except bits pertaining to isolation suites.
      
      Concept reviewed by Andrew Dunstan, Andres Freund and Tom Lane.
      
      Discussion: https://postgr.es/m/20190401135213.GE891537@rfd.leadboat.com
      f433394e
    • Noah Misch's avatar
      Consistently test for in-use shared memory. · 2f932f71
      Noah Misch authored
      postmaster startup scrutinizes any shared memory segment recorded in
      postmaster.pid, exiting if that segment matches the current data
      directory and has an attached process.  When the postmaster.pid file was
      missing, a starting postmaster used weaker checks.  Change to use the
      same checks in both scenarios.  This increases the chance of a startup
      failure, in lieu of data corruption, if the DBA does "kill -9 `head -n1
      postmaster.pid` && rm postmaster.pid && pg_ctl -w start".  A postmaster
      will no longer recycle segments pertaining to other data directories.
      That's good for production, but it's bad for integration tests that
      crash a postmaster and immediately delete its data directory.  Such a
      test now leaks a segment indefinitely.  No "make check-world" test does
      that.  win32_shmem.c already avoided all these problems.  In 9.6 and
      later, enhance PostgresNode to facilitate testing.  Back-patch to 9.4
      (all supported versions).
      
      Reviewed by Daniel Gustafsson and Kyotaro HORIGUCHI.
      
      Discussion: https://postgr.es/m/20130911033341.GD225735@tornado.leadboat.com
      2f932f71
  5. 03 Apr, 2019 5 commits