1. 20 Apr, 2016 2 commits
    • Tom Lane's avatar
      Fix memory leak and other bugs in ginPlaceToPage() & subroutines. · bde361fe
      Tom Lane authored
      Commit 36a35c55 turned the interface between ginPlaceToPage and
      its subroutines in gindatapage.c and ginentrypage.c into a royal mess:
      page-update critical sections were started in one place and finished in
      another place not even in the same file, and the very same subroutine
      might return having started a critical section or not.  Subsequent patches
      band-aided over some of the problems with this design by making things
      even messier.
      
      One user-visible resulting problem is memory leaks caused by the need for
      the subroutines to allocate storage that would survive until ginPlaceToPage
      calls XLogInsert (as reported by Julien Rouhaud).  This would not typically
      be noticeable during retail index updates.  It could be visible in a GIN
      index build, in the form of memory consumption swelling to several times
      the commanded maintenance_work_mem.
      
      Another rather nasty problem is that in the internal-page-splitting code
      path, we would clear the child page's GIN_INCOMPLETE_SPLIT flag well before
      entering the critical section that it's supposed to be cleared in; a
      failure in between would leave the index in a corrupt state.  There were
      also assorted coding-rule violations with little immediate consequence but
      possible long-term hazards, such as beginning an XLogInsert sequence before
      entering a critical section, or calling elog(DEBUG) inside a critical
      section.
      
      To fix, redefine the API between ginPlaceToPage() and its subroutines
      by splitting the subroutines into two parts.  The "beginPlaceToPage"
      subroutine does what can be done outside a critical section, including
      full computation of the result pages into temporary storage when we're
      going to split the target page.  The "execPlaceToPage" subroutine is called
      within a critical section established by ginPlaceToPage(), and it handles
      the actual page update in the non-split code path.  The critical section,
      as well as the XLOG insertion call sequence, are both now always started
      and finished in ginPlaceToPage().  Also, make ginPlaceToPage() create and
      work in a short-lived memory context to eliminate the leakage problem.
      (Since a short-lived memory context had been getting created in the most
      common code path in the subroutines, this shouldn't cause any noticeable
      performance penalty; we're just moving the overhead up one call level.)
      
      In passing, fix a bunch of comments that had gone unmaintained throughout
      all this klugery.
      
      Report: <571276DD.5050303@dalibo.com>
      bde361fe
    • Kevin Grittner's avatar
      Revert no-op changes to BufferGetPage() · a343e223
      Kevin Grittner authored
      The reverted changes were intended to force a choice of whether any
      newly-added BufferGetPage() calls needed to be accompanied by a
      test of the snapshot age, to support the "snapshot too old"
      feature.  Such an accompanying test is needed in about 7% of the
      cases, where the page is being used as part of a scan rather than
      positioning for other purposes (such as DML or vacuuming).  The
      additional effort required for back-patching, and the doubt whether
      the intended benefit would really be there, have indicated it is
      best just to rely on developers to do the right thing based on
      comments and existing usage, as we do with many other conventions.
      
      This change should have little or no effect on generated executable
      code.
      
      Motivated by the back-patching pain of Tom Lane and Robert Haas
      a343e223
  2. 19 Apr, 2016 1 commit
    • Tom Lane's avatar
      Improve regression tests for degree-based trigonometric functions. · 4db0d2d2
      Tom Lane authored
      Print the actual value of each function result that's expected to be exact,
      rather than merely emitting a NULL if it's not right.  Although we print
      these with extra_float_digits = 3, we should not trust that the platform
      will produce a result visibly different from the expected value if it's off
      only in the last place; hence, also include comparisons against the exact
      values as before.  This is a bit bulkier and uglier than the previous
      printout, but it will provide more information and be easier to interpret
      if there's a test failure.
      
      Discussion: <18241.1461073100@sss.pgh.pa.us>
      4db0d2d2
  3. 18 Apr, 2016 4 commits
    • Tom Lane's avatar
      Make partition-lock-release coding more transparent in BufferAlloc(). · a0382e2d
      Tom Lane authored
      Coverity complained that oldPartitionLock was possibly dereferenced after
      having been set to NULL.  That actually can't happen, because we'd only use
      it if (oldFlags & BM_TAG_VALID) is true.  But nonetheless Coverity is
      justified in complaining, because at line 1275 we actually overwrite
      oldFlags, and then still expect its BM_TAG_VALID bit to be a safe guide to
      whether to release the oldPartitionLock.  Thus, the code would be incorrect
      if someone else had changed the buffer's BM_TAG_VALID flag meanwhile.
      That should not happen, since we hold pin on the buffer throughout this
      sequence, but it's starting to look like a rather shaky chain of logic.
      And there's no need for such assumptions, because we can simply replace
      the (oldFlags & BM_TAG_VALID) tests with (oldPartitionLock != NULL),
      which has identical results and makes it plain to all comers that we don't
      dereference a null pointer.  A small side benefit is that the range of
      liveness of oldFlags is greatly reduced, possibly allowing the compiler
      to save a register.
      
      This is just cleanup, not an actual bug fix, so there seems no need
      for a back-patch.
      a0382e2d
    • Tom Lane's avatar
      Further reduce the number of semaphores used under --disable-spinlocks. · 75c24d0f
      Tom Lane authored
      Per discussion, there doesn't seem to be much value in having
      NUM_SPINLOCK_SEMAPHORES set to 1024: under any scenario where you are
      running more than a few backends concurrently, you really had better have a
      real spinlock implementation if you want tolerable performance.  And 1024
      semaphores is a sizable fraction of the system-wide SysV semaphore limit
      on many platforms.  Therefore, reduce this setting's default value to 128
      to make it less likely to cause out-of-semaphores problems.
      75c24d0f
    • Fujii Masao's avatar
      Fix typo in docs. · 8ce8307b
      Fujii Masao authored
      Artur Zakirov
      8ce8307b
    • Peter Eisentraut's avatar
      doc: Document that sequences can also be extension configuration tables · d460c7cc
      Peter Eisentraut authored
      From: Michael Paquier <michael.paquier@gmail.com>
      d460c7cc
  4. 17 Apr, 2016 1 commit
    • Tom Lane's avatar
      Avoid code duplication in \crosstabview. · 9603a325
      Tom Lane authored
      In commit 6f0d6a50 I added a duplicate copy of psqlscanslash's identifier
      downcasing code, but actually it's not hard to split that out as a callable
      subroutine and avoid the duplication.
      9603a325
  5. 16 Apr, 2016 7 commits
  6. 15 Apr, 2016 13 commits
    • Tom Lane's avatar
      Use less-generic names in matview.sql. · 4447f0bc
      Tom Lane authored
      The original coding of this test used table and view names like "t",
      "tv", "foo", etc.  This tended to interfere with doing simple manual
      tests in the regression database; not to mention that it posed a
      considerable risk of conflict with other regression test scripts.
      Prefix these names with "mvtest_" to avoid such conflicts.
      
      Also, change transiently-created role name to be "regress_xxx" per
      discussions about being careful with regression-test role creation.
      4447f0bc
    • Tom Lane's avatar
      Fix possible crash in ALTER TABLE ... REPLICA IDENTITY USING INDEX. · 8f1911d5
      Tom Lane authored
      Careless coding added by commit 07cacba9 could result in a crash
      or a bizarre error message if someone tried to select an index on the
      OID column as the replica identity index for a table.  Back-patch to 9.4
      where the feature was introduced.
      
      Discussion: CAKJS1f8TQYgTRDyF1_u9PVCKWRWz+DkieH=U7954HeHVPJKaKg@mail.gmail.com
      
      David Rowley
      8f1911d5
    • Robert Haas's avatar
      postgres_fdw: Clean up handling of system columns. · da7d44b6
      Robert Haas authored
      Previously, querying the xmin column of a single postgres_fdw foreign
      table fetched the tuple length, xmax the typmod, and cmin or cmax the
      composite type OID of the tuple.  However, when you queried several
      such tables and the join got shipped to the remote side, these columns
      ended up containing the remote values of the corresponding columns.
      Both behaviors are rather unprincipled, the former for obvious reasons
      and the latter because the remote values of these columns don't have
      any local significance; our transaction IDs are in a different space
      than those of the remote machine.  Clean this up by setting all of
      these fields to 0 in both cases.  Also fix the handling of tableoid
      to be sane.
      
      Robert Haas and Ashutosh Bapat, reviewed by Etsuro Fujita.
      da7d44b6
    • Robert Haas's avatar
      Tweak EXPLAIN for parallel query to show workers launched. · 5702277c
      Robert Haas authored
      The previous display was sort of confusing, because it didn't
      distinguish between the number of workers that we planned to launch
      and the number that actually got launched.  This has already confused
      several people, so display both numbers and label them clearly.
      
      Julien Rouhaud, reviewed by me.
      5702277c
    • Tom Lane's avatar
      Fix portability problem induced by commit a6f6b781. · 6b85d4ba
      Tom Lane authored
      pg_xlogdump includes bufmgr.h.  With a compiler that emits code for
      static inline functions even when they're unreferenced, that leads
      to unresolved external references in the new static-inline version
      of BufferGetPage().  So hide it with #ifndef FRONTEND, as we've done
      for similar issues elsewhere.  Per buildfarm member pademelon.
      6b85d4ba
    • Magnus Hagander's avatar
      Fix typo in comment · ba8fe38f
      Magnus Hagander authored
      ba8fe38f
    • Magnus Hagander's avatar
      Update helptext for vcregress.pl · cf086b1c
      Magnus Hagander authored
      This has clearly not been tracking the code changse for quite some time.
      
      Michael Paquier, problem spotted by Kyotaro HORIGUCHI
      cf086b1c
    • Fujii Masao's avatar
      Make regression test for multiple synchronous standbys more stable. · 36c1c916
      Fujii Masao authored
      The regression test checks whether the output of pg_stat_replication is
      expected or not after changing synchronous_standby_names and reloading
      the configuration file. Regarding this test logic, previously there was
      a timing issue which made the test result unstable. That is,
      pg_stat_replication could return unexpected result during small window
      after the configuration file was reloaded before new setting value
      took effect, and which made the test fail.
      
      This commit changes the test logic so that it uses a loop with a timeout
      to give some room for the test to pass. Now the test fails only when
      pg_stat_replication keeps returning unexpected result for 30 seconds.
      
      Michael Paquier
      36c1c916
    • Tom Lane's avatar
      Fix memory leak in GIN index scans. · f0e766bd
      Tom Lane authored
      The code had a query-lifespan memory leak when encountering GIN entries
      that have posting lists (rather than posting trees, ie, there are a
      relatively small number of heap tuples containing this index key value).
      With a suitable data distribution this could add up to a lot of leakage.
      Problem seems to have been introduced by commit 36a35c55, so back-patch
      to 9.4.
      
      Julien Rouhaud
      f0e766bd
    • Tom Lane's avatar
      Rethink \crosstabview's argument parsing logic. · 6f0d6a50
      Tom Lane authored
      \crosstabview interpreted its arguments in an unusual way, including
      doing case-insensitive matching of unquoted column names, which is
      surely not the right thing.  Rip that out in favor of doing something
      equivalent to the dequoting/case-folding rules used by other psql
      commands.  To keep it simple, change the syntax so that the optional
      sort column is specified as a separate argument, instead of the
      also-quite-unusual syntax that attached it to the colH argument with
      a colon.
      
      Also, rework the error messages to be closer to project style.
      6f0d6a50
    • Andres Freund's avatar
      Make init_spin_delay() C89 compliant #2. · 4b74c6a4
      Andres Freund authored
      My previous attempt at doing so, in 80abbeba, was not sufficient. While that
      fixed the problem for bufmgr.c and lwlock.c , s_lock.c still has non-constant
      expressions in the struct initializer, because the file/line/function
      information comes from the caller of s_lock().
      
      Give up on using a macro, and use a static inline instead.
      
      Discussion: 4369.1460435533@sss.pgh.pa.us
      4b74c6a4
    • Andres Freund's avatar
      Remove trailing commas in enums. · 533cd230
      Andres Freund authored
      These aren't valid C89. Found thanks to gcc's -Wc90-c99-compat. These
      exist in differing places in most supported branches.
      533cd230
    • Andres Freund's avatar
      Fix trivial typo. · 7b167812
      Andres Freund authored
      7b167812
  7. 14 Apr, 2016 10 commits
    • Tom Lane's avatar
      Fix core dump in ReorderBufferRestoreChange on alignment-picky platforms. · 6a3d3965
      Tom Lane authored
      When re-reading an update involving both an old tuple and a new tuple from
      disk, reorderbuffer.c was careless about whether the new tuple is suitably
      aligned for direct access --- in general, it isn't.  We'd missed seeing
      this in the buildfarm because the contrib/test_decoding tests exercise this
      code path only a few times, and by chance all of those cases have old
      tuples with length a multiple of 4, which is usually enough to make the
      access to the new tuple's t_len safe.  For some still-not-entirely-clear
      reason, however, Debian's sparc build gets a bus error, as reported by
      Christoph Berg; perhaps it's assuming 8-byte alignment of the pointer?
      
      The lack of previous field reports is probably because you need all of
      these conditions to trigger a crash: an alignment-picky platform (not
      Intel), a transaction large enough to spill to disk, an update within
      that xact that changes a primary-key field and has an odd-length old tuple,
      and of course logical decoding tracing the transaction.
      
      Avoid the alignment assumption by using memcpy instead of fetching t_len
      directly, and add a test case that exposes the crash on picky platforms.
      Back-patch to 9.4 where the bug was introduced.
      
      Discussion: <20160413094117.GC21485@msg.credativ.de>
      6a3d3965
    • Tom Lane's avatar
      Adjust signature of walrcv_receive hook. · c2dc194b
      Tom Lane authored
      Commit 314cbfc5 redefined the signature of this hook as
      typedef int (*walrcv_receive_type) (char **buffer, int *wait_fd);
      
      But in fact the type of the "wait_fd" variable ought to be pgsocket,
      which is what WaitLatchOrSocket expects, and which is necessary if
      we want to be able to assign PGINVALID_SOCKET to it on Windows.
      So fix that.
      c2dc194b
    • Tom Lane's avatar
      Adjust datatype of ReplicationState.acquired_by. · 994f1125
      Tom Lane authored
      It was declared as "pid_t", which would be fine except that none of
      the places that printed it in error messages took any thought for the
      possibility that it's not equivalent to "int".  This leads to warnings
      on some buildfarm members, and could possibly lead to actually wrong
      error messages on those platforms.  There doesn't seem to be any very
      good reason not to just make it "int"; it's only ever assigned from
      MyProcPid, which is int.  If we want to cope with PIDs that are wider
      than int, this is not the place to start.
      
      Also, fix the comment, which seems to perhaps be a leftover from a time
      when the field was only a bool?
      
      Per buildfarm.  Back-patch to 9.5 which has same issue.
      994f1125
    • Tom Lane's avatar
      Docs: clarify description of LIMIT/OFFSET behavior. · fda21aa0
      Tom Lane authored
      Section 7.6 was a tad confusing because it specified what LIMIT NULL
      does, but neglected to do the same for OFFSET NULL, making this look
      like perhaps a special case or a wrong restatement of the bit about
      LIMIT ALL.  Wordsmith a bit while at it.  Per bug #14084.
      fda21aa0
    • Tom Lane's avatar
      Fix prototype of pgwin32_bind(). · 22989a8e
      Tom Lane authored
      I (tgl) had copied-and-pasted this from pgwin32_accept(), failing to
      notice that the third parameter should be "int" not "int *".
      
      David Rowley
      22989a8e
    • Tom Lane's avatar
      Fix broken dependency-mongering for index operator classes/families. · 92a30a7e
      Tom Lane authored
      For a long time, opclasscmds.c explained that "we do not create a
      dependency link to the AM [for an opclass or opfamily], because we don't
      currently support DROP ACCESS METHOD".  Commit 473b9328 invented
      DROP ACCESS METHOD, but it batted only 1 for 2 on adding the dependency
      links, and 0 for 2 on updating the comments about the topic.
      
      In passing, undo the same commit's entirely inappropriate decision to
      blow away an existing index as a side-effect of create_am.sql.
      92a30a7e
    • Fujii Masao's avatar
      Fix duplicated index entry in doc. · c8cb7453
      Fujii Masao authored
      Commit cfe96ae2 corrected the name of pg_logical_emit_message()
      in its index entry. But this typo fix caused duplicated index
      entry because there was another index entry for the function.
      
      Spotted by Tom Lane.
      c8cb7453
    • Stephen Frost's avatar
      Disallow SET SESSION AUTHORIZATION pg_* · bfed4ab8
      Stephen Frost authored
      As part of reserving the pg_* namespace for default roles and in line
      with SET ROLE and other previous efforts, disallow settings the role
      to a default/reserved role using SET SESSION AUTHORIZATION.
      
      These checks and restrictions on what is allowed regarding default /
      reserved roles are under debate, but it seems prudent to ensure that
      the existing checks at least cover the intended cases while the
      debate rages on.  On me to clean it up if the consensus decision is
      to remove these checks.
      bfed4ab8
    • Andres Freund's avatar
      Add required database and origin filtering for logical messages. · be65eddd
      Andres Freund authored
      Logical messages, added in 3fe3511d, during decoding failed to filter
      messages emitted in other databases and messages emitted "under" a
      replication origin the output plugin isn't interested in.
      
      Add tests to verify that both types of filtering actually work. While
      touching message.sql remove hunk obsoleted by d25379eb.
      
      Bump XLOG_PAGE_MAGIC because xl_logical_message changed and because
      3fe3511d had omitted doing so. 3fe3511d additionally didn't bump
      catversion, but 7a542700 has done so since.
      
      Author: Petr Jelinek
      Reported-By: Andres Freund
      Discussion: 20160406142513.wotqy3ba3kanr423@alap3.anarazel.de
      be65eddd
    • Andres Freund's avatar
      Make init_spin_delay() C89 compliant and change stuck spinlock reporting. · 80abbeba
      Andres Freund authored
      The current definition of init_spin_delay (introduced recently in
      48354581) wasn't C89 compliant. It's not legal to refer to refer to
      non-constant expressions, and the ptr argument was one.  This, as
      reported by Tom, lead to a failure on buildfarm animal pademelon.
      
      The pointer, especially on system systems with ASLR, isn't super helpful
      anyway, though. So instead of making init_spin_delay into an inline
      function, make s_lock_stuck() report the function name in addition to
      file:line and change init_spin_delay() accordingly. While not a direct
      replacement, the function name is likely more useful anyway (line
      numbers are often hard to interpret in third party reports).
      
      This also fixes what file/line number is reported for waits via
      s_lock().
      
      As PG_FUNCNAME_MACRO is now used outside of elog.h, move it to c.h.
      
      Reported-By: Tom Lane
      Discussion: 4369.1460435533@sss.pgh.pa.us
      80abbeba
  8. 13 Apr, 2016 2 commits
    • Tom Lane's avatar
      Fix pg_dump so pg_upgrade'ing an extension with simple opfamilies works. · 6cead413
      Tom Lane authored
      As reported by Michael Feld, pg_upgrade'ing an installation having
      extensions with operator families that contain just a single operator class
      failed to reproduce the extension membership of those operator families.
      This caused no immediate ill effects, but would create problems when later
      trying to do a plain dump and restore, because the seemingly-not-part-of-
      the-extension operator families would appear separately in the pg_dump
      output, and then would conflict with the families created by loading the
      extension.  This has been broken ever since extensions were introduced,
      and many of the standard contrib extensions are affected, so it's a bit
      astonishing nobody complained before.
      
      The cause of the problem is a perhaps-ill-considered decision to omit
      such operator families from pg_dump's output on the grounds that the
      CREATE OPERATOR CLASS commands could recreate them, and having explicit
      CREATE OPERATOR FAMILY commands would impede loading the dump script into
      pre-8.3 servers.  Whatever the merits of that decision when 8.3 was being
      written, it looks like a poor tradeoff now.  We can fix the pg_upgrade
      problem simply by removing that code, so that the operator families are
      dumped explicitly (and then will be properly made to be part of their
      extensions).
      
      Although this fixes the behavior of future pg_upgrade runs, it does nothing
      to clean up existing installations that may have improperly-linked operator
      families.  Given the small number of complaints to date, maybe we don't
      need to worry about providing an automated solution for that; anyone who
      needs to clean it up can do so with manual "ALTER EXTENSION ADD OPERATOR
      FAMILY" commands, or even just ignore the duplicate-opfamily errors they
      get during a pg_restore.  In any case we need this fix.
      
      Back-patch to all supported branches.
      
      Discussion: <20228.1460575691@sss.pgh.pa.us>
      6cead413
    • Andres Freund's avatar
      Avoid atomic operation in MarkLocalBufferDirty(). · 6b93fcd1
      Andres Freund authored
      The recent patch to make Pin/UnpinBuffer lockfree in the hot
      path (48354581), accidentally used pg_atomic_fetch_or_u32() in
      MarkLocalBufferDirty(). Other code operating on local buffers was
      careful to only use pg_atomic_read/write_u32 which just read/write from
      memory; to avoid unnecessary overhead.
      
      On its own that'd just make MarkLocalBufferDirty() slightly less
      efficient, but in addition InitLocalBuffers() doesn't call
      pg_atomic_init_u32() - thus the spinlock fallback for the atomic
      operations isn't initialized. That in turn caused, as reported by Tom,
      buildfarm animal gaur to fail.  As those errors are actually useful
      against this type of error, continue to omit - intentionally this time -
      initialization of the atomic variable.
      
      In addition, add an explicit note about only using pg_atomic_read/write
      on local buffers's state to BufferDesc's description.
      
      Reported-By: Tom Lane
      Discussion: 1881.1460431476@sss.pgh.pa.us
      6b93fcd1