1. 05 May, 2018 4 commits
    • Tom Lane's avatar
      Fix bootstrap parser so that its keywords are unreserved words. · d160882a
      Tom Lane authored
      Mark Dilger pointed out that the bootstrap parser does not allow
      any of its keywords to appear as column values unless they're quoted,
      and proposed dealing with that by quoting such values in genbki.pl.
      Looking closer, though, we also have that problem with respect to table,
      column, and type names appearing in the .bki file: the parser would fail
      if any of those matched any of its keywords.  While so far there have
      been no conflicts (that I've heard of), this seems like a booby trap
      waiting to catch somebody.  Rather than clutter genbki.pl with enough
      quoting logic to handle all that, let's make the bootstrap parser grow
      up a little bit and treat its keywords as unreserved.
      
      Experimentation shows that it's fairly easy to do so with the exception
      of _null_, which I don't have a big problem with keeping as a reserved
      word.  The only change needed is that we can't have the "close" command
      take an optional table name: it has to either require or forbid the
      table name to avoid shift/reduce conflicts.  genbki.pl has historically
      always included the table name, so I took that option.
      
      The implementation has bootscanner.l passing forward the string value
      of each keyword, in case bootparse.y needs that.  This avoids needing to
      know the precise spelling of each keyword in bootparse.y, which is good
      because that's not always obvious from the token name.
      
      Discussion: https://postgr.es/m/3024FC91-DB6D-4732-B31C-DF772DF039A0@gmail.com
      d160882a
    • Tom Lane's avatar
      Revert "Test conversion of NaN between float4 and float8." · 5c4c771d
      Tom Lane authored
      This reverts commit 55e0e458.
      It's served its purpose of demonstrating what was wrong on
      buildfarm member opossum.  We could consider putting some kind
      of single-purpose hack into ftod() to make the test pass there;
      but I don't think it's worth the trouble, since there are surely
      many other places whether this platform bug could manifest.
      5c4c771d
    • Tom Lane's avatar
      Put in_range_float4_float8's work in-line. · cb3e9e40
      Tom Lane authored
      In commit 8b29e88c, I'd dithered about whether to make
      in_range_float4_float8 be a standalone copy of the float in-range logic
      or have it punt to in_range_float8_float8.  I went with the latter, which
      saves code space though at the cost of performance and readability.
      
      However, it emerges that this tickles a compiler or hardware bug on
      buildfarm member opossum.  Test results from commit 55e0e458 show
      conclusively that widening a float4 NaN to float8 produces Inf, not NaN,
      on that machine; which accounts perfectly for the window RANGE test
      failures it's been showing.  We can dodge this problem by making
      in_range_float4_float8 be an independent function, so that it checks
      for NaN inputs before widening them.
      
      Ordinarily I'd not be very excited about working around such obviously
      broken functionality; but given that this was a judgment call to begin
      with, I don't mind reversing it.
      cb3e9e40
    • Peter Eisentraut's avatar
      2f525187
  2. 04 May, 2018 11 commits
    • Tom Lane's avatar
      First-draft release notes for 10.4. · 488ccfe4
      Tom Lane authored
      As usual, the release notes for other branches will be made by cutting
      these down, but put them up for community review first.
      488ccfe4
    • Heikki Linnakangas's avatar
      Fix scenario where streaming standby gets stuck at a continuation record. · 06687198
      Heikki Linnakangas authored
      If a continuation record is split so that its first half has already been
      removed from the master, and is only present in pg_wal, and there is a
      recycled WAL segment in the standby server that looks like it would
      contain the second half, recovery would get stuck. The code in
      XLogPageRead() incorrectly started streaming at the beginning of the
      WAL record, even if we had already read the first page.
      
      Backpatch to 9.4. In principle, older versions have the same problem, but
      without replication slots, there was no straightforward mechanism to
      prevent the master from recycling old WAL that was still needed by standby.
      Without such a mechanism, I think it's reasonable to assume that there's
      enough slack in how many old segments are kept around to not run into this,
      or you have a WAL archive.
      
      Reported by Jonathon Nelson. Analysis and patch by Kyotaro HORIGUCHI, with
      some extra comments by me.
      
      Discussion: https://www.postgresql.org/message-id/CACJqAM3xVz0JY1XFDKPP%2BJoJAjoGx%3DGNuOAshEDWCext7BFvCQ%40mail.gmail.com
      06687198
    • Alvaro Herrera's avatar
      Don't mark pages all-visible spuriously · d2599ecf
      Alvaro Herrera authored
      Dan Wood diagnosed a long-standing problem that pages containing tuples
      that are locked by multixacts containing live lockers may spuriously end
      up as candidates for getting their all-visible flag set.  This has the
      long-term effect that multixacts remain unfrozen; this may previously
      pass undetected, but since commit XYZ it would be reported as
        "ERROR: found multixact 134100944 from before relminmxid 192042633"
      because when a later vacuum tries to freeze the page it detects that a
      multixact that should have gotten frozen, wasn't.
      
      Dan proposed a (correct) patch that simply sets a variable to its
      correct value, after a bogus initialization.  But, per discussion, it
      seems better coding to avoid the bogus initializations altogether, since
      they could give rise to more bugs later.  Therefore this fix rewrites
      the logic a little bit to avoid depending on the bogus initializations.
      
      This bug was part of a family introduced in 9.6 by commit a892234f;
      later, commit 38e9f90a fixed most of them, but this one was
      unnoticed.
      
      Authors: Dan Wood, Pavan Deolasee, Álvaro Herrera
      Reviewed-by: Masahiko Sawada, Pavan Deolasee, Álvaro Herrera
      Discussion: https://postgr.es/m/84EBAC55-F06D-4FBE-A3F3-8BDA093CE3E3@amazon.com
      d2599ecf
    • Andrew Dunstan's avatar
      Provide for testing on python3 modules when under MSVC · 966268c7
      Andrew Dunstan authored
      This should have been done some years ago as promised in commit
      c4dcdd0c2. However, better late than never.
      
      Along the way do a little housekeeping, including using a simpler test
      for the python version being tested, and removing a redundant subroutine
      parameter. These changes only apply back to release 9.5.
      
      Backpatch to all live releases.
      966268c7
    • Andrew Dunstan's avatar
      Allow MSYS as well as MINGW in Msys uname · 608a7101
      Andrew Dunstan authored
      Msys2's uname -s outputs a string beginning MSYS rather than MINGW as is
      output by Msys. Allow either in pg_upgrade's test.sh.
      
      Backpatch to all live branches.
      608a7101
    • Tom Lane's avatar
      Sync our copy of the timezone library with IANA release tzcode2018e. · b45f6613
      Tom Lane authored
      The non-cosmetic changes involve teaching the "zic" tzdata compiler about
      negative DST.  While I'm not currently intending that we start using
      negative-DST data right away, it seems possible that somebody would try
      to use our copy of zic with bleeding-edge IANA data.  So we'd better be
      out in front of this change code-wise, even though it doesn't matter for
      the data file we're shipping.
      
      Discussion: https://postgr.es/m/30996.1525445902@sss.pgh.pa.us
      b45f6613
    • Tom Lane's avatar
      Fix precedence problem in new Perl code. · 59cb3230
      Tom Lane authored
      I think this bit of commit 1f1cd9b5 didn't do quite what I meant :-(
      59cb3230
    • Peter Eisentraut's avatar
      pg_dump: Use current_database() instead of PQdb() · 1cd2445c
      Peter Eisentraut authored
      For querying pg_database about information about the database being
      dumped, look up by using current_database() instead of the value
      obtained from PQdb().  When using a connection proxy, the value from
      PQdb() might not be the real name of the database.
      1cd2445c
    • Teodor Sigaev's avatar
      Don't truncate away non-key attributes for leftmost downlinks. · 2a9e04f0
      Teodor Sigaev authored
      nbtsort.c does not need to truncate away non-key attributes for the
      minimum key of the leftmost page on a level, since this is only used to
      build a minus infinity downlink for the level's leftmost page.
      Truncating away non-key attributes in advance of truncating away all
      attributes in _bt_sortaddtup() does not affect the correctness of CREATE
      INDEX, but it is misleading.
      
      Author: Peter Geoghegan
      Discussion: https://www.postgresql.org/message-id/CAH2-WzkAS2M3ussHG-s_Av=Zo6dPjOxyu5fNRkYnxQV+YzGQ4w@mail.gmail.com
      2a9e04f0
    • Teodor Sigaev's avatar
      Re-think predicate locking on GIN indexes. · 0bef1c06
      Teodor Sigaev authored
      The principle behind the locking was not very well thought-out, and not
      documented. Add a section in the README to explain how it's supposed to
      work, and change the code so that it actually works that way.
      
      This fixes two bugs:
      
      1. If fast update was turned on concurrently, subsequent inserts to the
         pending list would not conflict with predicate locks that were acquired
         earlier, on entry pages. The included 'predicate-gin-fastupdate' test
         demonstrates that. To fix, make all scans acquire a predicate lock on
         the metapage. That lock represents a scan of the pending list, whether
         or not there is a pending list at the moment. Forget about the
         optimization to skip locking/checking for locks, when fastupdate=off.
      2. If a scan finds no match, it still needs to lock the entry page. The
         point of predicate locks is to lock the gabs between values, whether
         or not there is a match. The included 'predicate-gin-nomatch' test
         tests that case.
      
      In addition to those two bug fixes, this removes some unnecessary locking,
      following the principle laid out in the README. Because all items in
      a posting tree have the same key value, a lock on the posting tree root is
      enough to cover all the items. (With a very large posting tree, it would
      possibly be better to lock the posting tree leaf pages instead, so that a
      "skip scan" with a query like "A & B", you could avoid unnecessary conflict
      if a new tuple is inserted with A but !B. But let's keep this simple.)
      
      Also, some spelling  fixes.
      
      Author: Heikki Linnakangas with some editorization by me
      Review: Andrey Borodin, Alexander Korotkov
      Discussion: https://www.postgresql.org/message-id/0b3ad2c2-2692-62a9-3a04-5724f2af9114@iki.fi
      0bef1c06
    • Peter Eisentraut's avatar
      Update expected files for older Python versions · 7d867997
      Peter Eisentraut authored
      neglected in commit fa03769e
      7d867997
  3. 03 May, 2018 10 commits
    • Tom Lane's avatar
      Blindly try to fix MSVC build's use of genbki.pl and Gen_fmgrtab.pl. · bad51a49
      Tom Lane authored
      We need to use a stamp file to record the runs of these scripts, as
      is done on the Unix side.  I think I got it right, but can't test.
      
      While at it, extend this handmade dependency logic to also check the
      generating script files, as the makefiles do.
      
      Discussion: https://postgr.es/m/16925.1525376229@sss.pgh.pa.us
      bad51a49
    • Tom Lane's avatar
      Avoid overwriting unchanged output files in genbki.pl and Gen_fmgrtab.pl. · 1f1cd9b5
      Tom Lane authored
      If a particular output file already exists with the contents it should
      have, leave it alone, so that its mod timestamp is not advanced.
      
      In builds using --enable-depend, this can avoid the need to recompile .c
      files whose included files didn't actually change.  It's not clear whether
      it saves much of anything for users of ccache; but the cost of doing the
      file comparisons seems to be negligible, so we might as well do it.
      
      For developers using the MSVC toolchain, this will create a regression:
      msvc/Solution.pm will sometimes run genbki.pl or Gen_fmgrtab.pl
      unnecessarily.  I'll look into fixing that separately.
      
      Discussion: https://postgr.es/m/16925.1525376229@sss.pgh.pa.us
      1f1cd9b5
    • Tom Lane's avatar
      Rearrange makefile rules for running Gen_fmgrtab.pl. · 9bf28f96
      Tom Lane authored
      Make these rules look more like the ones associated with genbki.pl,
      to wit:
      
      * Use a stamp file to record when we last ran the script, instead of
      relying on the timestamps of the individual output files.
      
      * Take the knowledge out of backend/Makefile and put it in utils/Makefile
      where it belongs.  I moved down the handling of errcodes.h and probes.h
      too, although those continue to be built by separate processes.
      
      In itself, this is just much-needed cleanup with little practical effect.
      However, by decoupling these makefile rules from the timestamps of the
      generated header files, we open the door to not advancing those timestamps
      unnecessarily, which will be taken advantage of by the next commit.
      
      msvc/Solution.pm should be taught to do things similarly, but I'll leave
      that for another commit.
      
      Discussion: https://postgr.es/m/16925.1525376229@sss.pgh.pa.us
      9bf28f96
    • Peter Eisentraut's avatar
      Tweak tests to support Python 3.7 · fa03769e
      Peter Eisentraut authored
      Python 3.7 removes the trailing comma in the repr() of
      BaseException (see <https://bugs.python.org/issue30399>), leading to
      test output differences.  Work around that by composing the equivalent
      test output in a more manual way.
      fa03769e
    • Teodor Sigaev's avatar
      Add HOLD_INTERRUPTS section into FinishPreparedTransaction. · 8f9be261
      Teodor Sigaev authored
      If an interrupt arrives in the middle of FinishPreparedTransaction
      and any callback decide to call CHECK_FOR_INTERRUPTS (e.g.
      RemoveTwoPhaseFile can write a warning with ereport, which checks for
      interrupts) then it's possible to leave current GXact undeleted.
      
      Backpatch to all supported branches
      
      Stas Kelvich
      
      Discussion: ihttps://www.postgresql.org/message-id/3AD85097-A3F3-4EBA-99BD-C38EDF8D2949@postgrespro.ru
      8f9be261
    • Tom Lane's avatar
      Avoid portability issues in autoprewarm.c. · cddc4dc6
      Tom Lane authored
      autoprewarm.c mostly considered the number of blocks it might be dealing
      with as being int64.  This is unnecessary, because NBuffers is declared
      as int, and there's been no suggestion that we might widen it in the
      foreseeable future.  Moreover, using int64 is problematic because the
      code expected INT64_FORMAT to work with fscanf(), something we don't
      guarantee, and which indeed fails on some older buildfarm members.
      
      On top of that, the module randomly used uint32 rather than int64 variables
      to hold block counters in several places, so it would fail anyway if we
      ever did have NBuffers wider than that; and it also supposed that pg_qsort
      could sort an int64 number of elements, which is wrong on 32-bit machines
      (though no doubt a 32-bit machine couldn't actually have that many
      buffers).
      
      Hence, change all these variables to plain int.
      
      In passing, avoid shadowing one variable named i with another,
      and avoid casting away const in apw_compare_blockinfo.
      
      Discussion: https://postgr.es/m/7773.1525288909@sss.pgh.pa.us
      cddc4dc6
    • Teodor Sigaev's avatar
      Fix pg_dump support for pre-8.2 versions · ac7a7e32
      Teodor Sigaev authored
      Unify indnkeys/indnatts/indnkeyatts usage  for all version of query to get
      index information, remove indnkeys column  from query as unused.
      
      Author: Marina Polyakova
      Noticed by: Peter Eisentraut
      ac7a7e32
    • Tom Lane's avatar
      Further improve code for probing the availability of ARM CRC instructions. · a7a73875
      Tom Lane authored
      Andrew Gierth pointed out that commit 1c72ec6f would yield the wrong
      answer on big-endian ARM systems, because the data being CRC'd would be
      different.  To fix that, and avoid the rather unsightly hard-wired
      constant, simply compare the hardware and software implementations'
      results.
      
      While we're at it, also log the resulting decision at DEBUG1, and error
      out if the hw and sw results unexpectedly differ.  Also, since this
      file must compile for both frontend and backend, avoid incorrect
      dependencies on backend-only headers.
      
      In passing, add a comment to postmaster.c about when the CRC function
      pointer will get initialized.
      
      Thomas Munro, based on complaints from Andrew Gierth and Tom Lane
      
      Discussion: https://postgr.es/m/HE1PR0801MB1323D171938EABC04FFE7FA9E3110@HE1PR0801MB1323.eurprd08.prod.outlook.com
      a7a73875
    • Peter Eisentraut's avatar
      Fix SPI error cleanup and memory leak · 30c66e77
      Peter Eisentraut authored
      Since the SPI stack has been moved from TopTransactionContext to
      TopMemoryContext, setting _SPI_stack to NULL in AtEOXact_SPI() leaks
      memory.  In fact, we don't need to do that anymore: We just leave the
      allocated stack around for the next SPI use.
      
      Also, refactor the SPI cleanup so that it is run both at transaction end
      and when returning to the main loop on an exception.  The latter is
      necessary when a procedure calls a COMMIT or ROLLBACK command that
      itself causes an error.
      30c66e77
    • Robert Haas's avatar
      Remove now-unnecessary cast. · a365f52d
      Robert Haas authored
      Etsuro Fujita
      
      Discussion: http://postgr.es/m/5AE99BA7.9060001@lab.ntt.co.jp
      a365f52d
  4. 02 May, 2018 12 commits
  5. 01 May, 2018 3 commits
    • Tom Lane's avatar
      Fix some assorted compiler warnings on Windows. · b2328bf6
      Tom Lane authored
      Don't overflow the result type of constant expressions.  Don't negate
      unsigned types.  Define HAVE_STDBOOL_H for Visual C++ 2013 and later.
      
      Thomas Munro
      Reviewed-By: Michael Paquier and Tom Lane
      
      Discussion: https://postgr.es/m/CAEepm%3D3%3DTDYEXUEcHpEx%2BTwc31wo7PA0oBAiNt6sWmq93MW02A%40mail.gmail.com
      b2328bf6
    • Tom Lane's avatar
      Clean up warnings from -Wimplicit-fallthrough. · 41c912ca
      Tom Lane authored
      Recent gcc can warn about switch-case fall throughs that are not
      explicitly labeled as intentional.  This seems like a good thing,
      so clean up the warnings exposed thereby by labeling all such
      cases with comments that gcc will recognize.
      
      In files that already had one or more suitable comments, I generally
      matched the existing style of those.  Otherwise I went with
      /* FALLTHROUGH */, which is one of the spellings approved at the
      more-restrictive-than-default level -Wimplicit-fallthrough=4.
      (At the default level you can also spell it /* FALL ?THRU */,
      and it's not picky about case.  What you can't do is include
      additional text in the same comment, so some existing comments
      containing versions of this aren't good enough.)
      
      Testing with gcc 8.0.1 (Fedora 28's current version), I found that
      I also had to put explicit "break"s after elog(ERROR) or ereport(ERROR);
      apparently, for this purpose gcc doesn't recognize that those don't
      return.  That seems like possibly a gcc bug, but it's fine because
      in most places we did that anyway; so this amounts to a visit from the
      style police.
      
      Discussion: https://postgr.es/m/15083.1525207729@sss.pgh.pa.us
      41c912ca
    • Andres Freund's avatar
      Improve representation of 'moved partitions' indicator on deleted tuples. · 1667148a
      Andres Freund authored
      Previously a tuple that has been moved to a different partition (see
      f16241be), set the block number on the old tuple to an invalid
      value to indicate that fact. But the tuple offset was left
      untouched. That turned out to trigger a wal_consistency_checking
      failure as reported by Peter Geoghegan, as the offset wasn't
      always overwritten during WAL replay.
      
      Heikki observed that we're wasting valuable data by not putting
      information also in the offset. Thus set that to
      MovedPartitionsOffsetNumber when a tuple indicates it has moved.
      
      We continue to set the block number to MovedPartitionsBlockNumber, as
      that seems more likely to cause problems for code not updated to know
      about moved tuples.
      
      As t_ctid's offset number is now always set, this refinement also
      fixes the wal_consistency_checking issue.
      
      This technically is a minor disk format break, with previously created
      moved tuples not being recognized anymore. But since there not even
      has been a beta release since f16241be...
      
      Reported-By: Peter Geoghegan
      Author: Heikki Linnakangas, Amul Sul
      Discussion: https://postgr.es/m/CAH2-Wzm9ty+1BX7-GMNJ=xPRg67oJTVeDNdA9LSyJJtMgRiCMA@mail.gmail.com
      1667148a