1. 13 Jul, 2017 2 commits
    • Alvaro Herrera's avatar
      Fix typo in v10 release notes · 2036f71b
      Alvaro Herrera authored
      The new functions return a list of files in the corresponding directory,
      not the name of the directory itself.
      
      Pointed out by Gianni Ciolli.
      2036f71b
    • Heikki Linnakangas's avatar
      Fix race between GetNewTransactionId and GetOldestActiveTransactionId. · 74fc8386
      Heikki Linnakangas authored
      The race condition goes like this:
      
      1. GetNewTransactionId advances nextXid e.g. from 100 to 101
      2. GetOldestActiveTransactionId reads the new nextXid, 101
      3. GetOldestActiveTransactionId loops through the proc array. There are no
         active XIDs there, so it returns 101 as the oldest active XID.
      4. GetNewTransactionid stores XID 100 to MyPgXact->xid
      
      So, GetOldestActiveTransactionId returned XID 101, even though 100 only
      just started and is surely still running.
      
      This would be hard to hit in practice, and even harder to spot any ill
      effect if it happens. GetOldestActiveTransactionId is only used when
      creating a checkpoint in a master server, and the race condition can only
      happen on an online checkpoint, as there are no backends running during a
      shutdown checkpoint. The oldestActiveXid value of an online checkpoint is
      only used when starting up a hot standby server, to determine the starting
      point where pg_subtrans is initialized from. For the race condition to
      happen, there must be no other XIDs in the proc array that would hold back
      the oldest-active XID value, which means that the missed XID must be a top
      transaction's XID. However, pg_subtrans is not used for top XIDs, so I
      believe an off-by-one error is in fact inconsequential. Nevertheless, let's
      fix it, as it's clearly wrong and the fix is simple.
      
      This has been wrong ever since hot standby was introduced, so backport to
      all supported versions.
      
      Discussion: https://www.postgresql.org/message-id/e7258662-82b6-7a45-56d4-99b337a32bf7@iki.fi
      74fc8386
  2. 12 Jul, 2017 7 commits
  3. 11 Jul, 2017 1 commit
    • Tom Lane's avatar
      Fix multiple assignments to a column of a domain type. · b1cb32fb
      Tom Lane authored
      We allow INSERT and UPDATE commands to assign to the same column more than
      once, as long as the assignments are to subfields or elements rather than
      the whole column.  However, this failed when the target column was a domain
      over array rather than plain array.  Fix by teaching process_matched_tle()
      to look through CoerceToDomain nodes, and add relevant test cases.
      
      Also add a group of test cases exercising domains over array of composite.
      It's doubtless accidental that CREATE DOMAIN allows this case while not
      allowing straight domain over composite; but it does, so we'd better make
      sure we don't break it.  (I could not find any documentation mentioning
      either side of that, so no doc changes.)
      
      It's been like this for a long time, so back-patch to all supported
      branches.
      
      Discussion: https://postgr.es/m/4206.1499798337@sss.pgh.pa.us
      b1cb32fb
  4. 10 Jul, 2017 10 commits
  5. 09 Jul, 2017 3 commits
  6. 08 Jul, 2017 1 commit
  7. 07 Jul, 2017 2 commits
  8. 06 Jul, 2017 4 commits
    • Teodor Sigaev's avatar
      Fix potential data corruption during freeze · 31b8db8e
      Teodor Sigaev authored
      Fix oversight in 3b97e682 bug fix. Bitwise AND is used instead of OR and
      it cleans all bits in t_infomask heap tuple field.
      
      Backpatch to 9.3
      31b8db8e
    • Dean Rasheed's avatar
      Clarify the contract of partition_rbound_cmp(). · f1dae097
      Dean Rasheed authored
      partition_rbound_cmp() is intended to compare range partition bounds
      in a way such that if all the bound values are equal but one is an
      upper bound and one is a lower bound, the upper bound is treated as
      smaller than the lower bound. This particular ordering is required by
      RelationBuildPartitionDesc() when building the PartitionBoundInfoData,
      so that it can consistently keep only the upper bounds when upper and
      lower bounds coincide.
      
      Update the function comment to make that clearer.
      
      Also, fix a (currently unreachable) corner-case bug -- if the bound
      values coincide and they contain unbounded values, fall through to the
      lower-vs-upper comparison code, rather than immediately returning
      0. Currently it is not possible to define coincident upper and lower
      bounds containing unbounded columns, but that may change in the
      future, so code defensively.
      
      Discussion: https://postgr.es/m/CAAJ_b947mowpLdxL3jo3YLKngRjrq9+Ej4ymduQTfYR+8=YAYQ@mail.gmail.com
      f1dae097
    • Dean Rasheed's avatar
      Simplify the logic checking new range partition bounds. · c03911d9
      Dean Rasheed authored
      The previous logic, whilst not actually wrong, was overly complex and
      involved doing two binary searches, where only one was really
      necessary. This simplifies that logic and improves the comments.
      
      One visible change is that if the new partition overlaps multiple
      existing partitions, the error message now always reports the overlap
      with the first existing partition (the one with the lowest
      bounds). The old code would sometimes report the clash with the first
      partition and sometimes with the last one.
      
      Original patch idea from Amit Langote, substantially rewritten by me.
      
      Discussion: https://postgr.es/m/CAAJ_b947mowpLdxL3jo3YLKngRjrq9+Ej4ymduQTfYR+8=YAYQ@mail.gmail.com
      c03911d9
    • Tom Lane's avatar
      Fix another race-condition-ish issue in recovery/t/001_stream_rep.pl. · ec86af91
      Tom Lane authored
      Buildfarm members hornet and sungazer have shown multiple instances of
      "Failed test 'xmin of non-cascaded slot with hs feedback has changed'".
      The reason seems to be that the test is checking the current xmin of the
      master server's replication slot against a past xmin of the first slave
      server's replication slot.  Even though the latter slot is downstream of
      the former, it's possible for its reported xmin to be ahead of the former's
      reported xmin, because those numbers are updated whenever the respective
      downstream walreceiver feels like it (see logic in WalReceiverMain).
      Instrumenting this test shows that indeed the slave slot's xmin does often
      advance before the master's does, especially if an autovacuum transaction
      manages to occur during the relevant window.  If we happen to capture such
      an advanced xmin as $xmin, then the subsequent wait_slot_xmins call can
      fall through before the master's xmin has advanced at all, and then if it
      advances before the get_slot_xmins call, we can get the observed failure.
      Yeah, that's a bit of a long chain of deduction, but it's hard to explain
      any other way how the test can get past an "xmin <> '$xmin'" check only
      to have the next query find that xmin does equal $xmin.
      
      Fix by keeping separate images of the master and slave slots' xmins
      and testing their has-xmin-advanced conditions independently.
      ec86af91
  9. 05 Jul, 2017 5 commits
  10. 04 Jul, 2017 2 commits
  11. 03 Jul, 2017 3 commits