Commits · de4389712206d2686e09ad8d6dd112dc4b6c6d42 · Abuhujair Javed / Postgres FD Implementation

26 Apr, 2017 2 commits

Fix various concurrency issues in logical replication worker launching · de438971

Peter Eisentraut authored Apr 26, 2017

The code was originally written with the assumption that the launcher is
the only process starting the worker.  However, that hasn't been true since
commit 7c4f5240, which failed to modify the worker management code
adequately.

This patch adds an in_use field to the LogicalRepWorker struct to
indicate whether the worker slot is being used and uses proper locking
everywhere this flag is set or read.

However, if the parent process dies while the new worker is starting and
the new worker fails to attach to shared memory, this flag would never
get cleared.  We solve this rare corner case by adding a sort of garbage
collector for in_use slots.  This uses another field in the
LogicalRepWorker struct named launch_time that contains the time when
the worker was started.  If any request to start a new worker does not
find a free slot, we'll check for workers that were supposed to start but
took too long to actually do so, and reuse their slot.
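
The slot-reuse idea described above can be sketched roughly as follows. This is a minimal illustration, not the patch itself: only in_use and launch_time are named in the commit message, while WorkerSlot, the attached flag, find_free_slot() and LAUNCH_TIMEOUT_SECS are hypothetical stand-ins, and the caller is assumed to hold whatever lock protects the slot array.

```c
#include <stdbool.h>
#include <stddef.h>

#define LAUNCH_TIMEOUT_SECS 60	/* hypothetical grace period, not from the patch */

/*
 * Simplified stand-in for LogicalRepWorker.  Only in_use and launch_time
 * come from the commit message; "attached" is an assumed flag meaning the
 * worker has attached to shared memory.
 */
typedef struct WorkerSlot
{
	bool		in_use;			/* slot has been handed to a worker */
	bool		attached;		/* worker actually started and attached */
	long		launch_time;	/* when the slot was handed out, in seconds */
} WorkerSlot;

/*
 * Return a usable slot, or NULL if there is none.  The caller is assumed
 * to hold the lock that protects the slots array.
 */
static WorkerSlot *
find_free_slot(WorkerSlot *slots, size_t nslots, long now)
{
	size_t		i;

	/* First pass: any slot that is simply not in use. */
	for (i = 0; i < nslots; i++)
	{
		if (!slots[i].in_use)
			return &slots[i];
	}

	/* Second pass: reclaim a slot whose worker never showed up in time. */
	for (i = 0; i < nslots; i++)
	{
		WorkerSlot *w = &slots[i];

		if (w->in_use && !w->attached &&
			now - w->launch_time > LAUNCH_TIMEOUT_SECS)
		{
			w->in_use = false;
			return w;
		}
	}

	return NULL;
}
```

The second pass only runs when no slot is free, so the common path stays a single scan.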

In passing also fix possible race conditions when stopping a worker that
hasn't finished starting yet.

Author: Petr Jelinek <petr.jelinek@2ndquadrant.com>
Reported-by: Fujii Masao <masao.fujii@gmail.com>

de438971

doc PG10: add Rafia Sabih to parallel index scan item · 309191f6
Bruce Momjian authored Apr 26, 2017
```
Reported-by: Amit Kapila
```
309191f6

25 Apr, 2017 22 commits

Allow ALTER TABLE ONLY on partitioned tables · 9139aa19

Stephen Frost authored Apr 25, 2017

There is no need to forbid ALTER TABLE ONLY on partitioned tables,
when no partitions exist yet.  This can be handy for users who are
building up their partitioned table independently and will create actual
partitions later.

In addition, this is how pg_dump likes to operate in certain instances.

Author: Amit Langote, with some error message word-smithing by me

9139aa19

doc PG10: update EXPLAIN SUMMARY item · 5f2b48d1
Bruce Momjian authored Apr 25, 2017
```
Reported-by: Tels
```
5f2b48d1

Wake up launcher when enabling a subscription · a3f17b9c

Peter Eisentraut authored Apr 25, 2017

Otherwise one would have to wait up to DEFAULT_NAPTIME_PER_CYCLE until
the subscription worker is considered for starting.

There is a small race condition: If one enables a subscription right
after disabling it, the launcher might not have registered the stopping
when receiving the wakeup signal for the re-enabling. The start will
then not happen right away but after the full cycle time.

Author: Kyotaro HORIGUCHI <horiguchi.kyotaro@lab.ntt.co.jp>

a3f17b9c

doc: update PG 10 item about referencing many relations · ef0ba572
Bruce Momjian authored Apr 25, 2017
```
Reported-by: Tom Lane
```
ef0ba572
doc: add PG 10 doc item about VACUUM truncation, 7e26e02e · 3d774119
Bruce Momjian authored Apr 25, 2017
```
Reported-by: Andres Freund
```
3d774119
doc PG10: add commit 090010f2 and adjust EXPLAIN SUMMARY item · 3640cf5e
Bruce Momjian authored Apr 25, 2017
```
Reported-by: Tels, Andres Freund
```
3640cf5e
doc: properly indent SGML tags in PG 10 release notes · bf368fbe
Bruce Momjian authored Apr 25, 2017

bf368fbe

Set the priorities of all quorum synchronous standbys to 1. · 346199dc

Fujii Masao authored Apr 26, 2017

In quorum-based synchronous replication, all the standbys listed in
synchronous_standby_names have an equal chance of being chosen
as synchronous standbys, so they should have the same priority.
Previously, however, quorum standbys whose names appear earlier
in the list were given higher priority values, even though the differences
in those values did not affect the selection of synchronous standbys.
Users could see those "meaningless" priority values in pg_stat_replication,
which was confusing.

This commit gives all the quorum synchronous standbys the same
highest priority, i.e., 1, in order to remove such confusion.
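
As a rough illustration of the rule, the sketch below assigns a priority from the replication method and the standby's position in the list. The enum, its member names and standby_priority() are hypothetical. It also assumes that priority-based replication uses the 1-based list position as the priority and that 0 means "not listed".

```c
/* Hypothetical names; not the identifiers used in the PostgreSQL source. */
typedef enum
{
	METHOD_PRIORITY,			/* priority-based synchronous replication */
	METHOD_QUORUM				/* quorum-based synchronous replication */
} SyncMethod;

/*
 * list_position is the 1-based position of the standby's name in the
 * configured list, or 0 if the standby is not listed at all.
 */
static int
standby_priority(SyncMethod method, int list_position)
{
	if (list_position <= 0)
		return 0;				/* not a synchronous standby candidate */

	if (method == METHOD_QUORUM)
		return 1;				/* all quorum standbys share priority 1 */

	return list_position;		/* assumed: earlier in list = smaller number */
}
```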

Author: Fujii Masao
Reviewed-by: Masahiko Sawada, Kyotaro Horiguchi
Discussion: http://postgr.es/m/CAHGQGwEKOw=SmPLxJzkBsH6wwDBgOnVz46QjHbtsiZ-d-2RGUg@mail.gmail.com

346199dc

doc: PG 10 release notes updates · cdd5bcad
Bruce Momjian authored Apr 25, 2017
```
Reported-by: Michael Paquier, Felix Gerzaguet
```
cdd5bcad
doc: PG 10 release note updates · 64f0f7cf
Bruce Momjian authored Apr 25, 2017
```
Reported-by: David Rowley, Amit Langote, Ashutosh Bapat
```
64f0f7cf

Adjust outdated comment. · 914ae8d3

Robert Haas authored Apr 25, 2017

Commit 5dfc1981 removed the only
existing caller of hash_freeze, but left behind a comment indicating
that hash_freeze was still used.  Adjust.

Kyotaro Horiguchi

Discussion: http://postgr.es/m/20170424.165541.230634914.horiguchi.kyotaro@lab.ntt.co.jp

914ae8d3

Update copyright in recently added files. · 7cc14ae9

Fujii Masao authored Apr 25, 2017

This commit also fixes copyright line missed by the automated script.

Author: Masahiko Sawada

7cc14ae9

doc: move hash info to new section and split out growth item · 45e3d8ae
Bruce Momjian authored Apr 25, 2017
```
Reported-by: Amit Kapila
```
45e3d8ae

doc: move hash performance item into index section · cef5dbbf

Bruce Momjian authored Apr 24, 2017

The requirement to rebuild pg_upgrade-ed hash indexes was kept in the
incompatibilities section.

Reported-by: Amit Kapila

cef5dbbf

doc: add Rafia Sabih to PG 10 release note item · b007b1af
Bruce Momjian authored Apr 24, 2017
```
Reported-by: Amit Kapila
```
b007b1af
doc: fix PG 10 release note doc markup · d103e671
Bruce Momjian authored Apr 24, 2017

d103e671
doc: merge PG 10 release SysV item · 419a0554
Bruce Momjian authored Apr 24, 2017
```
Reported-by: Takayuki Tsunakawa
```
419a0554

postgres_fdw: Fix join push down with extensions · 332bec1e

Peter Eisentraut authored Apr 24, 2017

Objects in an extension are shippable to a foreign server if the
extension is part of the foreign server definition's shippable
extensions list.  But this was not properly considered in some cases
when checking whether a join condition can be pushed to a foreign server
and the join condition uses an object from a shippable extension.  So
the join would never be pushed down in those cases.

So, the list of extensions needs to be made available in fpinfo of the
relation being considered to be pushed down before any expressions are
assessed for being shippable.  Fix foreign_join_ok() to do that for a
join relation.

The code to save FDW options in fpinfo is scattered at multiple places.
Bring all of that together into functions apply_server_options(),
apply_table_options(), and merge_fdw_options().

David Rowley and Ashutosh Bapat, per report from David Rowley

332bec1e

doc: PG 10 fixes · 6e033c6a
Bruce Momjian authored Apr 24, 2017
```
Reported-by: Takayuki Tsunakawa
```
6e033c6a
doc: several minor PG 10 doc adjustments · bba375eb
Bruce Momjian authored Apr 24, 2017

bba375eb
doc: fix attribution of sequence item, order incompatibilities · a0d932b3
Bruce Momjian authored Apr 24, 2017
```
Reported-by: Andreas Karlsson
```
a0d932b3
doc: first draft of Postgres 10 release notes · 1d8573ed
Bruce Momjian authored Apr 24, 2017

1d8573ed

24 Apr, 2017 7 commits

doc: update release doc markup instructions · 66fade8a
Bruce Momjian authored Apr 24, 2017

66fade8a

Revert "Use pselect(2) not select(2), if available, to wait in postmaster's loop." · 64925603

Tom Lane authored Apr 24, 2017

This reverts commit 81069a9e.

Buildfarm results suggest that some platforms have versions of pselect(2)
that are not merely non-atomic, but flat out non-functional. Revert the
use-pselect patch to confirm this diagnosis (and exclude the no-SA_RESTART
patch as the source of trouble). If it's so, we should probably look into
blacklisting specific platforms that have broken pselect.

Discussion: https://postgr.es/m/9696.1493072081@sss.pgh.pa.us

64925603

Use pselect(2) not select(2), if available, to wait in postmaster's loop. · 81069a9e

Tom Lane authored Apr 24, 2017

Traditionally we've unblocked signals, called select(2), and then blocked
signals again. The code expects that the select() will be cancelled with
EINTR if an interrupt occurs; but there's a race condition, which is that
an already-pending signal will be delivered as soon as we unblock, and then
when we reach select() there will be nothing preventing it from waiting.
This can result in a long delay before we perform any action that
ServerLoop was supposed to have taken in response to the signal. As with
the somewhat-similar symptoms fixed by commit 89390208, the main practical
problem is slow launching of parallel workers. The window for trouble is
usually pretty short, corresponding to one iteration of ServerLoop; but
it's not negligible.

To fix, use pselect(2) in place of select(2) where available, as that's
designed to solve exactly this problem. Where not available, we continue
to use the old way, and are no worse off than before.
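
A minimal sketch of the pselect(2) pattern, assuming a POSIX platform. This is not the postmaster code: block_and_handle(), wait_for_signal() and on_signal() are hypothetical helpers. The signal stays blocked everywhere except inside pselect(), which swaps in the wait mask and starts waiting as one atomic step, so a signal that arrived while blocked still interrupts the wait.

```c
#ifndef _POSIX_C_SOURCE
#define _POSIX_C_SOURCE 200809L
#endif
#include <errno.h>
#include <signal.h>
#include <stddef.h>
#include <string.h>
#include <sys/select.h>
#include <time.h>

static volatile sig_atomic_t got_signal = 0;

static void
on_signal(int signo)
{
	(void) signo;
	got_signal = 1;
}

/* Install the handler and keep the signal blocked outside of the wait. */
static int
block_and_handle(int signo)
{
	struct sigaction sa;
	sigset_t	block;

	memset(&sa, 0, sizeof(sa));
	sa.sa_handler = on_signal;
	sigemptyset(&sa.sa_mask);
	sa.sa_flags = 0;
	if (sigaction(signo, &sa, NULL) != 0)
		return -1;

	sigemptyset(&block);
	sigaddset(&block, signo);
	return sigprocmask(SIG_BLOCK, &block, NULL);
}

/*
 * Wait up to timeout_secs for signo.  Returns 1 if the wait was interrupted
 * by a signal, 0 on timeout, -1 on any other error.
 */
static int
wait_for_signal(int signo, long timeout_secs)
{
	sigset_t	waitmask;
	struct timespec timeout;
	int			rc;

	timeout.tv_sec = timeout_secs;
	timeout.tv_nsec = 0;

	/* Wait mask = current mask with signo unblocked. */
	if (sigprocmask(SIG_BLOCK, NULL, &waitmask) != 0)
		return -1;
	sigdelset(&waitmask, signo);

	/* Unblocking and waiting happen atomically inside pselect(). */
	rc = pselect(0, NULL, NULL, NULL, &timeout, &waitmask);
	if (rc < 0)
		return (errno == EINTR) ? 1 : -1;
	return 0;
}
```

With the unblock/select/reblock sequence, a signal delivered between the unblock and the select() call would run its handler and then leave select() sleeping for the full timeout.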

pselect(2) has been required by POSIX since about 2001, so most modern
platforms should have it. A bigger portability issue is that some
implementations are said to be non-atomic, i.e., pselect() isn't really
any different from unblock/select/reblock. Still, we're no worse off
than before on such a platform.

There is talk of rewriting the postmaster to use a WaitEventSet and
not do signal response work in signal handlers, at which point this
could be reverted, since we'd be using a self-pipe to solve the race
condition. But that's not happening before v11 at the earliest.

Back-patch to 9.6. The problem exists much further back, but the
worst symptom arises only in connection with parallel query, so it
does not seem worth taking any portability risks in older branches.

Discussion: https://postgr.es/m/9205.1492833041@sss.pgh.pa.us

81069a9e

Run the postmaster's signal handlers without SA_RESTART. · 89390208

Tom Lane authored Apr 24, 2017

The postmaster keeps signals blocked everywhere except while waiting
for something to happen in ServerLoop(). The code expects that the
select(2) will be cancelled with EINTR if an interrupt occurs; without
that, followup actions that should be performed by ServerLoop() itself
will be delayed. However, some platforms interpret the SA_RESTART
signal flag as meaning that they should restart rather than cancel
the select(2). Worse yet, some of them restart it with the original
timeout delay, meaning that a steady stream of signal interrupts can
prevent ServerLoop() from iterating at all if there are no incoming
connection requests.

Observable symptoms of this, on an affected platform such as HPUX 10,
include extremely slow parallel query startup (possibly as much as
30 seconds) and failure to update timestamps on the postmaster's sockets
and lockfiles when no new connections arrive for a long time.

We can fix this by running the postmaster's signal handlers without
SA_RESTART. That would be quite a scary change if the range of code
where signals are accepted weren't so tiny, but as it is, it seems
safe enough. (Note that postmaster children do, and must, reset all
the handlers before unblocking signals; so this change should not
affect any child process.)
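
For illustration, a handler can be installed with or without SA_RESTART through sigaction(2) along these lines. The helper names install_handler() and noop_handler() are hypothetical; this is a sketch of the flag's use, not the postmaster's actual signal setup.

```c
#ifndef _POSIX_C_SOURCE
#define _POSIX_C_SOURCE 200809L
#endif
#include <signal.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

static void
noop_handler(int signo)
{
	(void) signo;
}

/*
 * Install handler for signo.  With restart_syscalls = false the handler is
 * installed without SA_RESTART, so an interrupted blocking call such as
 * select(2) fails with EINTR instead of being restarted by the system.
 */
static int
install_handler(int signo, void (*handler) (int), bool restart_syscalls)
{
	struct sigaction sa;

	memset(&sa, 0, sizeof(sa));
	sa.sa_handler = handler;
	sigemptyset(&sa.sa_mask);
	sa.sa_flags = restart_syscalls ? SA_RESTART : 0;
	return sigaction(signo, &sa, NULL);
}
```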

There is talk of rewriting the postmaster to use a WaitEventSet and
not do signal response work in signal handlers, at which point it might
be appropriate to revert this patch. But that's not happening before
v11 at the earliest.

Back-patch to 9.6. The problem exists much further back, but the
worst symptom arises only in connection with parallel query, so it
does not seem worth taking any portability risks in older branches.

Discussion: https://postgr.es/m/9205.1492833041@sss.pgh.pa.us

89390208

Get rid of extern declarations of non-existent functions. · cbc2270e
Fujii Masao authored Apr 25, 2017
```
Those extern declarations were mistakenly added by commit 7c4f5240.

Author: Petr Jelinek
```
cbc2270e

Fix postmaster's handling of fork failure for a bgworker process. · 4fe04244

Tom Lane authored Apr 24, 2017

This corner case didn't behave nicely at all: the postmaster would
(partially) update its state as though the process had started
successfully, and be quite confused thereafter.  Fix it to act
like the worker had crashed, instead.

In passing, refactor so that do_start_bgworker contains all the
state-change logic for bgworker launch, rather than just some of it.

Back-patch as far as 9.4.  9.3 contains similar logic, but it's just
enough different that I don't feel comfortable applying the patch
without more study; and the use of bgworkers in 9.3 was so small
that it doesn't seem worth the extra work.

transam/parallel.c is still entirely unprepared for the possibility
of bgworker startup failure, but that seems like material for a
separate patch.

Discussion: https://postgr.es/m/4905.1492813727@sss.pgh.pa.us

4fe04244

Code review for commands/statscmds.c. · 4b34624d

Tom Lane authored Apr 24, 2017

Fix machine-dependent sorting of column numbers.  (Odd behavior
would only materialize for column numbers above 255, but that's
certainly legal.)
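
The commit message does not say what the faulty comparison looked like. One common way to get this kind of machine dependence, shown here purely as an assumption, is comparing 16-bit values byte by byte, which orders them by their in-memory byte layout rather than by value. The comparator names below are hypothetical.

```c
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/*
 * Byte-wise comparison: the result depends on endianness.  On a
 * little-endian machine 256 (bytes 00 01) sorts before 1 (bytes 01 00).
 */
static int
compare_int16_bytes(const void *a, const void *b)
{
	return memcmp(a, b, sizeof(int16_t));
}

/* Value comparison: gives the same order on every machine. */
static int
compare_int16_value(const void *a, const void *b)
{
	int			av = *(const int16_t *) a;
	int			bv = *(const int16_t *) b;

	/* Cannot overflow where int is wider than int16_t. */
	return av - bv;
}
```

Both comparators agree as long as every value fits in one byte, which matches the note that odd behavior would only show up for column numbers above 255.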

Fix poor choice of SQLSTATE for some errors, and improve error message
wording.  (Notably, "is not a scalar type" is a totally misleading way
to explain "does not have a default btree opclass".)

Avoid taking AccessExclusiveLock on the associated relation during DROP
STATISTICS.  That's neither necessary nor desirable, and it could easily
have put us into situations where DROP fails (compare commit 68ea2b7f).

Adjust/improve comments.

David Rowley and Tom Lane

Discussion: https://postgr.es/m/CAKJS1f-GmCfPvBbAEaM5xoVOaYdVgVN1gicALSoYQ77z-+vLbw@mail.gmail.com

4b34624d

23 Apr, 2017 8 commits

Don't include sys/poll.h anymore. · b182a4ae

Andres Freund authored Apr 23, 2017

poll.h is mandated by Single Unix Spec v2, the usual baseline for
postgres on unix.  None of the unixoid buildfarm animals has
sys/poll.h but not poll.h.  Therefore there's not much point in testing
for sys/poll.h's existence and including it optionally.

Author: Andres Freund, per suggestion from Tom Lane
Discussion: https://postgr.es/m/20505.1492723662@sss.pgh.pa.us

b182a4ae

Zero padding in replication origin's checkpointed on-disk state. · eb97aa7e

Andres Freund authored Apr 23, 2017

This seems to be largely cosmetic, avoiding valgrind bleats and the
like. The uninitialized padding influences the CRC of the on-disk
entry, but because it's also used when verifying the CRC, that doesn't
cause spurious failures.  Backpatch nonetheless.
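
A small sketch of the general technique, assuming a hypothetical record layout (OriginRecord and fill_record() are not the real on-disk structure or code). Zeroing the whole struct before filling in its fields makes the padding bytes predictable, so any checksum or byte-wise comparison over the struct sees the same bytes every time.

```c
#include <stdint.h>
#include <string.h>

/*
 * Hypothetical record layout.  On common ABIs the compiler inserts padding
 * between id and lsn so that lsn is 8-byte aligned.
 */
typedef struct OriginRecord
{
	uint16_t	id;
	uint64_t	lsn;
} OriginRecord;

/*
 * Zero the whole struct, padding included, before setting the fields.
 * Common compilers leave the padding untouched by the member stores that
 * follow, so the bytes written out (and fed to a CRC) are deterministic.
 */
static void
fill_record(OriginRecord *rec, uint16_t id, uint64_t lsn)
{
	memset(rec, 0, sizeof(*rec));
	rec->id = id;
	rec->lsn = lsn;
}
```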

It's a bit unfortunate that contrib/test_decoding/sql/replorigin.sql
doesn't exercise the checkpoint path, but checkpoints are fairly
expensive on weaker machines, and we'd have to stop/start for that to
be meaningful.

Author: Andres Freund
Discussion: https://postgr.es/m/20170422183123.w2jgiuxtts7qrqaq@alap3.anarazel.de
Backpatch: 9.5, where replication origins were introduced

eb97aa7e

Initialize all memory for logical replication relation cache. · e84d243b

Andres Freund authored Apr 23, 2017

As reported by buildfarm animal skink / valgrind, some of the
variables weren't always initialized.  To avoid further mishaps use
memset to ensure the entire entry is initialized.

Author: Petr Jelinek
Reported-By: Andres Freund
Discussion: https://postgr.es/m/20170422183123.w2jgiuxtts7qrqaq@alap3.anarazel.de
Backpatch: none, code new in master

e84d243b

Remove select(2) backed latch implementation. · 61c21dda

Andres Freund authored Apr 23, 2017

poll(2) is required by Single Unix Spec v2, the usual baseline for
postgres (leaving windows aside).  There haven't been any buildfarm
animals without poll(2) for a long while, leaving the select(2)
implementation largely untested.

On windows, including mingw, poll() is not available, but we have a
special case implementation for windows anyway.

Author: Andres Freund
Discussion: https://postgr.es/m/20170420003611.7r2sdvehesdyiz2i@alap3.anarazel.de

61c21dda

Workaround for RecoverPreparedTransactions() · 546c13e1

Simon Riggs authored Apr 23, 2017

Force overwriteOK = true while we investigate a deeper fix

Proposed by Tom Lane as a temporary measure, accepted by me

546c13e1

Fix LagTrackerRead() for timeline increments · 84638808

Simon Riggs authored Apr 23, 2017

Bug was masked by error in running 004_timeline_switch.pl that was
fixed recently in 7d68f228.

Detective work by Alvaro Herrera and Tom Lane

Author: Thomas Munro

84638808

Fix order of arguments to SubTransSetParent(). · 0874d4f3

Tom Lane authored Apr 23, 2017

ProcessTwoPhaseBuffer (formerly StandbyRecoverPreparedTransactions)
mixed up the parent and child XIDs when calling SubTransSetParent to
record the transactions' relationship in pg_subtrans.

Remarkably, analysis by Simon Riggs suggests that this doesn't lead to
visible problems (at least, not in non-Assert builds). That might
explain why we'd not noticed it before. Nonetheless, it's surely wrong.

This code was born broken, so back-patch to all supported branches.

Discussion: https://postgr.es/m/20110.1492905318@sss.pgh.pa.us

0874d4f3

Fix TAP infrastructure to support Mingw better · 33f3bbc6

Andrew Dunstan authored Apr 23, 2017

archive_command and restore_command need to refer to Windows paths, not
Msys virtual file system paths, as postgres is completely unaware of the
latter, so prefix them with the Windows path to the virtual file system
root. Clean psql and pg_recvlogical output of carriage returns.

33f3bbc6

22 Apr, 2017 1 commit

Make PostgresNode.pm check server status more carefully. · 7d68f228

Tom Lane authored Apr 22, 2017

PostgresNode blithely ignored the exit status of pg_ctl, and in general
made no effort to be sure that the server was running when it should be.
This caused it to miss server crashes, which is a serious shortcoming
in a test scaffold.  Make it complain if pg_ctl fails, and modify the
start and stop logic to complain if the server doesn't start, or doesn't
stop, when expected.

Also, have it turn off the "restart_after_crash" configuration parameter
in created clusters, as bitter experience has shown that leaving that on
can mask crashes too.

We might at some point need variant functions that allow for, eg,
server start failure to be expected.  But no existing test case appears
to want that, and it surely shouldn't be the default behavior.

Note that this *will* break the buildfarm, as it will expose known
bugs that the previous testing failed to.  I'm committing it despite
that, to verify that we get the expected failures in the buildfarm
not just in manual testing.

Back-patch into 9.6 where PostgresNode was introduced.  (The 9.6
branch is not expected to show any failures.)

Discussion: https://postgr.es/m/21432.1492886428@sss.pgh.pa.us

7d68f228