Commit 4d894b41 authored by Heikki Linnakangas's avatar Heikki Linnakangas

Change the order that pg_xlog and WAL archive are polled for WAL segments.

If there is a WAL segment with same ID but different TLI present in both
the WAL archive and pg_xlog, prefer the one with higher TLI. Before this
patch, the archive was polled first, for all expected TLIs, and only if no
file was found was pg_xlog scanned. This was a change in behavior from 9.3,
which first scanned archive and pg_xlog for the highest TLI, then archive
and pg_xlog for the next highest TLI and so forth. This patch reverts the
behavior back to what it was in 9.2.

The reason for this is that if for example you try to do archive recovery
to timeline 2, which branched off timeline 1, but the WAL for timeline 2 is
not archived yet, we would replay past the timeline switch point on
timeline 1 using the archived files, before even looking timeline 2's files
in pg_xlog

Report and patch by Kyotaro Horiguchi. Backpatch to 9.3 where the behavior
was changed.
parent 0f2ca007
...@@ -11006,17 +11006,15 @@ WaitForWALToBecomeAvailable(XLogRecPtr RecPtr, bool randAccess, ...@@ -11006,17 +11006,15 @@ WaitForWALToBecomeAvailable(XLogRecPtr RecPtr, bool randAccess,
/*------- /*-------
* Standby mode is implemented by a state machine: * Standby mode is implemented by a state machine:
* *
* 1. Read from archive (XLOG_FROM_ARCHIVE) * 1. Read from either archive or pg_xlog (XLOG_FROM_ARCHIVE), or just
* 2. Read from pg_xlog (XLOG_FROM_PG_XLOG) * pg_xlog (XLOG_FROM_XLOG)
* 3. Check trigger file * 2. Check trigger file
* 4. Read from primary server via walreceiver (XLOG_FROM_STREAM) * 3. Read from primary server via walreceiver (XLOG_FROM_STREAM)
* 5. Rescan timelines * 4. Rescan timelines
* 6. Sleep 5 seconds, and loop back to 1. * 5. Sleep 5 seconds, and loop back to 1.
* *
* Failure to read from the current source advances the state machine to * Failure to read from the current source advances the state machine to
* the next state. In addition, successfully reading a file from pg_xlog * the next state.
* moves the state machine from state 2 back to state 1 (we always prefer
* files in the archive over files in pg_xlog).
* *
* 'currentSource' indicates the current state. There are no currentSource * 'currentSource' indicates the current state. There are no currentSource
* values for "check trigger", "rescan timelines", and "sleep" states, * values for "check trigger", "rescan timelines", and "sleep" states,
...@@ -11044,9 +11042,6 @@ WaitForWALToBecomeAvailable(XLogRecPtr RecPtr, bool randAccess, ...@@ -11044,9 +11042,6 @@ WaitForWALToBecomeAvailable(XLogRecPtr RecPtr, bool randAccess,
switch (currentSource) switch (currentSource)
{ {
case XLOG_FROM_ARCHIVE: case XLOG_FROM_ARCHIVE:
currentSource = XLOG_FROM_PG_XLOG;
break;
case XLOG_FROM_PG_XLOG: case XLOG_FROM_PG_XLOG:
/* /*
...@@ -11212,7 +11207,9 @@ WaitForWALToBecomeAvailable(XLogRecPtr RecPtr, bool randAccess, ...@@ -11212,7 +11207,9 @@ WaitForWALToBecomeAvailable(XLogRecPtr RecPtr, bool randAccess,
* Try to restore the file from archive, or read an existing * Try to restore the file from archive, or read an existing
* file from pg_xlog. * file from pg_xlog.
*/ */
readFile = XLogFileReadAnyTLI(readSegNo, DEBUG2, currentSource); readFile = XLogFileReadAnyTLI(readSegNo, DEBUG2,
currentSource == XLOG_FROM_ARCHIVE ? XLOG_FROM_ANY :
currentSource);
if (readFile >= 0) if (readFile >= 0)
return true; /* success! */ return true; /* success! */
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment