Commit 3114cb60 authored by Tom Lane

Don't advance checkPoint.nextXid near the end of a checkpoint sequence.

This reverts commit c1113069 in favor of
actually fixing the problem: namely, that we should never have been
modifying the checkpoint record's nextXid at this point to begin with.
The nextXid should match the state as of the checkpoint's logical WAL
position (i.e., the redo point), not the state as of its physical position.
It's especially bogus to advance it in some wal_levels and not others.
In any case there is no need for the checkpoint record to carry the
same nextXid shown in the XLOG_RUNNING_XACTS record just emitted by
LogStandbySnapshot, as any replay operation will already have adopted
that value as current.
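
The last point can be illustrated with a toy replay model. This is only a sketch, not PostgreSQL code: `WalRecord`, `xid_follows` and `replay` are hypothetical names, special XIDs are ignored, and the only behavior assumed is the one described above, namely that replaying a running-xacts record adopts its nextXid when that value is newer than the current one.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

typedef uint32_t TransactionId;

/* Hypothetical, simplified WAL record: only the fields the model needs. */
typedef enum { REC_RUNNING_XACTS, REC_CHECKPOINT } RecKind;

typedef struct
{
    RecKind       kind;
    TransactionId nextXid;   /* nextXid value carried by the record */
} WalRecord;

/* Modular "a is newer than b" test; special XIDs are ignored for brevity. */
static int
xid_follows(TransactionId a, TransactionId b)
{
    return (int32_t) (a - b) > 0;
}

/*
 * Replay the records found between the redo point and the checkpoint
 * record.  start_xid is the nextXid read from the checkpoint record itself.
 */
static TransactionId
replay(TransactionId start_xid, const WalRecord *recs, size_t n)
{
    TransactionId cur = start_xid;

    assert(recs != NULL || n == 0);
    for (size_t i = 0; i < n; i++)
    {
        /* In this model only running-xacts records move the counter. */
        if (recs[i].kind == REC_RUNNING_XACTS &&
            xid_follows(recs[i].nextXid, cur))
            cur = recs[i].nextXid;
    }
    return cur;
}
```

In this model, calling `replay` with the redo-point value or with the already-advanced value as `start_xid` ends in the same state once the running-xacts record has been replayed, which is why the checkpoint record gains nothing by carrying the later value.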

This fixes bug #7710 from Tarvi Pillessaar, and probably also explains bug
#6291 from Daniel Farina, in that if a checkpoint were in progress at the
instant of XID wraparound, the epoch bump would be lost as reported.
(And, of course, these days there's at least a 50-50 chance of a checkpoint
being in progress at any given instant.)
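
One plausible reading of the lost-epoch scenario can be sketched as a small model. Everything here is an assumption made for illustration: `CheckPointModel`, `begin_checkpoint`, `finish_checkpoint` and `epoch_after_wrap` are hypothetical names, the XID values are made up, and the epoch rule (bump when nextXid is lower than in the previous checkpoint) is assumed rather than taken from this diff. The `advance_late` flag models overwriting nextXid late in the checkpoint without touching the epoch.

```c
#include <assert.h>
#include <stdint.h>

typedef uint32_t TransactionId;

/* Hypothetical stand-in for the XID fields of a checkpoint record. */
typedef struct
{
    uint32_t      nextXidEpoch;
    TransactionId nextXid;
} CheckPointModel;

/*
 * Start a checkpoint: take nextXid as of the redo point and bump the epoch
 * if the counter went backwards since the previous checkpoint (assumed rule).
 */
static CheckPointModel
begin_checkpoint(CheckPointModel prev, TransactionId xid_at_redo)
{
    CheckPointModel cp;

    cp.nextXid = xid_at_redo;
    cp.nextXidEpoch = prev.nextXidEpoch;
    if (cp.nextXid < prev.nextXid)
        cp.nextXidEpoch++;
    return cp;
}

/*
 * Finish a checkpoint.  With advance_late set, overwrite nextXid with the
 * later value and leave the epoch alone; otherwise keep the redo-point value.
 */
static CheckPointModel
finish_checkpoint(CheckPointModel cp, TransactionId xid_now, int advance_late)
{
    if (advance_late)
        cp.nextXid = xid_now;
    return cp;
}

/* Two checkpoints, with XID wraparound during the first; returns the epoch. */
static uint32_t
epoch_after_wrap(int advance_late)
{
    CheckPointModel prev = {0, 4000000000u};
    CheckPointModel a, b;

    a = begin_checkpoint(prev, 4294967000u);
    a = finish_checkpoint(a, 500, advance_late);  /* XIDs wrapped meanwhile */
    assert(a.nextXidEpoch == 0);                  /* no bump recorded yet */

    b = begin_checkpoint(a, 1000);
    b = finish_checkpoint(b, 1000, advance_late);
    return b.nextXidEpoch;
}
```

With `advance_late` set, the first checkpoint stores a post-wraparound nextXid under the old epoch, so the second checkpoint sees no backwards step and the bump never happens. With the redo-point value kept, the second checkpoint detects the wrap and increments the epoch.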

Diagnosed by me and independently by Andres Freund.  Back-patch to all
branches supporting hot standby.
parent 5c117258
@@ -7119,18 +7119,9 @@ CreateCheckPoint(int flags)
 	 *
 	 * If we are shutting down, or Startup process is completing crash
 	 * recovery we don't need to write running xact data.
-	 *
-	 * Update checkPoint.nextXid since we may have a later value. If we
-	 * do update the value, and we have wrapped, increment epoch also.
 	 */
 	if (!shutdown && XLogStandbyInfoActive())
-	{
-		TransactionId prevXid = checkPoint.nextXid;
-		LogStandbySnapshot(&checkPoint.nextXid);
-		if (checkPoint.nextXid < prevXid)
-			checkPoint.nextXidEpoch++;
-	}
+		LogStandbySnapshot();
 	START_CRIT_SECTION();
...
@@ -848,7 +848,7 @@ standby_redo(XLogRecPtr lsn, XLogRecord *record)
  * from a time when they were possible.
  */
 void
-LogStandbySnapshot(TransactionId *nextXid)
+LogStandbySnapshot(void)
 {
 	RunningTransactions running;
 	xl_standby_lock *locks;
@@ -877,8 +877,6 @@ LogStandbySnapshot(TransactionId *nextXid)
 	LogCurrentRunningXacts(running);
 	/* GetRunningTransactionData() acquired XidGenLock, we must release it */
 	LWLockRelease(XidGenLock);
-	*nextXid = running->nextXid;
 }
 /*
...
@@ -113,6 +113,6 @@ typedef RunningTransactionsData *RunningTransactions;
 extern void LogAccessExclusiveLock(Oid dbOid, Oid relOid);
 extern void LogAccessExclusiveLockPrepare(void);
-extern void LogStandbySnapshot(TransactionId *nextXid);
+extern void LogStandbySnapshot(void);
 #endif   /* STANDBY_H */