Commit db76b1ef authored by Andres Freund's avatar Andres Freund

Allow SetHintBits() to succeed if the buffer's LSN is new enough.

Previously we only allowed SetHintBits() to succeed if the commit LSN of
the last transaction touching the page has already been flushed to
disk. We can't generally change the LSN of the page, because we don't
necessarily have the required locks on the page. But the required LSN
interlock does not mean the commit record has to be flushed immediately,
it just requires that the commit record will be flushed before the page is
written out. Therefore if the buffer LSN is newer than the commit LSN,
the hint bit can be safely set.

In a number of scenarios (e.g. pgbench) this noticeably increases the
number of hint bits are set. But more importantly it also keeps the
success rate up when flushing WAL less frequently. That was the original
reason for commit 4de82f7d, which has negative performance consequences
in a number of scenarios. This will allow a followup commit to reduce
the flush rate.

Discussion: 20160118163908.GW10941@awork2.anarazel.de
parent cfafd8be
...@@ -89,12 +89,13 @@ static bool XidInMVCCSnapshot(TransactionId xid, Snapshot snapshot); ...@@ -89,12 +89,13 @@ static bool XidInMVCCSnapshot(TransactionId xid, Snapshot snapshot);
* Set commit/abort hint bits on a tuple, if appropriate at this time. * Set commit/abort hint bits on a tuple, if appropriate at this time.
* *
* It is only safe to set a transaction-committed hint bit if we know the * It is only safe to set a transaction-committed hint bit if we know the
* transaction's commit record has been flushed to disk, or if the table is * transaction's commit record is guaranteed to be flushed to disk before the
* temporary or unlogged and will be obliterated by a crash anyway. We * buffer, or if the table is temporary or unlogged and will be obliterated by
* cannot change the LSN of the page here because we may hold only a share * a crash anyway. We cannot change the LSN of the page here, because we may
* lock on the buffer, so we can't use the LSN to interlock this; we have to * hold only a share lock on the buffer, so we can only use the LSN to
* just refrain from setting the hint bit until some future re-examination * interlock this if the buffer's LSN already is newer than the commit LSN;
* of the tuple. * otherwise we have to just refrain from setting the hint bit until some
* future re-examination of the tuple.
* *
* We can always set hint bits when marking a transaction aborted. (Some * We can always set hint bits when marking a transaction aborted. (Some
* code in heapam.c relies on that!) * code in heapam.c relies on that!)
...@@ -122,8 +123,12 @@ SetHintBits(HeapTupleHeader tuple, Buffer buffer, ...@@ -122,8 +123,12 @@ SetHintBits(HeapTupleHeader tuple, Buffer buffer,
/* NB: xid must be known committed here! */ /* NB: xid must be known committed here! */
XLogRecPtr commitLSN = TransactionIdGetCommitLSN(xid); XLogRecPtr commitLSN = TransactionIdGetCommitLSN(xid);
if (XLogNeedsFlush(commitLSN) && BufferIsPermanent(buffer)) if (BufferIsPermanent(buffer) && XLogNeedsFlush(commitLSN) &&
return; /* not flushed yet, so don't set hint */ BufferGetLSNAtomic(buffer) < commitLSN)
{
/* not flushed and no LSN interlock, so don't set hint */
return;
}
} }
tuple->t_infomask |= infomask; tuple->t_infomask |= infomask;
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment