Commit e6799d5a authored by Andres Freund's avatar Andres Freund

Move page initialization from RelationAddExtraBlocks() to use.

Previously we initialized pages when bulk extending in
RelationAddExtraBlocks(). That has a major disadvantage: It ties
RelationAddExtraBlocks() to heap, as other types of storage are likely
to need different amounts of special space, have different amount of
free space (previously determined by PageGetHeapFreeSpace()).

That we're relying on initializing pages, but not WAL logging the
initialization, also means the risk for getting
"WARNING:  relation \"%s\" page %u is uninitialized --- fixing"
style warnings in vacuums after crashes/immediate shutdowns, is
considerably higher. The warning sounds much more serious than what
they are.

Fix those two issues together by not initializing pages in
RelationAddExtraPages() (but continue to do so in
RelationGetBufferForTuple(), which is linked much more closely to
heap), and accepting uninitialized pages as normal in
vacuumlazy.c. When vacuumlazy encounters an empty page it now adds it
to the FSM, but does nothing else.  We chose to not issue a debug
message, much less a warning in that case - it seems rarely useful,
and quite likely to scare people unnecessarily.

For now empty pages aren't added to the VM, because standbys would not
re-discover such pages after a promotion. In contrast to other sources
for empty pages, there's no corresponding WAL records triggering FSM
updates during replay.

Author: Andres Freund
Reviewed-By: Tom Lane
Discussion: https://postgr.es/m/20181219083945.6khtgm36mivonhva@alap3.anarazel.de
parent d4316b87
...@@ -204,7 +204,8 @@ RelationAddExtraBlocks(Relation relation, BulkInsertState bistate) ...@@ -204,7 +204,8 @@ RelationAddExtraBlocks(Relation relation, BulkInsertState bistate)
/* /*
* Extend by one page. This should generally match the main-line * Extend by one page. This should generally match the main-line
* extension code in RelationGetBufferForTuple, except that we hold * extension code in RelationGetBufferForTuple, except that we hold
* the relation extension lock throughout. * the relation extension lock throughout, and we don't immediately
* initialize the page (see below).
*/ */
buffer = ReadBufferBI(relation, P_NEW, bistate); buffer = ReadBufferBI(relation, P_NEW, bistate);
...@@ -216,18 +217,16 @@ RelationAddExtraBlocks(Relation relation, BulkInsertState bistate) ...@@ -216,18 +217,16 @@ RelationAddExtraBlocks(Relation relation, BulkInsertState bistate)
BufferGetBlockNumber(buffer), BufferGetBlockNumber(buffer),
RelationGetRelationName(relation)); RelationGetRelationName(relation));
PageInit(page, BufferGetPageSize(buffer), 0);
/* /*
* We mark all the new buffers dirty, but do nothing to write them * Add the page to the FSM without initializing. If we were to
* out; they'll probably get used soon, and even if they are not, a * initialize here the page would potentially get flushed out to disk
* crash will leave an okay all-zeroes page on disk. * before we add any useful content. There's no guarantee that that'd
* happen before a potential crash, so we need to deal with
* uninitialized pages anyway, thus avoid the potential for
* unnecessary writes.
*/ */
MarkBufferDirty(buffer);
/* we'll need this info below */
blockNum = BufferGetBlockNumber(buffer); blockNum = BufferGetBlockNumber(buffer);
freespace = PageGetHeapFreeSpace(page); freespace = BufferGetPageSize(buffer) - SizeOfPageHeaderData;
UnlockReleaseBuffer(buffer); UnlockReleaseBuffer(buffer);
...@@ -479,6 +478,18 @@ loop: ...@@ -479,6 +478,18 @@ loop:
* we're done. * we're done.
*/ */
page = BufferGetPage(buffer); page = BufferGetPage(buffer);
/*
* Initialize page, it'll be used soon. We could avoid dirtying the
* buffer here, and rely on the caller to do so whenever it puts a
* tuple onto the page, but there seems not much benefit in doing so.
*/
if (PageIsNew(page))
{
PageInit(page, BufferGetPageSize(buffer), 0);
MarkBufferDirty(buffer);
}
pageFreeSpace = PageGetHeapFreeSpace(page); pageFreeSpace = PageGetHeapFreeSpace(page);
if (len + saveFreeSpace <= pageFreeSpace) if (len + saveFreeSpace <= pageFreeSpace)
{ {
......
...@@ -861,42 +861,38 @@ lazy_scan_heap(Relation onerel, int options, LVRelStats *vacrelstats, ...@@ -861,42 +861,38 @@ lazy_scan_heap(Relation onerel, int options, LVRelStats *vacrelstats,
if (PageIsNew(page)) if (PageIsNew(page))
{ {
/* /*
* An all-zeroes page could be left over if a backend extends the * All-zeroes pages can be left over if either a backend extends
* relation but crashes before initializing the page. Reclaim such * the relation by a single page, but crashes before the newly
* pages for use. * initialized page has been written out, or when bulk-extending
* the relation (which creates a number of empty pages at the tail
* end of the relation, but enters them into the FSM).
* *
* We have to be careful here because we could be looking at a * Make sure these pages are in the FSM, to ensure they can be
* page that someone has just added to the relation and not yet * reused. Do that by testing if there's any space recorded for
* been able to initialize (see RelationGetBufferForTuple). To * the page. If not, enter it.
* protect against that, release the buffer lock, grab the
* relation extension lock momentarily, and re-lock the buffer. If
* the page is still uninitialized by then, it must be left over
* from a crashed backend, and we can initialize it.
* *
* We don't really need the relation lock when this is a new or * Note we do not enter the page into the visibilitymap. That has
* temp relation, but it's probably not worth the code space to * the downside that we repeatedly visit this page in subsequent
* check that, since this surely isn't a critical path. * vacuums, but otherwise we'll never not discover the space on a
* * promoted standby. The harm of repeated checking ought to
* Note: the comparable code in vacuum.c need not worry because * normally not be too bad - the space usually should be used at
* it's got exclusive lock on the whole relation. * some point, otherwise there wouldn't be any regular vacuums.
*/ */
LockBuffer(buf, BUFFER_LOCK_UNLOCK);
LockRelationForExtension(onerel, ExclusiveLock);
UnlockRelationForExtension(onerel, ExclusiveLock);
LockBufferForCleanup(buf);
if (PageIsNew(page))
{
ereport(WARNING,
(errmsg("relation \"%s\" page %u is uninitialized --- fixing",
relname, blkno)));
PageInit(page, BufferGetPageSize(buf), 0);
empty_pages++; empty_pages++;
}
freespace = PageGetHeapFreeSpace(page); /*
MarkBufferDirty(buf); * Perform checking of FSM after releasing lock, the fsm is
* approximate, after all.
*/
UnlockReleaseBuffer(buf); UnlockReleaseBuffer(buf);
if (GetRecordedFreeSpace(onerel, blkno) == 0)
{
Size freespace;
freespace = BufferGetPageSize(buf) - SizeOfPageHeaderData;
RecordPageWithFreeSpace(onerel, blkno, freespace); RecordPageWithFreeSpace(onerel, blkno, freespace);
}
continue; continue;
} }
...@@ -905,7 +901,10 @@ lazy_scan_heap(Relation onerel, int options, LVRelStats *vacrelstats, ...@@ -905,7 +901,10 @@ lazy_scan_heap(Relation onerel, int options, LVRelStats *vacrelstats,
empty_pages++; empty_pages++;
freespace = PageGetHeapFreeSpace(page); freespace = PageGetHeapFreeSpace(page);
/* empty pages are always all-visible and all-frozen */ /*
* Empty pages are always all-visible and all-frozen (note that
* the same is currently not true for new pages, see above).
*/
if (!PageIsAllVisible(page)) if (!PageIsAllVisible(page))
{ {
START_CRIT_SECTION(); START_CRIT_SECTION();
...@@ -1639,12 +1638,13 @@ lazy_check_needs_freeze(Buffer buf, bool *hastup) ...@@ -1639,12 +1638,13 @@ lazy_check_needs_freeze(Buffer buf, bool *hastup)
*hastup = false; *hastup = false;
/* If we hit an uninitialized page, we want to force vacuuming it. */ /*
if (PageIsNew(page)) * New and empty pages, obviously, don't contain tuples. We could make
return true; * sure that the page is registered in the FSM, but it doesn't seem worth
* waiting for a cleanup lock just for that, especially because it's
/* Quick out for ordinary empty page. */ * likely that the pin holder will do so.
if (PageIsEmpty(page)) */
if (PageIsNew(page) || PageIsEmpty(page))
return false; return false;
maxoff = PageGetMaxOffsetNumber(page); maxoff = PageGetMaxOffsetNumber(page);
...@@ -2029,7 +2029,6 @@ count_nondeletable_pages(Relation onerel, LVRelStats *vacrelstats) ...@@ -2029,7 +2029,6 @@ count_nondeletable_pages(Relation onerel, LVRelStats *vacrelstats)
if (PageIsNew(page) || PageIsEmpty(page)) if (PageIsNew(page) || PageIsEmpty(page))
{ {
/* PageIsNew probably shouldn't happen... */
UnlockReleaseBuffer(buf); UnlockReleaseBuffer(buf);
continue; continue;
} }
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment