Commit 522baf14 authored by Michael Paquier's avatar Michael Paquier

Delay fsyncs of pg_basebackup until the end of backup

Since the addition of fsync requests in bc34223b to make base backup data
consistent on disk once pg_basebackup finishes, each tablespace tar file
is individually flushed once completed, with an additional flush of the
parent directory when the base backup finishes.  While holding a
connection to the server, a fsync request taking a long time may cause a
failure of the base backup, which is annoying for any integration.  A
recent example of breakage can involve tcp_user_timeout, but
wal_sender_timeout can cause similar problems.

While reviewing the code, there was a second issue causing too many
fsync requests to be done for the same WAL data.  As recursive fsyncs
are done at the end of the backup for both the plain and tar formats
from the base target directory where everything is written, it is fine
to disable fsyncs when fetching or streaming WAL.

Reported-by: Ryohei Takahashi
Author: Michael Paquier
Reviewed-by: Ryohei Takahashi
Discussion: https://postgr.es/m/OSBPR01MB4550DAE2F8C9502894A45AAB82BE0@OSBPR01MB4550.jpnprd01.prod.outlook.com
Backpatch-through: 10
parent 25dcc9d3
...@@ -486,15 +486,18 @@ LogStreamerMain(logstreamer_param *param) ...@@ -486,15 +486,18 @@ LogStreamerMain(logstreamer_param *param)
#endif #endif
stream.standby_message_timeout = standby_message_timeout; stream.standby_message_timeout = standby_message_timeout;
stream.synchronous = false; stream.synchronous = false;
stream.do_sync = do_sync; /* fsync happens at the end of pg_basebackup for all data */
stream.do_sync = false;
stream.mark_done = true; stream.mark_done = true;
stream.partial_suffix = NULL; stream.partial_suffix = NULL;
stream.replication_slot = replication_slot; stream.replication_slot = replication_slot;
if (format == 'p') if (format == 'p')
stream.walmethod = CreateWalDirectoryMethod(param->xlog, 0, do_sync); stream.walmethod = CreateWalDirectoryMethod(param->xlog, 0,
stream.do_sync);
else else
stream.walmethod = CreateWalTarMethod(param->xlog, compresslevel, do_sync); stream.walmethod = CreateWalTarMethod(param->xlog, compresslevel,
stream.do_sync);
if (!ReceiveXlogStream(param->bgconn, &stream)) if (!ReceiveXlogStream(param->bgconn, &stream))
...@@ -1346,9 +1349,10 @@ ReceiveTarFile(PGconn *conn, PGresult *res, int rownum) ...@@ -1346,9 +1349,10 @@ ReceiveTarFile(PGconn *conn, PGresult *res, int rownum)
if (copybuf != NULL) if (copybuf != NULL)
PQfreemem(copybuf); PQfreemem(copybuf);
/* sync the resulting tar file, errors are not considered fatal */ /*
if (do_sync && strcmp(basedir, "-") != 0) * Do not sync the resulting tar file yet, all files are synced once at
(void) fsync_fname(filename, false); * the end.
*/
} }
...@@ -2138,9 +2142,9 @@ BaseBackup(void) ...@@ -2138,9 +2142,9 @@ BaseBackup(void)
/* /*
* Make data persistent on disk once backup is completed. For tar format * Make data persistent on disk once backup is completed. For tar format
* once syncing the parent directory is fine, each tar file created per * sync the parent directory and all its contents as each tar file was not
* tablespace has been already synced. In plain format, all the data of * synced after being completed. In plain format, all the data of the
* the base directory is synced, taking into account all the tablespaces. * base directory is synced, taking into account all the tablespaces.
* Errors are not considered fatal. * Errors are not considered fatal.
*/ */
if (do_sync) if (do_sync)
...@@ -2150,7 +2154,7 @@ BaseBackup(void) ...@@ -2150,7 +2154,7 @@ BaseBackup(void)
if (format == 't') if (format == 't')
{ {
if (strcmp(basedir, "-") != 0) if (strcmp(basedir, "-") != 0)
(void) fsync_fname(basedir, true); (void) fsync_dir_recurse(basedir);
} }
else else
{ {
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment