• Michael Paquier's avatar
    Delay fsyncs of pg_basebackup until the end of backup · 522baf14
    Michael Paquier authored
    Since the addition of fsync requests in bc34223b to make base backup data
    consistent on disk once pg_basebackup finishes, each tablespace tar file
    is individually flushed once completed, with an additional flush of the
    parent directory when the base backup finishes.  While holding a
    connection to the server, a fsync request taking a long time may cause a
    failure of the base backup, which is annoying for any integration.  A
    recent example of breakage can involve tcp_user_timeout, but
    wal_sender_timeout can cause similar problems.
    
    While reviewing the code, there was a second issue causing too many
    fsync requests to be done for the same WAL data.  As recursive fsyncs
    are done at the end of the backup for both the plain and tar formats
    from the base target directory where everything is written, it is fine
    to disable fsyncs when fetching or streaming WAL.
    
    Reported-by: Ryohei Takahashi
    Author: Michael Paquier
    Reviewed-by: Ryohei Takahashi
    Discussion: https://postgr.es/m/OSBPR01MB4550DAE2F8C9502894A45AAB82BE0@OSBPR01MB4550.jpnprd01.prod.outlook.com
    Backpatch-through: 10
    522baf14
pg_basebackup.c 62.3 KB