Commit 165565cd authored by Bruce Momjian's avatar Bruce Momjian

Some additional doc changes based around compression of page images in

WAL and the interaction of the new full_page_writes parameter with PITR.

The too-small WAL first sect1 has been merged with the one following
sect1 for clarity.

Some minor comments have been made in the WAL config section also.

Passes SGML make and proofread for typos.
Files changed:
patching file doc/src/sgml/backup.sgml
patching file doc/src/sgml/config.sgml
patching file doc/src/sgml/wal.sgml

Simon Riggs
parent 3af21fa5
<!-- <!--
$PostgreSQL: pgsql/doc/src/sgml/backup.sgml,v 2.69 2005/06/25 22:47:28 tgl Exp $ $PostgreSQL: pgsql/doc/src/sgml/backup.sgml,v 2.70 2005/10/13 17:32:42 momjian Exp $
--> -->
<chapter id="backup"> <chapter id="backup">
<title>Backup and Restore</title> <title>Backup and Restore</title>
...@@ -1147,13 +1147,22 @@ restore_command = 'copy /mnt/server/archivedir/%f "%p"' # Windows ...@@ -1147,13 +1147,22 @@ restore_command = 'copy /mnt/server/archivedir/%f "%p"' # Windows
</para> </para>
<para> <para>
It should also be noted that the present <acronym>WAL</acronym> It should also be noted that the default <acronym>WAL</acronym>
format is extremely bulky since it includes many disk page format is fairly bulky since it includes many disk page snapshots. The pages
snapshots. This is appropriate for crash recovery purposes, are partially compressed, using the simple expedient of removing the
empty space (if any) within each block. You can significantly reduce
the total volume of archived logs by turning off page snapshots
using the <xref linkend="guc-full-page-writes"> parameter,
though you should read the notes and warnings in
<xref linkend="reliability"> before you do so.
These page snapshots are designed to allow crash recovery,
since we may need to fix partially-written disk pages. It is not since we may need to fix partially-written disk pages. It is not
necessary to store so many page copies for PITR operations, however. necessary to store these page copies for PITR operations, however.
If you turn off <xref linkend="guc-full-page-writes">, your PITR
backup and recovery operations will continue to work successfully.
An area for future development is to compress archived WAL data by An area for future development is to compress archived WAL data by
removing unnecessary page copies. In the meantime, administrators removing unnecessary page copies when <xref linkend="guc-full-page-writes">
is turned on. In the meantime, administrators
may wish to reduce the number of page snapshots included in WAL by may wish to reduce the number of page snapshots included in WAL by
increasing the checkpoint interval parameters as much as feasible. increasing the checkpoint interval parameters as much as feasible.
</para> </para>
......
<!-- <!--
$PostgreSQL: pgsql/doc/src/sgml/config.sgml,v 1.25 2005/10/08 20:27:25 tgl Exp $ $PostgreSQL: pgsql/doc/src/sgml/config.sgml,v 1.26 2005/10/13 17:32:42 momjian Exp $
--> -->
<chapter Id="runtime-config"> <chapter Id="runtime-config">
<title>Run-time Configuration</title> <title>Run-time Configuration</title>
...@@ -1360,7 +1360,7 @@ SET ENABLE_SEQSCAN TO OFF; ...@@ -1360,7 +1360,7 @@ SET ENABLE_SEQSCAN TO OFF;
<para> <para>
When this option is on, the <productname>PostgreSQL</> server When this option is on, the <productname>PostgreSQL</> server
writes full pages to WAL when they are first modified after a writes full pages to WAL when they are first modified after a
checkpoint so full recovery is possible. Turning this option off checkpoint so crash recovery is possible. Turning this option off
might lead to a corrupt system after an operating system crash might lead to a corrupt system after an operating system crash
or power failure because uncorrected partial pages might contain or power failure because uncorrected partial pages might contain
inconsistent or corrupt data. The risks are less but similar to inconsistent or corrupt data. The risks are less but similar to
......
<!-- $PostgreSQL: pgsql/doc/src/sgml/wal.sgml,v 1.35 2005/10/01 01:42:43 momjian Exp $ --> <!-- $PostgreSQL: pgsql/doc/src/sgml/wal.sgml,v 1.36 2005/10/13 17:32:42 momjian Exp $ -->
<chapter id="reliability"> <chapter id="reliability">
<title>Reliability</title> <title>Reliability</title>
...@@ -12,7 +12,7 @@ ...@@ -12,7 +12,7 @@
failure (unrelated to the non-volatile area itself). To accomplish failure (unrelated to the non-volatile area itself). To accomplish
this, <productname>PostgreSQL</> uses the magnetic platters of modern this, <productname>PostgreSQL</> uses the magnetic platters of modern
disk drives for permanent storage that is immune to the failures disk drives for permanent storage that is immune to the failures
listed above. In fact, a computer can be completely destroyed, but if listed above. In fact, even if a computer is fatally damaged, if
the disk drives survive they can be moved to another computer with the disk drives survive they can be moved to another computer with
similar hardware and all committed transactions will remain intact. similar hardware and all committed transactions will remain intact.
</para> </para>
...@@ -68,11 +68,13 @@ ...@@ -68,11 +68,13 @@
these partially written cases. To guard against that, these partially written cases. To guard against that,
<productname>PostgreSQL</> periodically writes full page images to <productname>PostgreSQL</> periodically writes full page images to
permanent storage <emphasis>before</> modifying the actual page on permanent storage <emphasis>before</> modifying the actual page on
disk. By doing this, during recovery <productname>PostgreSQL</> can disk. By doing this, during crash recovery <productname>PostgreSQL</> can
restore partially-written pages. If you have a battery-backed disk restore partially-written pages. If you have a battery-backed disk
controller that prevents partial page writes, you can turn off this controller or filesystem (e.g. Reiser4) that prevents partial page writes,
page imaging by using the <xref linkend="guc-full-page-writes"> you can turn off this page imaging by using the
parameter. <xref linkend="guc-full-page-writes"> parameter. This parameter has no
effect on the successful use of Point in Time Recovery (PITR),
described in <xref linkend="backup-online">.
</para> </para>
<para> <para>
...@@ -107,14 +109,10 @@ ...@@ -107,14 +109,10 @@
the data pages can be redone from the log records. (This is the data pages can be redone from the log records. (This is
roll-forward recovery, also known as REDO.) roll-forward recovery, also known as REDO.)
</para> </para>
</sect1>
<sect1 id="wal-benefits">
<title>Benefits of Write-Ahead Logging</title>
<indexterm zone="wal-benefits"> <para>
<primary>fsync</primary> WAL brings three major benefits:
</indexterm> </para>
<para> <para>
The first major benefit of using <acronym>WAL</acronym> is a The first major benefit of using <acronym>WAL</acronym> is a
...@@ -131,11 +129,11 @@ ...@@ -131,11 +129,11 @@
</para> </para>
<para> <para>
The next benefit is consistency of the data pages. The truth is The next benefit is crash recovery protection. The truth is
that, before <acronym>WAL</acronym>, that, before <acronym>WAL</acronym> was introduced back in release 7.1,
<productname>PostgreSQL</productname> was never able to guarantee <productname>PostgreSQL</productname> was never able to guarantee
consistency in the case of a crash. Before consistency in the case of a crash. Now,
<acronym>WAL</acronym>, any crash during writing could result in: <acronym>WAL</acronym> protects fully against the following problems:
<orderedlist> <orderedlist>
<listitem> <listitem>
...@@ -151,13 +149,6 @@ ...@@ -151,13 +149,6 @@
of partially written data pages</simpara> of partially written data pages</simpara>
</listitem> </listitem>
</orderedlist> </orderedlist>
Problems with indexes (problems 1 and 2) could possibly have been
fixed by additional <function>fsync</function> calls, but it is
not obvious how to handle the last case without
<acronym>WAL</acronym>. <acronym>WAL</acronym> saves the entire data
page content in the log if that is required to ensure page
consistency for after-crash recovery.
</para> </para>
<para> <para>
...@@ -214,12 +205,14 @@ ...@@ -214,12 +205,14 @@
<varname>checkpoint_timeout</varname> causes checkpoints to be done <varname>checkpoint_timeout</varname> causes checkpoints to be done
more often. This allows faster after-crash recovery (since less work more often. This allows faster after-crash recovery (since less work
will need to be redone). However, one must balance this against the will need to be redone). However, one must balance this against the
increased cost of flushing dirty data pages more often. In addition, increased cost of flushing dirty data pages more often. If
to ensure data page consistency, the first modification of a data <xref linkend="guc-full-page-writes"> is set (the default), there is
page after each checkpoint results in logging the entire page another factor to consider. To ensure data page consistency,
content. Thus a smaller checkpoint interval increases the volume of the first modification of a data page after each checkpoint results in
output to the WAL log, partially negating the goal of using a smaller logging the entire page content. In that case,
interval, and in any case causing more disk I/O. a smaller checkpoint interval increases the volume of output to the WAL log,
partially negating the goal of using a smaller interval,
and in any case causing more disk I/O.
</para> </para>
<para> <para>
...@@ -234,7 +227,9 @@ ...@@ -234,7 +227,9 @@
a message will be output to the server log recommending increasing a message will be output to the server log recommending increasing
<varname>checkpoint_segments</varname>. Occasional appearance of such <varname>checkpoint_segments</varname>. Occasional appearance of such
a message is not cause for alarm, but if it appears often then the a message is not cause for alarm, but if it appears often then the
checkpoint control parameters should be increased. checkpoint control parameters should be increased. Bulk operations such
as a COPY, INSERT SELECT etc. may cause a number of such warnings if you
do not set <xref linkend="guc-checkpoint-segments"> high enough.
</para> </para>
<para> <para>
...@@ -252,7 +247,7 @@ ...@@ -252,7 +247,7 @@
</para> </para>
<para> <para>
There are two commonly used <acronym>WAL</acronym> functions: There are two commonly used internal <acronym>WAL</acronym> functions:
<function>LogInsert</function> and <function>LogFlush</function>. <function>LogInsert</function> and <function>LogFlush</function>.
<function>LogInsert</function> is used to place a new record into <function>LogInsert</function> is used to place a new record into
the <acronym>WAL</acronym> buffers in shared memory. If there is no the <acronym>WAL</acronym> buffers in shared memory. If there is no
...@@ -275,9 +270,11 @@ ...@@ -275,9 +270,11 @@
modifying the configuration parameter <xref modifying the configuration parameter <xref
linkend="guc-wal-buffers">. The default number of <acronym>WAL</acronym> linkend="guc-wal-buffers">. The default number of <acronym>WAL</acronym>
buffers is 8. Increasing this value will buffers is 8. Increasing this value will
correspondingly increase shared memory usage. (It should be noted correspondingly increase shared memory usage. When
that there is presently little evidence to suggest that increasing <xref linkend="guc-full-page-writes"> is set and the system is very busy,
<varname>wal_buffers</> beyond the default is worthwhile.) setting this value higher will help smooth response times during the
period immediately following each checkpoint. As a guide, a setting of 1024
would be considered to be high.
</para> </para>
<para> <para>
...@@ -313,7 +310,8 @@ ...@@ -313,7 +310,8 @@
(provided that <productname>PostgreSQL</productname> has been (provided that <productname>PostgreSQL</productname> has been
compiled with support for it) will result in each compiled with support for it) will result in each
<function>LogInsert</function> and <function>LogFlush</function> <function>LogInsert</function> and <function>LogFlush</function>
<acronym>WAL</acronym> call being logged to the server log. This <acronym>WAL</acronym> call being logged to the server log. The output
is too verbose for use as a guide to performance tuning. This
option may be replaced by a more general mechanism in the future. option may be replaced by a more general mechanism in the future.
</para> </para>
</sect1> </sect1>
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment