Commit 40d964ec authored by Amit Kapila's avatar Amit Kapila

Allow vacuum command to process indexes in parallel.

This feature allows the vacuum to leverage multiple CPUs in order to
process indexes.  This enables us to perform index vacuuming and index
cleanup with background workers.  This adds a PARALLEL option to VACUUM
command where the user can specify the number of workers that can be used
to perform the command which is limited by the number of indexes on a
table.  Specifying zero as a number of workers will disable parallelism.
This option can't be used with the FULL option.

Each index is processed by at most one vacuum process.  Therefore parallel
vacuum can be used when the table has at least two indexes.

The parallel degree is either specified by the user or determined based on
the number of indexes that the table has, and further limited by
max_parallel_maintenance_workers.  The index can participate in parallel
vacuum iff it's size is greater than min_parallel_index_scan_size.

Author: Masahiko Sawada and Amit Kapila
Reviewed-by: Dilip Kumar, Amit Kapila, Robert Haas, Tomas Vondra,
Mahendra Singh and Sergei Kornilov
Tested-by: Mahendra Singh and Prabhat Sahu
Discussion:
https://postgr.es/m/CAD21AoDTPMgzSkV4E3SFo1CH_x50bf5PqZFQf4jmqjk-C03BWg@mail.gmail.com
https://postgr.es/m/CAA4eK1J-VoR9gzS5E75pcD-OH0mEyCdp8RihcwKrcuw7J-Q0+w@mail.gmail.com
parent 44f1fc8d
...@@ -2308,13 +2308,13 @@ include_dir 'conf.d' ...@@ -2308,13 +2308,13 @@ include_dir 'conf.d'
<listitem> <listitem>
<para> <para>
Sets the maximum number of parallel workers that can be Sets the maximum number of parallel workers that can be
started by a single utility command. Currently, the only started by a single utility command. Currently, the parallel
parallel utility command that supports the use of parallel utility commands that support the use of parallel workers are
workers is <command>CREATE INDEX</command>, and only when <command>CREATE INDEX</command> only when building a B-tree index,
building a B-tree index. Parallel workers are taken from the and <command>VACUUM</command> without <literal>FULL</literal>
pool of processes established by <xref option. Parallel workers are taken from the pool of processes
linkend="guc-max-worker-processes"/>, limited by <xref established by <xref linkend="guc-max-worker-processes"/>, limited
linkend="guc-max-parallel-workers"/>. Note that the requested by <xref linkend="guc-max-parallel-workers"/>. Note that the requested
number of workers may not actually be available at run time. number of workers may not actually be available at run time.
If this occurs, the utility operation will run with fewer If this occurs, the utility operation will run with fewer
workers than expected. The default value is 2. Setting this workers than expected. The default value is 2. Setting this
...@@ -4915,7 +4915,9 @@ ANY <replaceable class="parameter">num_sync</replaceable> ( <replaceable class=" ...@@ -4915,7 +4915,9 @@ ANY <replaceable class="parameter">num_sync</replaceable> ( <replaceable class="
for a parallel scan to be considered. Note that a parallel index scan for a parallel scan to be considered. Note that a parallel index scan
typically won't touch the entire index; it is the number of pages typically won't touch the entire index; it is the number of pages
which the planner believes will actually be touched by the scan which which the planner believes will actually be touched by the scan which
is relevant. is relevant. This parameter is also used to decide whether a
particular index can participate in a parallel vacuum. See
<xref linkend="sql-vacuum"/>.
If this value is specified without units, it is taken as blocks, If this value is specified without units, it is taken as blocks,
that is <symbol>BLCKSZ</symbol> bytes, typically 8kB. that is <symbol>BLCKSZ</symbol> bytes, typically 8kB.
The default is 512 kilobytes (<literal>512kB</literal>). The default is 512 kilobytes (<literal>512kB</literal>).
......
...@@ -34,6 +34,7 @@ VACUUM [ FULL ] [ FREEZE ] [ VERBOSE ] [ ANALYZE ] [ <replaceable class="paramet ...@@ -34,6 +34,7 @@ VACUUM [ FULL ] [ FREEZE ] [ VERBOSE ] [ ANALYZE ] [ <replaceable class="paramet
SKIP_LOCKED [ <replaceable class="parameter">boolean</replaceable> ] SKIP_LOCKED [ <replaceable class="parameter">boolean</replaceable> ]
INDEX_CLEANUP [ <replaceable class="parameter">boolean</replaceable> ] INDEX_CLEANUP [ <replaceable class="parameter">boolean</replaceable> ]
TRUNCATE [ <replaceable class="parameter">boolean</replaceable> ] TRUNCATE [ <replaceable class="parameter">boolean</replaceable> ]
PARALLEL <replaceable class="parameter">integer</replaceable>
<phrase>and <replaceable class="parameter">table_and_columns</replaceable> is:</phrase> <phrase>and <replaceable class="parameter">table_and_columns</replaceable> is:</phrase>
...@@ -75,10 +76,14 @@ VACUUM [ FULL ] [ FREEZE ] [ VERBOSE ] [ ANALYZE ] [ <replaceable class="paramet ...@@ -75,10 +76,14 @@ VACUUM [ FULL ] [ FREEZE ] [ VERBOSE ] [ ANALYZE ] [ <replaceable class="paramet
with normal reading and writing of the table, as an exclusive lock with normal reading and writing of the table, as an exclusive lock
is not obtained. However, extra space is not returned to the operating is not obtained. However, extra space is not returned to the operating
system (in most cases); it's just kept available for re-use within the system (in most cases); it's just kept available for re-use within the
same table. <command>VACUUM FULL</command> rewrites the entire contents same table. It also allows us to leverage multiple CPUs in order to process
of the table into a new disk file with no extra space, allowing unused indexes. This feature is known as <firstterm>parallel vacuum</firstterm>.
space to be returned to the operating system. This form is much slower and To disable this feature, one can use <literal>PARALLEL</literal> option and
requires an exclusive lock on each table while it is being processed. specify parallel workers as zero. <command>VACUUM FULL</command> rewrites
the entire contents of the table into a new disk file with no extra space,
allowing unused space to be returned to the operating system. This form is
much slower and requires an exclusive lock on each table while it is being
processed.
</para> </para>
<para> <para>
...@@ -223,6 +228,33 @@ VACUUM [ FULL ] [ FREEZE ] [ VERBOSE ] [ ANALYZE ] [ <replaceable class="paramet ...@@ -223,6 +228,33 @@ VACUUM [ FULL ] [ FREEZE ] [ VERBOSE ] [ ANALYZE ] [ <replaceable class="paramet
</listitem> </listitem>
</varlistentry> </varlistentry>
<varlistentry>
<term><literal>PARALLEL</literal></term>
<listitem>
<para>
Perform vacuum index and cleanup index phases of <command>VACUUM</command>
in parallel using <replaceable class="parameter">integer</replaceable>
background workers (for the detail of each vacuum phases, please
refer to <xref linkend="vacuum-phases"/>). If the
<literal>PARALLEL</literal> option is omitted, then
<command>VACUUM</command> decides the number of workers based on number
of indexes that support parallel vacuum operation on the relation which
is further limited by <xref linkend="guc-max-parallel-workers-maintenance"/>.
The index can participate in a parallel vacuum if and only if the size
of the index is more than <xref linkend="guc-min-parallel-index-scan-size"/>.
Please note that it is not guaranteed that the number of parallel workers
specified in <replaceable class="parameter">integer</replaceable> will
be used during execution. It is possible for a vacuum to run with fewer
workers than specified, or even with no workers at all. Only one worker
can be used per index. So parallel workers are launched only when there
are at least <literal>2</literal> indexes in the table. Workers for
vacuum launches before starting each phase and exit at the end of
the phase. These behaviors might change in a future release. This
option can't be used with the <literal>FULL</literal> option.
</para>
</listitem>
</varlistentry>
<varlistentry> <varlistentry>
<term><replaceable class="parameter">boolean</replaceable></term> <term><replaceable class="parameter">boolean</replaceable></term>
<listitem> <listitem>
...@@ -237,6 +269,15 @@ VACUUM [ FULL ] [ FREEZE ] [ VERBOSE ] [ ANALYZE ] [ <replaceable class="paramet ...@@ -237,6 +269,15 @@ VACUUM [ FULL ] [ FREEZE ] [ VERBOSE ] [ ANALYZE ] [ <replaceable class="paramet
</listitem> </listitem>
</varlistentry> </varlistentry>
<varlistentry>
<term><replaceable class="parameter">integer</replaceable></term>
<listitem>
<para>
Specifies a non-negative integer value passed to the selected option.
</para>
</listitem>
</varlistentry>
<varlistentry> <varlistentry>
<term><replaceable class="parameter">table_name</replaceable></term> <term><replaceable class="parameter">table_name</replaceable></term>
<listitem> <listitem>
...@@ -316,11 +357,19 @@ VACUUM [ FULL ] [ FREEZE ] [ VERBOSE ] [ ANALYZE ] [ <replaceable class="paramet ...@@ -316,11 +357,19 @@ VACUUM [ FULL ] [ FREEZE ] [ VERBOSE ] [ ANALYZE ] [ <replaceable class="paramet
more than a plain <command>VACUUM</command> would. more than a plain <command>VACUUM</command> would.
</para> </para>
<para>
The <option>PARALLEL</option> option is used only for vacuum purpose.
Even if this option is specified with <option>ANALYZE</option> option
it does not affect <option>ANALYZE</option>.
</para>
<para> <para>
<command>VACUUM</command> causes a substantial increase in I/O traffic, <command>VACUUM</command> causes a substantial increase in I/O traffic,
which might cause poor performance for other active sessions. Therefore, which might cause poor performance for other active sessions. Therefore,
it is sometimes advisable to use the cost-based vacuum delay feature. it is sometimes advisable to use the cost-based vacuum delay feature. For
See <xref linkend="runtime-config-resource-vacuum-cost"/> for details. parallel vacuum, each worker sleeps proportional to the work done by that
worker. See <xref linkend="runtime-config-resource-vacuum-cost"/> for
details.
</para> </para>
<para> <para>
......
This diff is collapsed.
...@@ -14,6 +14,7 @@ ...@@ -14,6 +14,7 @@
#include "postgres.h" #include "postgres.h"
#include "access/heapam.h"
#include "access/nbtree.h" #include "access/nbtree.h"
#include "access/parallel.h" #include "access/parallel.h"
#include "access/session.h" #include "access/session.h"
...@@ -139,6 +140,9 @@ static const struct ...@@ -139,6 +140,9 @@ static const struct
}, },
{ {
"_bt_parallel_build_main", _bt_parallel_build_main "_bt_parallel_build_main", _bt_parallel_build_main
},
{
"parallel_vacuum_main", parallel_vacuum_main
} }
}; };
...@@ -174,6 +178,7 @@ CreateParallelContext(const char *library_name, const char *function_name, ...@@ -174,6 +178,7 @@ CreateParallelContext(const char *library_name, const char *function_name,
pcxt = palloc0(sizeof(ParallelContext)); pcxt = palloc0(sizeof(ParallelContext));
pcxt->subid = GetCurrentSubTransactionId(); pcxt->subid = GetCurrentSubTransactionId();
pcxt->nworkers = nworkers; pcxt->nworkers = nworkers;
pcxt->nworkers_to_launch = nworkers;
pcxt->library_name = pstrdup(library_name); pcxt->library_name = pstrdup(library_name);
pcxt->function_name = pstrdup(function_name); pcxt->function_name = pstrdup(function_name);
pcxt->error_context_stack = error_context_stack; pcxt->error_context_stack = error_context_stack;
...@@ -486,6 +491,23 @@ ReinitializeParallelDSM(ParallelContext *pcxt) ...@@ -486,6 +491,23 @@ ReinitializeParallelDSM(ParallelContext *pcxt)
} }
} }
/*
* Reinitialize parallel workers for a parallel context such that we could
* launch the different number of workers. This is required for cases where
* we need to reuse the same DSM segment, but the number of workers can
* vary from run-to-run.
*/
void
ReinitializeParallelWorkers(ParallelContext *pcxt, int nworkers_to_launch)
{
/*
* The number of workers that need to be launched must be less than the
* number of workers with which the parallel context is initialized.
*/
Assert(pcxt->nworkers >= nworkers_to_launch);
pcxt->nworkers_to_launch = nworkers_to_launch;
}
/* /*
* Launch parallel workers. * Launch parallel workers.
*/ */
...@@ -498,7 +520,7 @@ LaunchParallelWorkers(ParallelContext *pcxt) ...@@ -498,7 +520,7 @@ LaunchParallelWorkers(ParallelContext *pcxt)
bool any_registrations_failed = false; bool any_registrations_failed = false;
/* Skip this if we have no workers. */ /* Skip this if we have no workers. */
if (pcxt->nworkers == 0) if (pcxt->nworkers == 0 || pcxt->nworkers_to_launch == 0)
return; return;
/* We need to be a lock group leader. */ /* We need to be a lock group leader. */
...@@ -533,7 +555,7 @@ LaunchParallelWorkers(ParallelContext *pcxt) ...@@ -533,7 +555,7 @@ LaunchParallelWorkers(ParallelContext *pcxt)
* fails. It wouldn't help much anyway, because registering the worker in * fails. It wouldn't help much anyway, because registering the worker in
* no way guarantees that it will start up and initialize successfully. * no way guarantees that it will start up and initialize successfully.
*/ */
for (i = 0; i < pcxt->nworkers; ++i) for (i = 0; i < pcxt->nworkers_to_launch; ++i)
{ {
memcpy(worker.bgw_extra, &i, sizeof(int)); memcpy(worker.bgw_extra, &i, sizeof(int));
if (!any_registrations_failed && if (!any_registrations_failed &&
......
...@@ -42,6 +42,7 @@ ...@@ -42,6 +42,7 @@
#include "nodes/makefuncs.h" #include "nodes/makefuncs.h"
#include "pgstat.h" #include "pgstat.h"
#include "postmaster/autovacuum.h" #include "postmaster/autovacuum.h"
#include "postmaster/bgworker_internals.h"
#include "storage/bufmgr.h" #include "storage/bufmgr.h"
#include "storage/lmgr.h" #include "storage/lmgr.h"
#include "storage/proc.h" #include "storage/proc.h"
...@@ -68,6 +69,14 @@ static MemoryContext vac_context = NULL; ...@@ -68,6 +69,14 @@ static MemoryContext vac_context = NULL;
static BufferAccessStrategy vac_strategy; static BufferAccessStrategy vac_strategy;
/*
* Variables for cost-based parallel vacuum. See comments atop
* compute_parallel_delay to understand how it works.
*/
pg_atomic_uint32 *VacuumSharedCostBalance = NULL;
pg_atomic_uint32 *VacuumActiveNWorkers = NULL;
int VacuumCostBalanceLocal = 0;
/* non-export function prototypes */ /* non-export function prototypes */
static List *expand_vacuum_rel(VacuumRelation *vrel, int options); static List *expand_vacuum_rel(VacuumRelation *vrel, int options);
static List *get_all_vacuum_rels(int options); static List *get_all_vacuum_rels(int options);
...@@ -76,6 +85,7 @@ static void vac_truncate_clog(TransactionId frozenXID, ...@@ -76,6 +85,7 @@ static void vac_truncate_clog(TransactionId frozenXID,
TransactionId lastSaneFrozenXid, TransactionId lastSaneFrozenXid,
MultiXactId lastSaneMinMulti); MultiXactId lastSaneMinMulti);
static bool vacuum_rel(Oid relid, RangeVar *relation, VacuumParams *params); static bool vacuum_rel(Oid relid, RangeVar *relation, VacuumParams *params);
static double compute_parallel_delay(void);
static VacOptTernaryValue get_vacopt_ternary_value(DefElem *def); static VacOptTernaryValue get_vacopt_ternary_value(DefElem *def);
/* /*
...@@ -94,12 +104,16 @@ ExecVacuum(ParseState *pstate, VacuumStmt *vacstmt, bool isTopLevel) ...@@ -94,12 +104,16 @@ ExecVacuum(ParseState *pstate, VacuumStmt *vacstmt, bool isTopLevel)
bool freeze = false; bool freeze = false;
bool full = false; bool full = false;
bool disable_page_skipping = false; bool disable_page_skipping = false;
bool parallel_option = false;
ListCell *lc; ListCell *lc;
/* Set default value */ /* Set default value */
params.index_cleanup = VACOPT_TERNARY_DEFAULT; params.index_cleanup = VACOPT_TERNARY_DEFAULT;
params.truncate = VACOPT_TERNARY_DEFAULT; params.truncate = VACOPT_TERNARY_DEFAULT;
/* By default parallel vacuum is enabled */
params.nworkers = 0;
/* Parse options list */ /* Parse options list */
foreach(lc, vacstmt->options) foreach(lc, vacstmt->options)
{ {
...@@ -129,6 +143,39 @@ ExecVacuum(ParseState *pstate, VacuumStmt *vacstmt, bool isTopLevel) ...@@ -129,6 +143,39 @@ ExecVacuum(ParseState *pstate, VacuumStmt *vacstmt, bool isTopLevel)
params.index_cleanup = get_vacopt_ternary_value(opt); params.index_cleanup = get_vacopt_ternary_value(opt);
else if (strcmp(opt->defname, "truncate") == 0) else if (strcmp(opt->defname, "truncate") == 0)
params.truncate = get_vacopt_ternary_value(opt); params.truncate = get_vacopt_ternary_value(opt);
else if (strcmp(opt->defname, "parallel") == 0)
{
parallel_option = true;
if (opt->arg == NULL)
{
ereport(ERROR,
(errcode(ERRCODE_SYNTAX_ERROR),
errmsg("parallel option requires a value between 0 and %d",
MAX_PARALLEL_WORKER_LIMIT),
parser_errposition(pstate, opt->location)));
}
else
{
int nworkers;
nworkers = defGetInt32(opt);
if (nworkers < 0 || nworkers > MAX_PARALLEL_WORKER_LIMIT)
ereport(ERROR,
(errcode(ERRCODE_SYNTAX_ERROR),
errmsg("parallel vacuum degree must be between 0 and %d",
MAX_PARALLEL_WORKER_LIMIT),
parser_errposition(pstate, opt->location)));
/*
* Disable parallel vacuum, if user has specified parallel
* degree as zero.
*/
if (nworkers == 0)
params.nworkers = -1;
else
params.nworkers = nworkers;
}
}
else else
ereport(ERROR, ereport(ERROR,
(errcode(ERRCODE_SYNTAX_ERROR), (errcode(ERRCODE_SYNTAX_ERROR),
...@@ -152,6 +199,11 @@ ExecVacuum(ParseState *pstate, VacuumStmt *vacstmt, bool isTopLevel) ...@@ -152,6 +199,11 @@ ExecVacuum(ParseState *pstate, VacuumStmt *vacstmt, bool isTopLevel)
!(params.options & (VACOPT_FULL | VACOPT_FREEZE))); !(params.options & (VACOPT_FULL | VACOPT_FREEZE)));
Assert(!(params.options & VACOPT_SKIPTOAST)); Assert(!(params.options & VACOPT_SKIPTOAST));
if ((params.options & VACOPT_FULL) && parallel_option)
ereport(ERROR,
(errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
errmsg("cannot specify both FULL and PARALLEL options")));
/* /*
* Make sure VACOPT_ANALYZE is specified if any column lists are present. * Make sure VACOPT_ANALYZE is specified if any column lists are present.
*/ */
...@@ -383,6 +435,9 @@ vacuum(List *relations, VacuumParams *params, ...@@ -383,6 +435,9 @@ vacuum(List *relations, VacuumParams *params,
VacuumPageHit = 0; VacuumPageHit = 0;
VacuumPageMiss = 0; VacuumPageMiss = 0;
VacuumPageDirty = 0; VacuumPageDirty = 0;
VacuumCostBalanceLocal = 0;
VacuumSharedCostBalance = NULL;
VacuumActiveNWorkers = NULL;
/* /*
* Loop to process each selected relation. * Loop to process each selected relation.
...@@ -1941,16 +1996,26 @@ vac_close_indexes(int nindexes, Relation *Irel, LOCKMODE lockmode) ...@@ -1941,16 +1996,26 @@ vac_close_indexes(int nindexes, Relation *Irel, LOCKMODE lockmode)
void void
vacuum_delay_point(void) vacuum_delay_point(void)
{ {
double msec = 0;
/* Always check for interrupts */ /* Always check for interrupts */
CHECK_FOR_INTERRUPTS(); CHECK_FOR_INTERRUPTS();
/* Nap if appropriate */ if (!VacuumCostActive || InterruptPending)
if (VacuumCostActive && !InterruptPending && return;
VacuumCostBalance >= VacuumCostLimit)
{
double msec;
/*
* For parallel vacuum, the delay is computed based on the shared cost
* balance. See compute_parallel_delay.
*/
if (VacuumSharedCostBalance != NULL)
msec = compute_parallel_delay();
else if (VacuumCostBalance >= VacuumCostLimit)
msec = VacuumCostDelay * VacuumCostBalance / VacuumCostLimit; msec = VacuumCostDelay * VacuumCostBalance / VacuumCostLimit;
/* Nap if appropriate */
if (msec > 0)
{
if (msec > VacuumCostDelay * 4) if (msec > VacuumCostDelay * 4)
msec = VacuumCostDelay * 4; msec = VacuumCostDelay * 4;
...@@ -1966,6 +2031,66 @@ vacuum_delay_point(void) ...@@ -1966,6 +2031,66 @@ vacuum_delay_point(void)
} }
} }
/*
* Computes the vacuum delay for parallel workers.
*
* The basic idea of a cost-based vacuum delay for parallel vacuum is to allow
* each worker to sleep proportional to the work done by it. We achieve this
* by allowing all parallel vacuum workers including the leader process to
* have a shared view of cost related parameters (mainly VacuumCostBalance).
* We allow each worker to update it as and when it has incurred any cost and
* then based on that decide whether it needs to sleep. We compute the time
* to sleep for a worker based on the cost it has incurred
* (VacuumCostBalanceLocal) and then reduce the VacuumSharedCostBalance by
* that amount. This avoids letting the workers sleep who have done less or
* no I/O as compared to other workers and therefore can ensure that workers
* who are doing more I/O got throttled more.
*
* We allow any worker to sleep only if it has performed the I/O above a
* certain threshold, which is calculated based on the number of active
* workers (VacuumActiveNWorkers), and the overall cost balance is more than
* VacuumCostLimit set by the system. The testing reveals that we achieve
* the required throttling if we allow a worker that has done more than 50%
* of its share of work to sleep.
*/
static double
compute_parallel_delay(void)
{
double msec = 0;
uint32 shared_balance;
int nworkers;
/* Parallel vacuum must be active */
Assert(VacuumSharedCostBalance);
nworkers = pg_atomic_read_u32(VacuumActiveNWorkers);
/* At least count itself */
Assert(nworkers >= 1);
/* Update the shared cost balance value atomically */
shared_balance = pg_atomic_add_fetch_u32(VacuumSharedCostBalance, VacuumCostBalance);
/* Compute the total local balance for the current worker */
VacuumCostBalanceLocal += VacuumCostBalance;
if ((shared_balance >= VacuumCostLimit) &&
(VacuumCostBalanceLocal > 0.5 * (VacuumCostLimit / nworkers)))
{
/* Compute sleep time based on the local cost balance */
msec = VacuumCostDelay * VacuumCostBalanceLocal / VacuumCostLimit;
pg_atomic_sub_fetch_u32(VacuumSharedCostBalance, VacuumCostBalanceLocal);
VacuumCostBalanceLocal = 0;
}
/*
* Reset the local balance as we accumulated it into the shared value.
*/
VacuumCostBalance = 0;
return msec;
}
/* /*
* A wrapper function of defGetBoolean(). * A wrapper function of defGetBoolean().
* *
......
...@@ -2886,6 +2886,8 @@ table_recheck_autovac(Oid relid, HTAB *table_toast_map, ...@@ -2886,6 +2886,8 @@ table_recheck_autovac(Oid relid, HTAB *table_toast_map,
(!wraparound ? VACOPT_SKIP_LOCKED : 0); (!wraparound ? VACOPT_SKIP_LOCKED : 0);
tab->at_params.index_cleanup = VACOPT_TERNARY_DEFAULT; tab->at_params.index_cleanup = VACOPT_TERNARY_DEFAULT;
tab->at_params.truncate = VACOPT_TERNARY_DEFAULT; tab->at_params.truncate = VACOPT_TERNARY_DEFAULT;
/* As of now, we don't support parallel vacuum for autovacuum */
tab->at_params.nworkers = -1;
tab->at_params.freeze_min_age = freeze_min_age; tab->at_params.freeze_min_age = freeze_min_age;
tab->at_params.freeze_table_age = freeze_table_age; tab->at_params.freeze_table_age = freeze_table_age;
tab->at_params.multixact_freeze_min_age = multixact_freeze_min_age; tab->at_params.multixact_freeze_min_age = multixact_freeze_min_age;
......
...@@ -3597,7 +3597,7 @@ psql_completion(const char *text, int start, int end) ...@@ -3597,7 +3597,7 @@ psql_completion(const char *text, int start, int end)
if (ends_with(prev_wd, '(') || ends_with(prev_wd, ',')) if (ends_with(prev_wd, '(') || ends_with(prev_wd, ','))
COMPLETE_WITH("FULL", "FREEZE", "ANALYZE", "VERBOSE", COMPLETE_WITH("FULL", "FREEZE", "ANALYZE", "VERBOSE",
"DISABLE_PAGE_SKIPPING", "SKIP_LOCKED", "DISABLE_PAGE_SKIPPING", "SKIP_LOCKED",
"INDEX_CLEANUP", "TRUNCATE"); "INDEX_CLEANUP", "TRUNCATE", "PARALLEL");
else if (TailMatches("FULL|FREEZE|ANALYZE|VERBOSE|DISABLE_PAGE_SKIPPING|SKIP_LOCKED|INDEX_CLEANUP|TRUNCATE")) else if (TailMatches("FULL|FREEZE|ANALYZE|VERBOSE|DISABLE_PAGE_SKIPPING|SKIP_LOCKED|INDEX_CLEANUP|TRUNCATE"))
COMPLETE_WITH("ON", "OFF"); COMPLETE_WITH("ON", "OFF");
} }
......
...@@ -23,7 +23,9 @@ ...@@ -23,7 +23,9 @@
#include "nodes/lockoptions.h" #include "nodes/lockoptions.h"
#include "nodes/primnodes.h" #include "nodes/primnodes.h"
#include "storage/bufpage.h" #include "storage/bufpage.h"
#include "storage/dsm.h"
#include "storage/lockdefs.h" #include "storage/lockdefs.h"
#include "storage/shm_toc.h"
#include "utils/relcache.h" #include "utils/relcache.h"
#include "utils/snapshot.h" #include "utils/snapshot.h"
...@@ -193,6 +195,7 @@ extern Size SyncScanShmemSize(void); ...@@ -193,6 +195,7 @@ extern Size SyncScanShmemSize(void);
struct VacuumParams; struct VacuumParams;
extern void heap_vacuum_rel(Relation onerel, extern void heap_vacuum_rel(Relation onerel,
struct VacuumParams *params, BufferAccessStrategy bstrategy); struct VacuumParams *params, BufferAccessStrategy bstrategy);
extern void parallel_vacuum_main(dsm_segment *seg, shm_toc *toc);
/* in heap/heapam_visibility.c */ /* in heap/heapam_visibility.c */
extern bool HeapTupleSatisfiesVisibility(HeapTuple stup, Snapshot snapshot, extern bool HeapTupleSatisfiesVisibility(HeapTuple stup, Snapshot snapshot,
......
...@@ -33,7 +33,8 @@ typedef struct ParallelContext ...@@ -33,7 +33,8 @@ typedef struct ParallelContext
{ {
dlist_node node; dlist_node node;
SubTransactionId subid; SubTransactionId subid;
int nworkers; int nworkers; /* Maximum number of workers to launch */
int nworkers_to_launch; /* Actual number of workers to launch */
int nworkers_launched; int nworkers_launched;
char *library_name; char *library_name;
char *function_name; char *function_name;
...@@ -63,6 +64,7 @@ extern ParallelContext *CreateParallelContext(const char *library_name, ...@@ -63,6 +64,7 @@ extern ParallelContext *CreateParallelContext(const char *library_name,
const char *function_name, int nworkers); const char *function_name, int nworkers);
extern void InitializeParallelDSM(ParallelContext *pcxt); extern void InitializeParallelDSM(ParallelContext *pcxt);
extern void ReinitializeParallelDSM(ParallelContext *pcxt); extern void ReinitializeParallelDSM(ParallelContext *pcxt);
extern void ReinitializeParallelWorkers(ParallelContext *pcxt, int nworkers_to_launch);
extern void LaunchParallelWorkers(ParallelContext *pcxt); extern void LaunchParallelWorkers(ParallelContext *pcxt);
extern void WaitForParallelWorkersToAttach(ParallelContext *pcxt); extern void WaitForParallelWorkersToAttach(ParallelContext *pcxt);
extern void WaitForParallelWorkersToFinish(ParallelContext *pcxt); extern void WaitForParallelWorkersToFinish(ParallelContext *pcxt);
......
...@@ -222,6 +222,13 @@ typedef struct VacuumParams ...@@ -222,6 +222,13 @@ typedef struct VacuumParams
* default value depends on reloptions */ * default value depends on reloptions */
VacOptTernaryValue truncate; /* Truncate empty pages at the end, VacOptTernaryValue truncate; /* Truncate empty pages at the end,
* default value depends on reloptions */ * default value depends on reloptions */
/*
* The number of parallel vacuum workers. 0 by default which means choose
* based on the number of indexes. -1 indicates a parallel vacuum is
* disabled.
*/
int nworkers;
} VacuumParams; } VacuumParams;
/* GUC parameters */ /* GUC parameters */
...@@ -231,6 +238,11 @@ extern int vacuum_freeze_table_age; ...@@ -231,6 +238,11 @@ extern int vacuum_freeze_table_age;
extern int vacuum_multixact_freeze_min_age; extern int vacuum_multixact_freeze_min_age;
extern int vacuum_multixact_freeze_table_age; extern int vacuum_multixact_freeze_table_age;
/* Variables for cost-based parallel vacuum */
extern pg_atomic_uint32 *VacuumSharedCostBalance;
extern pg_atomic_uint32 *VacuumActiveNWorkers;
extern int VacuumCostBalanceLocal;
/* in commands/vacuum.c */ /* in commands/vacuum.c */
extern void ExecVacuum(ParseState *pstate, VacuumStmt *vacstmt, bool isTopLevel); extern void ExecVacuum(ParseState *pstate, VacuumStmt *vacstmt, bool isTopLevel);
......
...@@ -92,6 +92,40 @@ CONTEXT: SQL function "do_analyze" statement 1 ...@@ -92,6 +92,40 @@ CONTEXT: SQL function "do_analyze" statement 1
SQL function "wrap_do_analyze" statement 1 SQL function "wrap_do_analyze" statement 1
VACUUM FULL vactst; VACUUM FULL vactst;
VACUUM (DISABLE_PAGE_SKIPPING) vaccluster; VACUUM (DISABLE_PAGE_SKIPPING) vaccluster;
-- PARALLEL option
CREATE TABLE pvactst (i INT, a INT[], p POINT) with (autovacuum_enabled = off);
INSERT INTO pvactst SELECT i, array[1,2,3], point(i, i+1) FROM generate_series(1,1000) i;
CREATE INDEX btree_pvactst ON pvactst USING btree (i);
CREATE INDEX hash_pvactst ON pvactst USING hash (i);
CREATE INDEX brin_pvactst ON pvactst USING brin (i);
CREATE INDEX gin_pvactst ON pvactst USING gin (a);
CREATE INDEX gist_pvactst ON pvactst USING gist (p);
CREATE INDEX spgist_pvactst ON pvactst USING spgist (p);
-- VACUUM invokes parallel index cleanup
SET min_parallel_index_scan_size to 0;
VACUUM (PARALLEL 2) pvactst;
-- VACUUM invokes parallel bulk-deletion
UPDATE pvactst SET i = i WHERE i < 1000;
VACUUM (PARALLEL 2) pvactst;
UPDATE pvactst SET i = i WHERE i < 1000;
VACUUM (PARALLEL 0) pvactst; -- disable parallel vacuum
VACUUM (PARALLEL -1) pvactst; -- error
ERROR: parallel vacuum degree must be between 0 and 1024
LINE 1: VACUUM (PARALLEL -1) pvactst;
^
VACUUM (PARALLEL 2, INDEX_CLEANUP FALSE) pvactst;
VACUUM (PARALLEL 2, FULL TRUE) pvactst; -- error, cannot use both PARALLEL and FULL
ERROR: cannot specify both FULL and PARALLEL options
VACUUM (PARALLEL) pvactst; -- error, cannot use PARALLEL option without parallel degree
ERROR: parallel option requires a value between 0 and 1024
LINE 1: VACUUM (PARALLEL) pvactst;
^
CREATE TEMPORARY TABLE tmp (a int PRIMARY KEY);
CREATE INDEX tmp_idx1 ON tmp (a);
VACUUM (PARALLEL 1) tmp; -- disables parallel vacuum option
WARNING: disabling parallel option of vacuum on "tmp" --- cannot vacuum temporary tables in parallel
RESET min_parallel_index_scan_size;
DROP TABLE pvactst;
-- INDEX_CLEANUP option -- INDEX_CLEANUP option
CREATE TABLE no_index_cleanup (i INT PRIMARY KEY, t TEXT); CREATE TABLE no_index_cleanup (i INT PRIMARY KEY, t TEXT);
-- Use uncompressed data stored in toast. -- Use uncompressed data stored in toast.
......
...@@ -75,6 +75,37 @@ VACUUM FULL vactst; ...@@ -75,6 +75,37 @@ VACUUM FULL vactst;
VACUUM (DISABLE_PAGE_SKIPPING) vaccluster; VACUUM (DISABLE_PAGE_SKIPPING) vaccluster;
-- PARALLEL option
CREATE TABLE pvactst (i INT, a INT[], p POINT) with (autovacuum_enabled = off);
INSERT INTO pvactst SELECT i, array[1,2,3], point(i, i+1) FROM generate_series(1,1000) i;
CREATE INDEX btree_pvactst ON pvactst USING btree (i);
CREATE INDEX hash_pvactst ON pvactst USING hash (i);
CREATE INDEX brin_pvactst ON pvactst USING brin (i);
CREATE INDEX gin_pvactst ON pvactst USING gin (a);
CREATE INDEX gist_pvactst ON pvactst USING gist (p);
CREATE INDEX spgist_pvactst ON pvactst USING spgist (p);
-- VACUUM invokes parallel index cleanup
SET min_parallel_index_scan_size to 0;
VACUUM (PARALLEL 2) pvactst;
-- VACUUM invokes parallel bulk-deletion
UPDATE pvactst SET i = i WHERE i < 1000;
VACUUM (PARALLEL 2) pvactst;
UPDATE pvactst SET i = i WHERE i < 1000;
VACUUM (PARALLEL 0) pvactst; -- disable parallel vacuum
VACUUM (PARALLEL -1) pvactst; -- error
VACUUM (PARALLEL 2, INDEX_CLEANUP FALSE) pvactst;
VACUUM (PARALLEL 2, FULL TRUE) pvactst; -- error, cannot use both PARALLEL and FULL
VACUUM (PARALLEL) pvactst; -- error, cannot use PARALLEL option without parallel degree
CREATE TEMPORARY TABLE tmp (a int PRIMARY KEY);
CREATE INDEX tmp_idx1 ON tmp (a);
VACUUM (PARALLEL 1) tmp; -- disables parallel vacuum option
RESET min_parallel_index_scan_size;
DROP TABLE pvactst;
-- INDEX_CLEANUP option -- INDEX_CLEANUP option
CREATE TABLE no_index_cleanup (i INT PRIMARY KEY, t TEXT); CREATE TABLE no_index_cleanup (i INT PRIMARY KEY, t TEXT);
-- Use uncompressed data stored in toast. -- Use uncompressed data stored in toast.
......
...@@ -1216,7 +1216,11 @@ LPVOID ...@@ -1216,7 +1216,11 @@ LPVOID
LPWSTR LPWSTR
LSEG LSEG
LUID LUID
LVDeadTuples
LVParallelState
LVRelStats LVRelStats
LVShared
LVSharedIndStats
LWLock LWLock
LWLockHandle LWLockHandle
LWLockMinimallyPadded LWLockMinimallyPadded
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment