Commit 3d13623d authored by Tom Lane

Prevent leakage of SPI tuple tables during subtransaction abort.

plpgsql often just remembers SPI-result tuple tables in local variables,
and has no mechanism for freeing them if an ereport(ERROR) causes an escape
out of the execution function whose local variable it is.  In the original
coding, that wasn't a problem because the tuple table would be cleaned up
when the function's SPI context went away during transaction abort.
However, once plpgsql grew the ability to trap exceptions, repeated
trapping of errors within a function could result in significant
intra-function-call memory leakage, as illustrated in bug #8279 from
Chad Wagner.

We could fix this locally in plpgsql with a bunch of PG_TRY/PG_CATCH
coding, but that would be tedious, probably slow, and prone to bugs of
omission; moreover it would do nothing for similar risks elsewhere.
What seems like a better plan is to make SPI itself responsible for
freeing tuple tables at subtransaction abort.  This patch attacks the
problem that way, keeping a list of live tuple tables within each SPI
function context.  Currently, such freeing is automatic for tuple tables
made within the failed subtransaction.  We might later add a SPI call to
mark a tuple table as not to be freed this way, allowing callers to opt
out; but until someone exhibits a clear use-case for such behavior, it
doesn't seem worth bothering.
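
To illustrate the bookkeeping described above, here is a minimal
standalone sketch, not the PostgreSQL code. The names Table, Context,
table_create, table_count and at_subxact_abort are hypothetical stand-ins,
and malloc/free replace memory contexts. Each table records the
subtransaction that created it and is linked into its context's list; on
abort, every table whose subid is at or above the aborting subtransaction
is unlinked and freed in a single pass.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical stand-ins for illustration; not the PostgreSQL definitions. */
typedef unsigned int SubXactId;

typedef struct Table
{
    struct Table *next;         /* link in the context's list of live tables */
    SubXactId   subid;          /* subtransaction that created this table */
} Table;

typedef struct Context
{
    Table      *tables;         /* head of the live-table list */
} Context;

/* Create a table and register it at once, so a later abort can find it. */
static Table *
table_create(Context *cxt, SubXactId subid)
{
    Table      *t = malloc(sizeof(Table));

    if (t == NULL)
        abort();
    t->subid = subid;
    t->next = cxt->tables;
    cxt->tables = t;
    return t;
}

/* Count the tables still registered in the context. */
static int
table_count(const Context *cxt)
{
    int         n = 0;

    for (const Table *t = cxt->tables; t != NULL; t = t->next)
        n++;
    return n;
}

/*
 * On abort of subtransaction mySubid, free every table created in it or in
 * a later (nested) subtransaction.  Unlinking through a pointer-to-pointer
 * keeps the sweep O(N).  Returns the number of tables freed.
 */
static int
at_subxact_abort(Context *cxt, SubXactId mySubid)
{
    int         freed = 0;
    Table     **link = &cxt->tables;

    while (*link != NULL)
    {
        Table      *t = *link;

        if (t->subid >= mySubid)
        {
            *link = t->next;    /* unlink first, then release */
            free(t);
            freed++;
        }
        else
            link = &t->next;
    }
    return freed;
}
```

With tables created in subtransactions 1, 2 and 3, aborting subtransaction
2 in this sketch frees the last two and leaves the first one registered.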

A very useful side-effect of this change is that SPI_freetuptable() can
now defend itself against bad calls, such as duplicate free requests;
this should make things more robust in many places.  (In particular,
this reduces the risks involved if a third-party extension contains
now-redundant SPI_freetuptable() calls in error cleanup code.)
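
The guard can be pictured with another small standalone sketch, again
under assumed names (Table, Context, table_create and table_free are
hypothetical, and malloc/free stand in for memory contexts). A free
request is honored only if the pointer is still found in the context's
list of live tables; a NULL pointer or a duplicate request is refused
rather than acted on. The real function reports the refusal with a
WARNING, which this sketch reduces to a false return value.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdlib.h>

/* Hypothetical stand-ins for illustration; not the PostgreSQL definitions. */
typedef struct Table
{
    struct Table *next;         /* link in the context's list of live tables */
} Table;

typedef struct Context
{
    Table      *tables;         /* head of the live-table list */
} Context;

/* Create a table and register it in the context's list. */
static Table *
table_create(Context *cxt)
{
    Table      *t = malloc(sizeof(Table));

    if (t == NULL)
        abort();
    t->next = cxt->tables;
    cxt->tables = t;
    return t;
}

/*
 * Free a table only if it is currently registered.  Returns true if the
 * table was found and freed, false if the request was ignored (NULL
 * pointer) or refused (pointer not in the list, e.g. a second free).
 */
static bool
table_free(Context *cxt, Table *table)
{
    if (table == NULL)
        return false;           /* ignore call if NULL pointer */

    for (Table **link = &cxt->tables; *link != NULL; link = &(*link)->next)
    {
        if (*link == table)
        {
            *link = table->next;    /* unlink, then release */
            free(table);
            return true;
        }
    }

    /* Not found: refuse rather than risk a double free. */
    return false;
}
```

The check is by pointer identity only, so it catches stale pointers as
long as the address has not been reused by a newer table.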

Even though the leakage problem is of long standing, it seems imprudent
to back-patch this into stable branches, since it does represent an API
semantics change for SPI users.  We'll patch this in 9.3, but live with
the leakage in older branches.
parent fd27b999
@@ -3934,8 +3934,8 @@ void SPI_freetuptable(SPITupleTable * <parameter>tuptable</parameter>)
 <para>
 <function>SPI_freetuptable</function> frees a row set created by a
 prior SPI command execution function, such as
-<function>SPI_execute</>. Therefore, this function is usually called
-with the global variable <varname>SPI_tupletable</varname> as
+<function>SPI_execute</>. Therefore, this function is often called
+with the global variable <varname>SPI_tuptable</varname> as
 argument.
 </para>
@@ -3944,6 +3944,16 @@ void SPI_freetuptable(SPITupleTable * <parameter>tuptable</parameter>)
 multiple commands and does not want to keep the results of earlier
 commands around until it ends. Note that any unfreed row sets will
 be freed anyway at <function>SPI_finish</>.
+Also, if a subtransaction is started and then aborted within execution
+of a SPI procedure, SPI automatically frees any row sets created while
+the subtransaction was running.
+</para>
+<para>
+Beginning in <productname>PostgreSQL</> 9.3,
+<function>SPI_freetuptable</function> contains guard logic to protect
+against duplicate deletion requests for the same row set. In previous
+releases, duplicate deletions would lead to crashes.
 </para>
 </refsect1>
@@ -3955,7 +3965,7 @@ void SPI_freetuptable(SPITupleTable * <parameter>tuptable</parameter>)
 <term><literal>SPITupleTable * <parameter>tuptable</parameter></literal></term>
 <listitem>
 <para>
-pointer to row set to free
+pointer to row set to free, or NULL to do nothing
 </para>
 </listitem>
 </varlistentry>
......
@@ -126,6 +126,7 @@ SPI_connect(void)
     _SPI_current->processed = 0;
     _SPI_current->lastoid = InvalidOid;
     _SPI_current->tuptable = NULL;
+    slist_init(&_SPI_current->tuptables);
     _SPI_current->procCxt = NULL;   /* in case we fail to create 'em */
     _SPI_current->execCxt = NULL;
     _SPI_current->connectSubid = GetCurrentSubTransactionId();
@@ -166,7 +167,7 @@ SPI_finish(void)
     /* Restore memory context as it was before procedure call */
     MemoryContextSwitchTo(_SPI_current->savedcxt);
-    /* Release memory used in procedure call */
+    /* Release memory used in procedure call (including tuptables) */
     MemoryContextDelete(_SPI_current->execCxt);
     _SPI_current->execCxt = NULL;
     MemoryContextDelete(_SPI_current->procCxt);
@@ -282,11 +283,35 @@ AtEOSubXact_SPI(bool isCommit, SubTransactionId mySubid)
      */
     if (_SPI_current && !isCommit)
     {
+        slist_mutable_iter siter;
         /* free Executor memory the same as _SPI_end_call would do */
         MemoryContextResetAndDeleteChildren(_SPI_current->execCxt);
-        /* throw away any partially created tuple-table */
-        SPI_freetuptable(_SPI_current->tuptable);
-        _SPI_current->tuptable = NULL;
+        /* throw away any tuple tables created within current subxact */
+        slist_foreach_modify(siter, &_SPI_current->tuptables)
+        {
+            SPITupleTable *tuptable;
+            tuptable = slist_container(SPITupleTable, next, siter.cur);
+            if (tuptable->subid >= mySubid)
+            {
+                /*
+                 * If we used SPI_freetuptable() here, its internal search of
+                 * the tuptables list would make this operation O(N^2).
+                 * Instead, just free the tuptable manually. This should
+                 * match what SPI_freetuptable() does.
+                 */
+                slist_delete_current(&siter);
+                if (tuptable == _SPI_current->tuptable)
+                    _SPI_current->tuptable = NULL;
+                if (tuptable == SPI_tuptable)
+                    SPI_tuptable = NULL;
+                MemoryContextDelete(tuptable->tuptabcxt);
+            }
+        }
+        /* in particular we should have gotten rid of any in-progress table */
+        Assert(_SPI_current->tuptable == NULL);
     }
 }
@@ -1021,7 +1046,58 @@ SPI_freetuple(HeapTuple tuple)
 void
 SPI_freetuptable(SPITupleTable *tuptable)
 {
-    if (tuptable != NULL)
-        MemoryContextDelete(tuptable->tuptabcxt);
+    bool found = false;
+    /* ignore call if NULL pointer */
+    if (tuptable == NULL)
+        return;
+    /*
+     * Since this function might be called during error recovery, it seems
+     * best not to insist that the caller be actively connected. We just
+     * search the topmost SPI context, connected or not.
+     */
+    if (_SPI_connected >= 0)
+    {
+        slist_mutable_iter siter;
+        if (_SPI_current != &(_SPI_stack[_SPI_connected]))
+            elog(ERROR, "SPI stack corrupted");
+        /* find tuptable in active list, then remove it */
+        slist_foreach_modify(siter, &_SPI_current->tuptables)
+        {
+            SPITupleTable *tt;
+            tt = slist_container(SPITupleTable, next, siter.cur);
+            if (tt == tuptable)
+            {
+                slist_delete_current(&siter);
+                found = true;
+                break;
+            }
+        }
+    }
+    /*
+     * Refuse the deletion if we didn't find it in the topmost SPI context.
+     * This is primarily a guard against double deletion, but might prevent
+     * other errors as well. Since the worst consequence of not deleting a
+     * tuptable would be a transient memory leak, this is just a WARNING.
+     */
+    if (!found)
+    {
+        elog(WARNING, "attempt to delete invalid SPITupleTable %p", tuptable);
+        return;
+    }
+    /* for safety, reset global variables that might point at tuptable */
+    if (tuptable == _SPI_current->tuptable)
+        _SPI_current->tuptable = NULL;
+    if (tuptable == SPI_tuptable)
+        SPI_tuptable = NULL;
+    /* release all memory belonging to tuptable */
+    MemoryContextDelete(tuptable->tuptabcxt);
 }
@@ -1656,6 +1732,8 @@ spi_dest_startup(DestReceiver *self, int operation, TupleDesc typeinfo)
     if (_SPI_current->tuptable != NULL)
         elog(ERROR, "improper call to spi_dest_startup");
+    /* We create the tuple table context as a child of procCxt */
     oldcxt = _SPI_procmem();    /* switch to procedure memory context */
     tuptabcxt = AllocSetContextCreate(CurrentMemoryContext,
@@ -1666,8 +1744,18 @@ spi_dest_startup(DestReceiver *self, int operation, TupleDesc typeinfo)
     MemoryContextSwitchTo(tuptabcxt);
     _SPI_current->tuptable = tuptable = (SPITupleTable *)
-        palloc(sizeof(SPITupleTable));
+        palloc0(sizeof(SPITupleTable));
     tuptable->tuptabcxt = tuptabcxt;
+    tuptable->subid = GetCurrentSubTransactionId();
+    /*
+     * The tuptable is now valid enough to be freed by AtEOSubXact_SPI, so put
+     * it onto the SPI context's tuptables list. This will ensure it's not
+     * leaked even in the unlikely event the following few lines fail.
+     */
+    slist_push_head(&_SPI_current->tuptables, &tuptable->next);
+    /* set up initial allocations */
     tuptable->alloced = tuptable->free = 128;
     tuptable->vals = (HeapTuple *) palloc(tuptable->alloced * sizeof(HeapTuple));
     tuptable->tupdesc = CreateTupleDescCopy(typeinfo);
......
@@ -13,6 +13,7 @@
 #ifndef SPI_H
 #define SPI_H
+#include "lib/ilist.h"
 #include "nodes/parsenodes.h"
 #include "utils/portal.h"
@@ -24,6 +25,8 @@ typedef struct SPITupleTable
     uint32      free;           /* # of free vals */
     TupleDesc   tupdesc;        /* tuple descriptor */
     HeapTuple  *vals;           /* tuples */
+    slist_node  next;           /* link for internal bookkeeping */
+    SubTransactionId subid;     /* subxact in which tuptable was created */
 } SPITupleTable;
 /* Plans are opaque structs for standard users of SPI */
......
@@ -23,8 +23,10 @@ typedef struct
     /* current results */
     uint32      processed;      /* by Executor */
     Oid         lastoid;
-    SPITupleTable *tuptable;
+    SPITupleTable *tuptable;    /* tuptable currently being built */
+    /* resources of this execution context */
+    slist_head  tuptables;      /* list of all live SPITupleTables */
     MemoryContext procCxt;      /* procedure context */
     MemoryContext execCxt;      /* executor context */
     MemoryContext savedcxt;     /* context of SPI_connect's caller */
......
@@ -1202,7 +1202,13 @@ exec_stmt_block(PLpgSQL_execstate *estate, PLpgSQL_stmt_block *block)
              */
             SPI_restore_connection();
-            /* Must clean up the econtext too */
+            /*
+             * Must clean up the econtext too. However, any tuple table made
+             * in the subxact will have been thrown away by SPI during subxact
+             * abort, so we don't need to (and mustn't try to) free the
+             * eval_tuptable.
+             */
+            estate->eval_tuptable = NULL;
             exec_eval_cleanup(estate);
             /* Look for a matching exception handler */
......
@@ -377,8 +377,6 @@ PLy_cursor_iternext(PyObject *self)
     }
     PG_CATCH();
     {
-        SPI_freetuptable(SPI_tuptable);
         PLy_spi_subtransaction_abort(oldcontext, oldowner);
         return NULL;
     }
@@ -461,8 +459,6 @@ PLy_cursor_fetch(PyObject *self, PyObject *args)
     }
     PG_CATCH();
     {
-        SPI_freetuptable(SPI_tuptable);
         PLy_spi_subtransaction_abort(oldcontext, oldowner);
         return NULL;
     }
......
@@ -439,7 +439,6 @@ PLy_spi_execute_fetch_result(SPITupleTable *tuptable, int rows, int status)
     {
         MemoryContextSwitchTo(oldcontext);
         PLy_typeinfo_dealloc(&args);
-        SPI_freetuptable(tuptable);
         Py_DECREF(result);
         PG_RE_THROW();
     }
......