Commit 612a1ab7 authored by Peter Geoghegan's avatar Peter Geoghegan

Add equalimage B-Tree support functions.

Invent the concept of a B-Tree equalimage ("equality implies image
equality") support function, registered as support function 4.  This
indicates whether it is safe (or not safe) to apply optimizations that
assume that any two datums considered equal by an operator class's order
method must be interchangeable without any loss of semantic information.
This is static information about an operator class and a collation.

Register an equalimage routine for almost all of the existing B-Tree
opclasses.  We only need two trivial routines for all of the opclasses
that are included with the core distribution.  There is one routine for
opclasses that index non-collatable types (which returns 'true'
unconditionally), plus another routine for collatable types (which
returns 'true' when the collation is a deterministic collation).

This patch is infrastructure for an upcoming patch that adds B-Tree
deduplication.

Author: Peter Geoghegan, Anastasia Lubennikova
Discussion: https://postgr.es/m/CAH2-Wzn3Ee49Gmxb7V1VJ3-AC8fWn-Fr8pfWQebHe8rYRxt5OQ@mail.gmail.com
parent 4109bb5d
...@@ -207,7 +207,7 @@ ...@@ -207,7 +207,7 @@
<para> <para>
As shown in <xref linkend="xindex-btree-support-table"/>, btree defines As shown in <xref linkend="xindex-btree-support-table"/>, btree defines
one required and two optional support functions. The three one required and three optional support functions. The four
user-defined methods are: user-defined methods are:
</para> </para>
<variablelist> <variablelist>
...@@ -456,6 +456,100 @@ returns bool ...@@ -456,6 +456,100 @@ returns bool
</para> </para>
</listitem> </listitem>
</varlistentry> </varlistentry>
<varlistentry>
<term><function>equalimage</function></term>
<listitem>
<para>
Optionally, a btree operator family may provide
<function>equalimage</function> (<quote>equality implies image
equality</quote>) support functions, registered under support
function number 4. These functions allow the core code to
determine when it is safe to apply the btree deduplication
optimization. Currently, <function>equalimage</function>
functions are only called when building or rebuilding an index.
</para>
<para>
An <function>equalimage</function> function must have the
signature
<synopsis>
equalimage(<replaceable>opcintype</replaceable> <type>oid</type>) returns bool
</synopsis>
The return value is static information about an operator class
and collation. Returning <literal>true</literal> indicates that
the <function>order</function> function for the operator class is
guaranteed to only return <literal>0</literal> (<quote>arguments
are equal</quote>) when its <replaceable>A</replaceable> and
<replaceable>B</replaceable> arguments are also interchangeable
without any loss of semantic information. Not registering an
<function>equalimage</function> function or returning
<literal>false</literal> indicates that this condition cannot be
assumed to hold.
</para>
<para>
The <replaceable>opcintype</replaceable> argument is the
<literal><structname>pg_type</structname>.oid</literal> of the
data type that the operator class indexes. This is a convenience
that allows reuse of the same underlying
<function>equalimage</function> function across operator classes.
If <replaceable>opcintype</replaceable> is a collatable data
type, the appropriate collation OID will be passed to the
<function>equalimage</function> function, using the standard
<function>PG_GET_COLLATION()</function> mechanism.
</para>
<para>
As far as the operator class is concerned, returning
<literal>true</literal> indicates that deduplication is safe (or
safe for the collation whose OID was passed to its
<function>equalimage</function> function). However, the core
code will only deem deduplication safe for an index when
<emphasis>every</emphasis> indexed column uses an operator class
that registers an <function>equalimage</function> function, and
each function actually returns <literal>true</literal> when
called.
</para>
<para>
Image equality is <emphasis>almost</emphasis> the same condition
as simple bitwise equality. There is one subtle difference: When
indexing a varlena data type, the on-disk representation of two
image equal datums may not be bitwise equal due to inconsistent
application of <acronym>TOAST</acronym> compression on input.
Formally, when an operator class's
<function>equalimage</function> function returns
<literal>true</literal>, it is safe to assume that the
<literal>datum_image_eq()</literal> C function will always agree
with the operator class's <function>order</function> function
(provided that the same collation OID is passed to both the
<function>equalimage</function> and <function>order</function>
functions).
</para>
<para>
The core code is fundamentally unable to deduce anything about
the <quote>equality implies image equality</quote> status of an
operator class within a multiple-data-type family based on
details from other operator classes in the same family. Also, it
is not sensible for an operator family to register a cross-type
<function>equalimage</function> function, and attempting to do so
will result in an error. This is because <quote>equality implies
image equality</quote> status does not just depend on
sorting/equality semantics, which are more or less defined at the
operator family level. In general, the semantics that one
particular data type implements must be considered separately.
</para>
<para>
The convention followed by the operator classes included with the
core <productname>PostgreSQL</productname> distribution is to
register a stock, generic <function>equalimage</function>
function. Most operator classes register
<function>btequalimage()</function>, which indicates that
deduplication is safe unconditionally. Operator classes for
collatable data types such as <type>text</type> register
<function>btvarstrequalimage()</function>, which indicates that
deduplication is safe with deterministic collations. Best
practice for third-party extensions is to register their own
custom function to retain control.
</para>
</listitem>
</varlistentry>
</variablelist> </variablelist>
</sect1> </sect1>
......
...@@ -153,9 +153,10 @@ ALTER OPERATOR FAMILY <replaceable>name</replaceable> USING <replaceable class=" ...@@ -153,9 +153,10 @@ ALTER OPERATOR FAMILY <replaceable>name</replaceable> USING <replaceable class="
and hash functions it is not necessary to specify <replaceable and hash functions it is not necessary to specify <replaceable
class="parameter">op_type</replaceable> since the function's input class="parameter">op_type</replaceable> since the function's input
data type(s) are always the correct ones to use. For B-tree sort data type(s) are always the correct ones to use. For B-tree sort
support functions and all functions in GiST, SP-GiST and GIN operator support functions, B-Tree equal image functions, and all
classes, it is necessary to specify the operand data type(s) the function functions in GiST, SP-GiST and GIN operator classes, it is
is to be used with. necessary to specify the operand data type(s) the function is to
be used with.
</para> </para>
<para> <para>
......
...@@ -171,12 +171,14 @@ CREATE OPERATOR CLASS <replaceable class="parameter">name</replaceable> [ DEFAUL ...@@ -171,12 +171,14 @@ CREATE OPERATOR CLASS <replaceable class="parameter">name</replaceable> [ DEFAUL
function is intended to support, if different from function is intended to support, if different from
the input data type(s) of the function (for B-tree comparison functions the input data type(s) of the function (for B-tree comparison functions
and hash functions) and hash functions)
or the class's data type (for B-tree sort support functions and all or the class's data type (for B-tree sort support functions,
functions in GiST, SP-GiST, GIN and BRIN operator classes). These defaults B-tree equal image functions, and all functions in GiST,
are correct, and so <replaceable SP-GiST, GIN and BRIN operator classes). These defaults are
class="parameter">op_type</replaceable> need not be specified in correct, and so <replaceable
<literal>FUNCTION</literal> clauses, except for the case of a B-tree sort class="parameter">op_type</replaceable> need not be specified
support function that is meant to support cross-data-type comparisons. in <literal>FUNCTION</literal> clauses, except for the case of a
B-tree sort support function that is meant to support
cross-data-type comparisons.
</para> </para>
</listitem> </listitem>
</varlistentry> </varlistentry>
......
...@@ -402,7 +402,7 @@ ...@@ -402,7 +402,7 @@
<para> <para>
B-trees require a comparison support function, B-trees require a comparison support function,
and allow two additional support functions to be and allow three additional support functions to be
supplied at the operator class author's option, as shown in <xref supplied at the operator class author's option, as shown in <xref
linkend="xindex-btree-support-table"/>. linkend="xindex-btree-support-table"/>.
The requirements for these support functions are explained further in The requirements for these support functions are explained further in
...@@ -441,6 +441,13 @@ ...@@ -441,6 +441,13 @@
</entry> </entry>
<entry>3</entry> <entry>3</entry>
</row> </row>
<row>
<entry>
Determine if it is safe for indexes that use the operator
class to apply the btree deduplication optimization (optional)
</entry>
<entry>4</entry>
</row>
</tbody> </tbody>
</tgroup> </tgroup>
</table> </table>
...@@ -980,7 +987,8 @@ DEFAULT FOR TYPE int8 USING btree FAMILY integer_ops AS ...@@ -980,7 +987,8 @@ DEFAULT FOR TYPE int8 USING btree FAMILY integer_ops AS
OPERATOR 5 > , OPERATOR 5 > ,
FUNCTION 1 btint8cmp(int8, int8) , FUNCTION 1 btint8cmp(int8, int8) ,
FUNCTION 2 btint8sortsupport(internal) , FUNCTION 2 btint8sortsupport(internal) ,
FUNCTION 3 in_range(int8, int8, int8, boolean, boolean) ; FUNCTION 3 in_range(int8, int8, int8, boolean, boolean) ,
FUNCTION 4 btequalimage(oid) ;
CREATE OPERATOR CLASS int4_ops CREATE OPERATOR CLASS int4_ops
DEFAULT FOR TYPE int4 USING btree FAMILY integer_ops AS DEFAULT FOR TYPE int4 USING btree FAMILY integer_ops AS
...@@ -992,7 +1000,8 @@ DEFAULT FOR TYPE int4 USING btree FAMILY integer_ops AS ...@@ -992,7 +1000,8 @@ DEFAULT FOR TYPE int4 USING btree FAMILY integer_ops AS
OPERATOR 5 > , OPERATOR 5 > ,
FUNCTION 1 btint4cmp(int4, int4) , FUNCTION 1 btint4cmp(int4, int4) ,
FUNCTION 2 btint4sortsupport(internal) , FUNCTION 2 btint4sortsupport(internal) ,
FUNCTION 3 in_range(int4, int4, int4, boolean, boolean) ; FUNCTION 3 in_range(int4, int4, int4, boolean, boolean) ,
FUNCTION 4 btequalimage(oid) ;
CREATE OPERATOR CLASS int2_ops CREATE OPERATOR CLASS int2_ops
DEFAULT FOR TYPE int2 USING btree FAMILY integer_ops AS DEFAULT FOR TYPE int2 USING btree FAMILY integer_ops AS
...@@ -1004,7 +1013,8 @@ DEFAULT FOR TYPE int2 USING btree FAMILY integer_ops AS ...@@ -1004,7 +1013,8 @@ DEFAULT FOR TYPE int2 USING btree FAMILY integer_ops AS
OPERATOR 5 > , OPERATOR 5 > ,
FUNCTION 1 btint2cmp(int2, int2) , FUNCTION 1 btint2cmp(int2, int2) ,
FUNCTION 2 btint2sortsupport(internal) , FUNCTION 2 btint2sortsupport(internal) ,
FUNCTION 3 in_range(int2, int2, int2, boolean, boolean) ; FUNCTION 3 in_range(int2, int2, int2, boolean, boolean) ,
FUNCTION 4 btequalimage(oid) ;
ALTER OPERATOR FAMILY integer_ops USING btree ADD ALTER OPERATOR FAMILY integer_ops USING btree ADD
-- cross-type comparisons int8 vs int2 -- cross-type comparisons int8 vs int2
......
...@@ -20,6 +20,7 @@ ...@@ -20,6 +20,7 @@
#include "access/nbtree.h" #include "access/nbtree.h"
#include "access/reloptions.h" #include "access/reloptions.h"
#include "access/relscan.h" #include "access/relscan.h"
#include "catalog/catalog.h"
#include "commands/progress.h" #include "commands/progress.h"
#include "lib/qunique.h" #include "lib/qunique.h"
#include "miscadmin.h" #include "miscadmin.h"
...@@ -2566,3 +2567,75 @@ _bt_check_third_page(Relation rel, Relation heap, bool needheaptidspace, ...@@ -2566,3 +2567,75 @@ _bt_check_third_page(Relation rel, Relation heap, bool needheaptidspace,
"or use full text indexing."), "or use full text indexing."),
errtableconstraint(heap, RelationGetRelationName(rel)))); errtableconstraint(heap, RelationGetRelationName(rel))));
} }
/*
* Are all attributes in rel "equality is image equality" attributes?
*
* We use each attribute's BTEQUALIMAGE_PROC opclass procedure. If any
* opclass either lacks a BTEQUALIMAGE_PROC procedure or returns false, we
* return false; otherwise we return true.
*
* Returned boolean value is stored in index metapage during index builds.
* Deduplication can only be used when we return true.
*/
bool
_bt_allequalimage(Relation rel, bool debugmessage)
{
bool allequalimage = true;
/* INCLUDE indexes don't support deduplication */
if (IndexRelationGetNumberOfAttributes(rel) !=
IndexRelationGetNumberOfKeyAttributes(rel))
return false;
/*
* There is no special reason why deduplication cannot work with system
* relations (i.e. with system catalog indexes and TOAST indexes). We
* deem deduplication unsafe for these indexes all the same, since the
* alternative is to force users to always use deduplication, without
* being able to opt out. (ALTER INDEX is not supported with system
* indexes, so users would have no way to set the deduplicate_items
* storage parameter to 'off'.)
*/
if (IsSystemRelation(rel))
return false;
for (int i = 0; i < IndexRelationGetNumberOfKeyAttributes(rel); i++)
{
Oid opfamily = rel->rd_opfamily[i];
Oid opcintype = rel->rd_opcintype[i];
Oid collation = rel->rd_indcollation[i];
Oid equalimageproc;
equalimageproc = get_opfamily_proc(opfamily, opcintype, opcintype,
BTEQUALIMAGE_PROC);
/*
* If there is no BTEQUALIMAGE_PROC then deduplication is assumed to
* be unsafe. Otherwise, actually call proc and see what it says.
*/
if (!OidIsValid(equalimageproc) ||
!DatumGetBool(OidFunctionCall1Coll(equalimageproc, collation,
ObjectIdGetDatum(opcintype))))
{
allequalimage = false;
break;
}
}
/*
* Don't elog() until here to avoid reporting on a system relation index
* or an INCLUDE index
*/
if (debugmessage)
{
if (allequalimage)
elog(DEBUG1, "index \"%s\" can safely use deduplication",
RelationGetRelationName(rel));
else
elog(DEBUG1, "index \"%s\" cannot use deduplication",
RelationGetRelationName(rel));
}
return allequalimage;
}
...@@ -104,6 +104,10 @@ btvalidate(Oid opclassoid) ...@@ -104,6 +104,10 @@ btvalidate(Oid opclassoid)
procform->amprocrighttype, procform->amprocrighttype,
BOOLOID, BOOLOID); BOOLOID, BOOLOID);
break; break;
case BTEQUALIMAGE_PROC:
ok = check_amproc_signature(procform->amproc, BOOLOID, true,
1, 1, OIDOID);
break;
default: default:
ereport(INFO, ereport(INFO,
(errcode(ERRCODE_INVALID_OBJECT_DEFINITION), (errcode(ERRCODE_INVALID_OBJECT_DEFINITION),
...@@ -211,8 +215,8 @@ btvalidate(Oid opclassoid) ...@@ -211,8 +215,8 @@ btvalidate(Oid opclassoid)
/* /*
* Complain if there seems to be an incomplete set of either operators * Complain if there seems to be an incomplete set of either operators
* or support functions for this datatype pair. The only things * or support functions for this datatype pair. The sortsupport,
* considered optional are the sortsupport and in_range functions. * in_range, and equalimage functions are considered optional.
*/ */
if (thisgroup->operatorset != if (thisgroup->operatorset !=
((1 << BTLessStrategyNumber) | ((1 << BTLessStrategyNumber) |
......
...@@ -1143,9 +1143,10 @@ assignProcTypes(OpFamilyMember *member, Oid amoid, Oid typeoid) ...@@ -1143,9 +1143,10 @@ assignProcTypes(OpFamilyMember *member, Oid amoid, Oid typeoid)
/* /*
* btree comparison procs must be 2-arg procs returning int4. btree * btree comparison procs must be 2-arg procs returning int4. btree
* sortsupport procs must take internal and return void. btree in_range * sortsupport procs must take internal and return void. btree in_range
* procs must be 5-arg procs returning bool. hash support proc 1 must be * procs must be 5-arg procs returning bool. btree equalimage procs must
* a 1-arg proc returning int4, while proc 2 must be a 2-arg proc * take 1 arg and return bool. hash support proc 1 must be a 1-arg proc
* returning int8. Otherwise we don't know. * returning int4, while proc 2 must be a 2-arg proc returning int8.
* Otherwise we don't know.
*/ */
if (amoid == BTREE_AM_OID) if (amoid == BTREE_AM_OID)
{ {
...@@ -1205,6 +1206,29 @@ assignProcTypes(OpFamilyMember *member, Oid amoid, Oid typeoid) ...@@ -1205,6 +1206,29 @@ assignProcTypes(OpFamilyMember *member, Oid amoid, Oid typeoid)
if (!OidIsValid(member->righttype)) if (!OidIsValid(member->righttype))
member->righttype = procform->proargtypes.values[2]; member->righttype = procform->proargtypes.values[2];
} }
else if (member->number == BTEQUALIMAGE_PROC)
{
if (procform->pronargs != 1)
ereport(ERROR,
(errcode(ERRCODE_INVALID_OBJECT_DEFINITION),
errmsg("btree equal image functions must have one argument")));
if (procform->prorettype != BOOLOID)
ereport(ERROR,
(errcode(ERRCODE_INVALID_OBJECT_DEFINITION),
errmsg("btree equal image functions must return boolean")));
/*
* pg_amproc functions are indexed by (lefttype, righttype), but
* an equalimage function can only be called at CREATE INDEX time.
* The same opclass opcintype OID is always used for leftype and
* righttype. Providing a cross-type routine isn't sensible.
* Reject cross-type ALTER OPERATOR FAMILY ... ADD FUNCTION 4
* statements here.
*/
if (member->lefttype != member->righttype)
ereport(ERROR,
(errcode(ERRCODE_INVALID_OBJECT_DEFINITION),
errmsg("btree equal image functions must not be cross-type")));
}
} }
else if (amoid == HASH_AM_OID) else if (amoid == HASH_AM_OID)
{ {
......
...@@ -44,6 +44,7 @@ ...@@ -44,6 +44,7 @@
#include "access/detoast.h" #include "access/detoast.h"
#include "fmgr.h" #include "fmgr.h"
#include "utils/builtins.h"
#include "utils/datum.h" #include "utils/datum.h"
#include "utils/expandeddatum.h" #include "utils/expandeddatum.h"
...@@ -323,6 +324,31 @@ datum_image_eq(Datum value1, Datum value2, bool typByVal, int typLen) ...@@ -323,6 +324,31 @@ datum_image_eq(Datum value1, Datum value2, bool typByVal, int typLen)
return result; return result;
} }
/*-------------------------------------------------------------------------
* btequalimage
*
* Generic "equalimage" support function.
*
* B-Tree operator classes whose equality function could safely be replaced by
* datum_image_eq() in all cases can use this as their "equalimage" support
* function.
*
* Currently, we unconditionally assume that any B-Tree operator class that
* registers btequalimage as its support function 4 must be able to safely use
* optimizations like deduplication (i.e. we return true unconditionally). If
* it ever proved necessary to rescind support for an operator class, we could
* do that in a targeted fashion by doing something with the opcintype
* argument.
*-------------------------------------------------------------------------
*/
Datum
btequalimage(PG_FUNCTION_ARGS)
{
/* Oid opcintype = PG_GETARG_OID(0); */
PG_RETURN_BOOL(true);
}
/*------------------------------------------------------------------------- /*-------------------------------------------------------------------------
* datumEstimateSpace * datumEstimateSpace
* *
......
...@@ -2783,6 +2783,26 @@ varstr_abbrev_abort(int memtupcount, SortSupport ssup) ...@@ -2783,6 +2783,26 @@ varstr_abbrev_abort(int memtupcount, SortSupport ssup)
return true; return true;
} }
/*
* Generic equalimage support function for character type's operator classes.
* Disables the use of deduplication with nondeterministic collations.
*/
Datum
btvarstrequalimage(PG_FUNCTION_ARGS)
{
/* Oid opcintype = PG_GETARG_OID(0); */
Oid collid = PG_GET_COLLATION();
check_collation_set(collid);
if (lc_collate_is_c(collid) ||
collid == DEFAULT_COLLATION_OID ||
get_collation_isdeterministic(collid))
PG_RETURN_BOOL(true);
else
PG_RETURN_BOOL(false);
}
Datum Datum
text_larger(PG_FUNCTION_ARGS) text_larger(PG_FUNCTION_ARGS)
{ {
......
...@@ -522,7 +522,8 @@ my %tests = ( ...@@ -522,7 +522,8 @@ my %tests = (
OPERATOR 4 >=(bigint,int4), OPERATOR 4 >=(bigint,int4),
OPERATOR 5 >(bigint,int4), OPERATOR 5 >(bigint,int4),
FUNCTION 1 (int4, int4) btint4cmp(int4,int4), FUNCTION 1 (int4, int4) btint4cmp(int4,int4),
FUNCTION 2 (int4, int4) btint4sortsupport(internal);', FUNCTION 2 (int4, int4) btint4sortsupport(internal),
FUNCTION 4 (int4, int4) btequalimage(oid);',
regexp => qr/^ regexp => qr/^
\QALTER OPERATOR FAMILY dump_test.op_family USING btree ADD\E\n\s+ \QALTER OPERATOR FAMILY dump_test.op_family USING btree ADD\E\n\s+
\QOPERATOR 1 <(bigint,integer) ,\E\n\s+ \QOPERATOR 1 <(bigint,integer) ,\E\n\s+
...@@ -531,7 +532,8 @@ my %tests = ( ...@@ -531,7 +532,8 @@ my %tests = (
\QOPERATOR 4 >=(bigint,integer) ,\E\n\s+ \QOPERATOR 4 >=(bigint,integer) ,\E\n\s+
\QOPERATOR 5 >(bigint,integer) ,\E\n\s+ \QOPERATOR 5 >(bigint,integer) ,\E\n\s+
\QFUNCTION 1 (integer, integer) btint4cmp(integer,integer) ,\E\n\s+ \QFUNCTION 1 (integer, integer) btint4cmp(integer,integer) ,\E\n\s+
\QFUNCTION 2 (integer, integer) btint4sortsupport(internal);\E \QFUNCTION 2 (integer, integer) btint4sortsupport(internal) ,\E\n\s+
\QFUNCTION 4 (integer, integer) btequalimage(oid);\E
/xm, /xm,
like => like =>
{ %full_runs, %dump_test_schema_runs, section_pre_data => 1, }, { %full_runs, %dump_test_schema_runs, section_pre_data => 1, },
...@@ -1554,7 +1556,8 @@ my %tests = ( ...@@ -1554,7 +1556,8 @@ my %tests = (
OPERATOR 4 >=(bigint,bigint), OPERATOR 4 >=(bigint,bigint),
OPERATOR 5 >(bigint,bigint), OPERATOR 5 >(bigint,bigint),
FUNCTION 1 btint8cmp(bigint,bigint), FUNCTION 1 btint8cmp(bigint,bigint),
FUNCTION 2 btint8sortsupport(internal);', FUNCTION 2 btint8sortsupport(internal),
FUNCTION 4 btequalimage(oid);',
regexp => qr/^ regexp => qr/^
\QCREATE OPERATOR CLASS dump_test.op_class\E\n\s+ \QCREATE OPERATOR CLASS dump_test.op_class\E\n\s+
\QFOR TYPE bigint USING btree FAMILY dump_test.op_family AS\E\n\s+ \QFOR TYPE bigint USING btree FAMILY dump_test.op_family AS\E\n\s+
...@@ -1564,7 +1567,8 @@ my %tests = ( ...@@ -1564,7 +1567,8 @@ my %tests = (
\QOPERATOR 4 >=(bigint,bigint) ,\E\n\s+ \QOPERATOR 4 >=(bigint,bigint) ,\E\n\s+
\QOPERATOR 5 >(bigint,bigint) ,\E\n\s+ \QOPERATOR 5 >(bigint,bigint) ,\E\n\s+
\QFUNCTION 1 (bigint, bigint) btint8cmp(bigint,bigint) ,\E\n\s+ \QFUNCTION 1 (bigint, bigint) btint8cmp(bigint,bigint) ,\E\n\s+
\QFUNCTION 2 (bigint, bigint) btint8sortsupport(internal);\E \QFUNCTION 2 (bigint, bigint) btint8sortsupport(internal) ,\E\n\s+
\QFUNCTION 4 (bigint, bigint) btequalimage(oid);\E
/xm, /xm,
like => like =>
{ %full_runs, %dump_test_schema_runs, section_pre_data => 1, }, { %full_runs, %dump_test_schema_runs, section_pre_data => 1, },
......
...@@ -387,12 +387,17 @@ typedef struct BTMetaPageData ...@@ -387,12 +387,17 @@ typedef struct BTMetaPageData
* an operator class may choose to offer a third amproc procedure * an operator class may choose to offer a third amproc procedure
* (BTINRANGE_PROC), independently of whether it offers sortsupport. * (BTINRANGE_PROC), independently of whether it offers sortsupport.
* For full details, see doc/src/sgml/btree.sgml. * For full details, see doc/src/sgml/btree.sgml.
*
* To facilitate B-Tree deduplication, an operator class may choose to
* offer a forth amproc procedure (BTEQUALIMAGE_PROC). For full details,
* see doc/src/sgml/btree.sgml.
*/ */
#define BTORDER_PROC 1 #define BTORDER_PROC 1
#define BTSORTSUPPORT_PROC 2 #define BTSORTSUPPORT_PROC 2
#define BTINRANGE_PROC 3 #define BTINRANGE_PROC 3
#define BTNProcs 3 #define BTEQUALIMAGE_PROC 4
#define BTNProcs 4
/* /*
* We need to be able to tell the difference between read and write * We need to be able to tell the difference between read and write
...@@ -829,6 +834,7 @@ extern bool _bt_check_natts(Relation rel, bool heapkeyspace, Page page, ...@@ -829,6 +834,7 @@ extern bool _bt_check_natts(Relation rel, bool heapkeyspace, Page page,
OffsetNumber offnum); OffsetNumber offnum);
extern void _bt_check_third_page(Relation rel, Relation heap, extern void _bt_check_third_page(Relation rel, Relation heap,
bool needheaptidspace, Page page, IndexTuple newtup); bool needheaptidspace, Page page, IndexTuple newtup);
extern bool _bt_allequalimage(Relation rel, bool debugmessage);
/* /*
* prototypes for functions in nbtvalidate.c * prototypes for functions in nbtvalidate.c
......
...@@ -53,6 +53,6 @@ ...@@ -53,6 +53,6 @@
*/ */
/* yyyymmddN */ /* yyyymmddN */
#define CATALOG_VERSION_NO 202002191 #define CATALOG_VERSION_NO 202002261
#endif #endif
This diff is collapsed.
...@@ -1013,6 +1013,9 @@ ...@@ -1013,6 +1013,9 @@
{ oid => '3255', descr => 'sort support', { oid => '3255', descr => 'sort support',
proname => 'bttextsortsupport', prorettype => 'void', proname => 'bttextsortsupport', prorettype => 'void',
proargtypes => 'internal', prosrc => 'bttextsortsupport' }, proargtypes => 'internal', prosrc => 'bttextsortsupport' },
{ oid => '8505', descr => 'equal image',
proname => 'btvarstrequalimage', prorettype => 'bool', proargtypes => 'oid',
prosrc => 'btvarstrequalimage' },
{ oid => '377', descr => 'less-equal-greater', { oid => '377', descr => 'less-equal-greater',
proname => 'cash_cmp', proleakproof => 't', prorettype => 'int4', proname => 'cash_cmp', proleakproof => 't', prorettype => 'int4',
proargtypes => 'money money', prosrc => 'cash_cmp' }, proargtypes => 'money money', prosrc => 'cash_cmp' },
...@@ -9483,6 +9486,9 @@ ...@@ -9483,6 +9486,9 @@
{ oid => '3187', descr => 'less-equal-greater based on byte images', { oid => '3187', descr => 'less-equal-greater based on byte images',
proname => 'btrecordimagecmp', prorettype => 'int4', proname => 'btrecordimagecmp', prorettype => 'int4',
proargtypes => 'record record', prosrc => 'btrecordimagecmp' }, proargtypes => 'record record', prosrc => 'btrecordimagecmp' },
{ oid => '8506', descr => 'equal image',
proname => 'btequalimage', prorettype => 'bool', proargtypes => 'oid',
prosrc => 'btequalimage' },
# Extensions # Extensions
{ oid => '3082', descr => 'list available extensions', { oid => '3082', descr => 'list available extensions',
......
...@@ -354,9 +354,9 @@ ERROR: invalid operator number 0, must be between 1 and 5 ...@@ -354,9 +354,9 @@ ERROR: invalid operator number 0, must be between 1 and 5
ALTER OPERATOR FAMILY alt_opf4 USING btree ADD OPERATOR 1 < ; -- operator without argument types ALTER OPERATOR FAMILY alt_opf4 USING btree ADD OPERATOR 1 < ; -- operator without argument types
ERROR: operator argument types must be specified in ALTER OPERATOR FAMILY ERROR: operator argument types must be specified in ALTER OPERATOR FAMILY
ALTER OPERATOR FAMILY alt_opf4 USING btree ADD FUNCTION 0 btint42cmp(int4, int2); -- function number should be between 1 and 5 ALTER OPERATOR FAMILY alt_opf4 USING btree ADD FUNCTION 0 btint42cmp(int4, int2); -- function number should be between 1 and 5
ERROR: invalid function number 0, must be between 1 and 3 ERROR: invalid function number 0, must be between 1 and 4
ALTER OPERATOR FAMILY alt_opf4 USING btree ADD FUNCTION 6 btint42cmp(int4, int2); -- function number should be between 1 and 5 ALTER OPERATOR FAMILY alt_opf4 USING btree ADD FUNCTION 6 btint42cmp(int4, int2); -- function number should be between 1 and 5
ERROR: invalid function number 6, must be between 1 and 3 ERROR: invalid function number 6, must be between 1 and 4
ALTER OPERATOR FAMILY alt_opf4 USING btree ADD STORAGE invalid_storage; -- Ensure STORAGE is not a part of ALTER OPERATOR FAMILY ALTER OPERATOR FAMILY alt_opf4 USING btree ADD STORAGE invalid_storage; -- Ensure STORAGE is not a part of ALTER OPERATOR FAMILY
ERROR: STORAGE cannot be specified in ALTER OPERATOR FAMILY ERROR: STORAGE cannot be specified in ALTER OPERATOR FAMILY
DROP OPERATOR FAMILY alt_opf4 USING btree; DROP OPERATOR FAMILY alt_opf4 USING btree;
...@@ -493,6 +493,10 @@ ALTER OPERATOR FAMILY alt_opf18 USING btree ADD ...@@ -493,6 +493,10 @@ ALTER OPERATOR FAMILY alt_opf18 USING btree ADD
OPERATOR 4 >= (int4, int2) , OPERATOR 4 >= (int4, int2) ,
OPERATOR 5 > (int4, int2) , OPERATOR 5 > (int4, int2) ,
FUNCTION 1 btint42cmp(int4, int2); FUNCTION 1 btint42cmp(int4, int2);
-- Should fail. Not allowed to have cross-type equalimage function.
ALTER OPERATOR FAMILY alt_opf18 USING btree
ADD FUNCTION 4 (int4, int2) btequalimage(oid);
ERROR: btree equal image functions must not be cross-type
ALTER OPERATOR FAMILY alt_opf18 USING btree DROP FUNCTION 2 (int4, int4); ALTER OPERATOR FAMILY alt_opf18 USING btree DROP FUNCTION 2 (int4, int4);
ERROR: function 2(integer,integer) does not exist in operator family "alt_opf18" ERROR: function 2(integer,integer) does not exist in operator family "alt_opf18"
DROP OPERATOR FAMILY alt_opf18 USING btree; DROP OPERATOR FAMILY alt_opf18 USING btree;
......
...@@ -2111,6 +2111,42 @@ WHERE p1.amproc = p2.oid AND ...@@ -2111,6 +2111,42 @@ WHERE p1.amproc = p2.oid AND
--------------+--------+-------- --------------+--------+--------
(0 rows) (0 rows)
-- Almost all of the core distribution's Btree opclasses can use one of the
-- two generic "equalimage" functions as their support function 4. Look for
-- opclasses that don't allow deduplication unconditionally here.
--
-- Newly added Btree opclasses don't have to support deduplication. It will
-- usually be trivial to add support, though. Note that the expected output
-- of this part of the test will need to be updated when a new opclass cannot
-- support deduplication (by using btequalimage).
SELECT amp.amproc::regproc AS proc, opf.opfname AS opfamily_name,
opc.opcname AS opclass_name, opc.opcintype::regtype AS opcintype
FROM pg_am AS am
JOIN pg_opclass AS opc ON opc.opcmethod = am.oid
JOIN pg_opfamily AS opf ON opc.opcfamily = opf.oid
LEFT JOIN pg_amproc AS amp ON amp.amprocfamily = opf.oid AND
amp.amproclefttype = opc.opcintype AND amp.amprocnum = 4
WHERE am.amname = 'btree' AND
amp.amproc IS DISTINCT FROM 'btequalimage'::regproc
ORDER BY 1, 2, 3;
proc | opfamily_name | opclass_name | opcintype
--------------------+------------------+------------------+------------------
btvarstrequalimage | bpchar_ops | bpchar_ops | character
btvarstrequalimage | text_ops | name_ops | name
btvarstrequalimage | text_ops | text_ops | text
btvarstrequalimage | text_ops | varchar_ops | text
| array_ops | array_ops | anyarray
| float_ops | float4_ops | real
| float_ops | float8_ops | double precision
| jsonb_ops | jsonb_ops | jsonb
| numeric_ops | numeric_ops | numeric
| range_ops | range_ops | anyrange
| record_image_ops | record_image_ops | record
| record_ops | record_ops | record
| tsquery_ops | tsquery_ops | tsquery
| tsvector_ops | tsvector_ops | tsvector
(14 rows)
-- **************** pg_index **************** -- **************** pg_index ****************
-- Look for illegal values in pg_index fields. -- Look for illegal values in pg_index fields.
SELECT p1.indexrelid, p1.indrelid SELECT p1.indexrelid, p1.indrelid
......
...@@ -430,6 +430,9 @@ ALTER OPERATOR FAMILY alt_opf18 USING btree ADD ...@@ -430,6 +430,9 @@ ALTER OPERATOR FAMILY alt_opf18 USING btree ADD
OPERATOR 4 >= (int4, int2) , OPERATOR 4 >= (int4, int2) ,
OPERATOR 5 > (int4, int2) , OPERATOR 5 > (int4, int2) ,
FUNCTION 1 btint42cmp(int4, int2); FUNCTION 1 btint42cmp(int4, int2);
-- Should fail. Not allowed to have cross-type equalimage function.
ALTER OPERATOR FAMILY alt_opf18 USING btree
ADD FUNCTION 4 (int4, int2) btequalimage(oid);
ALTER OPERATOR FAMILY alt_opf18 USING btree DROP FUNCTION 2 (int4, int4); ALTER OPERATOR FAMILY alt_opf18 USING btree DROP FUNCTION 2 (int4, int4);
DROP OPERATOR FAMILY alt_opf18 USING btree; DROP OPERATOR FAMILY alt_opf18 USING btree;
......
...@@ -1323,6 +1323,24 @@ WHERE p1.amproc = p2.oid AND ...@@ -1323,6 +1323,24 @@ WHERE p1.amproc = p2.oid AND
p1.amproclefttype != p1.amprocrighttype AND p1.amproclefttype != p1.amprocrighttype AND
p2.provolatile = 'v'; p2.provolatile = 'v';
-- Almost all of the core distribution's Btree opclasses can use one of the
-- two generic "equalimage" functions as their support function 4. Look for
-- opclasses that don't allow deduplication unconditionally here.
--
-- Newly added Btree opclasses don't have to support deduplication. It will
-- usually be trivial to add support, though. Note that the expected output
-- of this part of the test will need to be updated when a new opclass cannot
-- support deduplication (by using btequalimage).
SELECT amp.amproc::regproc AS proc, opf.opfname AS opfamily_name,
opc.opcname AS opclass_name, opc.opcintype::regtype AS opcintype
FROM pg_am AS am
JOIN pg_opclass AS opc ON opc.opcmethod = am.oid
JOIN pg_opfamily AS opf ON opc.opcfamily = opf.oid
LEFT JOIN pg_amproc AS amp ON amp.amprocfamily = opf.oid AND
amp.amproclefttype = opc.opcintype AND amp.amprocnum = 4
WHERE am.amname = 'btree' AND
amp.amproc IS DISTINCT FROM 'btequalimage'::regproc
ORDER BY 1, 2, 3;
-- **************** pg_index **************** -- **************** pg_index ****************
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment