Commits · e6ae3b5dbf2c07bceb737c5a0ff199b1156051d1 · Abuhujair Javed / Postgres FD Implementation

21 Oct, 2008 1 commit

Add a concept of "placeholder" variables to the planner. These are variables · e6ae3b5d

Tom Lane authored 16 years ago

that represent some expression that we desire to compute below the top level
of the plan, and then let that value "bubble up" as though it were a plain
Var (ie, a column value).

The immediate application is to allow sub-selects to be flattened even when
they are below an outer join and have non-nullable output expressions.
Formerly we couldn't flatten because such an expression wouldn't properly
go to NULL when evaluated above the outer join. Now, we wrap it in a
PlaceHolderVar and arrange for the actual evaluation to occur below the outer
join. When the resulting Var bubbles up through the join, it will be set to
NULL if necessary, yielding the correct results. This fixes a planner
limitation that's existed since 7.1.

In future we might want to use this mechanism to re-introduce some form of
Hellerstein's "expensive functions" optimization, ie place the evaluation of
an expensive function at the most suitable point in the plan tree.

e6ae3b5d
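
The outer-join problem this commit describes can be illustrated with a small Python model. This is only a sketch, not PostgreSQL code: rows are dicts, `None` stands for SQL NULL, and `left_join` and the `flag` column are hypothetical names. Evaluating a non-nullable expression above the join gives a value even for unmatched rows, while evaluating it below the join lets the join null it out.

```python
# Illustrative model only: rows are dicts, None stands for SQL NULL.

def left_join(outer, inner, key):
    """LEFT JOIN outer to inner on `key`; unmatched outer rows get NULL inner columns."""
    inner_cols = sorted({c for row in inner for c in row if c != key})
    out = []
    for o in outer:
        matches = [i for i in inner if i[key] == o[key]]
        if matches:
            for i in matches:
                out.append({**o, **{c: i.get(c) for c in inner_cols}})
        else:
            out.append({**o, **{c: None for c in inner_cols}})
    return out

a = [{"id": 1}, {"id": 2}]
b = [{"id": 1}]

# Wrong: flatten the sub-select and evaluate the constant above the join.
# The row for id 2 has no match in b, yet it still gets flag = 42.
naive = [{**row, "flag": 42} for row in left_join(a, b, "id")]

# Right: evaluate the expression below the join (the placeholder idea),
# so the join itself sets it to NULL for unmatched rows.
b_with_flag = [{**row, "flag": 42} for row in b]
correct = left_join(a, b_with_flag, "id")
```

In this model `naive` reports `flag = 42` for the unmatched row with `id = 2`, whereas `correct` reports `None` there, which is the behaviour the commit message asks for.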

07 Oct, 2008 1 commit

Extend CTE patch to support recursive UNION (ie, without ALL). The · 0d115dde

Tom Lane authored 16 years ago

implementation uses an in-memory hash table, so it will poop out for very
large recursive results ... but the performance characteristics of a
sort-based implementation would be pretty unpleasant too.

0d115dde
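
As a rough illustration of duplicate elimination with an in-memory hash table, the following Python sketch evaluates a recursive query iteratively. It is a simplified model under stated assumptions, not the executor's code: `recursive_union`, `step`, and the `edges` graph are hypothetical names, and rows are plain tuples.

```python
def recursive_union(seed_rows, step, distinct=True):
    """Evaluate a recursive query.

    seed_rows is the non-recursive term; step(working) returns the rows the
    recursive term produces from the current working set.
    """
    seen = set()      # in-memory hash table of every row emitted so far
    result = []
    working = []
    for row in seed_rows:
        if not distinct or row not in seen:
            seen.add(row)
            result.append(row)
            working.append(row)
    while working:
        next_working = []
        for row in step(working):
            if distinct and row in seen:
                continue  # duplicate: drop it, which also stops cycles
            seen.add(row)
            result.append(row)
            next_working.append(row)
        working = next_working
    return result

edges = {1: [2], 2: [3], 3: [1]}  # a graph containing a cycle
reachable = recursive_union(
    [(1,)],
    lambda working: [(n,) for (v,) in working for n in edges[v]],
)
```

Because every emitted row stays in `seen`, memory grows with the size of the result, which is the limitation the commit message mentions. With `distinct=False` (the UNION ALL case) the cyclic graph above would never terminate.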

06 Oct, 2008 1 commit

When expanding a whole-row Var into a RowExpr during ResolveNew(), attach · bf461538

Tom Lane authored 16 years ago

the column alias names of the RTE referenced by the Var to the RowExpr.
This is needed to allow ruleutils.c to correctly deparse FieldSelect nodes
referencing such a construct. Per my recent bug report.

Adding a field to RowExpr forces initdb (because of stored rules changes)
so this solution is not back-patchable; which is unfortunate because 8.2
and 8.3 have this issue. But it only affects EXPLAIN for some pretty odd
corner cases, so we can probably live without a solution for the back
branches.

bf461538

04 Oct, 2008 1 commit

Implement SQL-standard WITH clauses, including WITH RECURSIVE. · 44d5be0e

Tom Lane authored 16 years ago

There are some unimplemented aspects: recursive queries must use UNION ALL
(should allow UNION too), and we don't have SEARCH or CYCLE clauses.
These might or might not get done for 8.4, but even without them it's a
pretty useful feature.

There are also a couple of small loose ends and definitional quibbles,
which I'll send a memo about to pgsql-hackers shortly.  But let's land
the patch now so we can get on with other development.

Yoshiyuki Asaba, with lots of help from Tatsuo Ishii and Tom Lane

44d5be0e

28 Aug, 2008 1 commit

Extend the parser location infrastructure to include a location field in · a2794623

Tom Lane authored 16 years ago

most node types used in expression trees (both before and after parse
analysis). This allows us to place an error cursor in many situations
where we formerly could not, because the information wasn't available
beyond the very first level of parse analysis. There's a fair amount
of work still to be done to persuade individual ereport() calls to actually
include an error location, but this gets the initdb-forcing part of the
work out of the way; and the situation is already markedly better than
before for complaints about unimplementable implicit casts, such as
CASE and UNION constructs with incompatible alternative data types.
Per my proposal of a few days ago.

a2794623

25 Aug, 2008 1 commit

Move exprType(), exprTypmod(), expression_tree_walker(), and related routines · e5536e77

Tom Lane authored 16 years ago

into nodes/nodeFuncs, so as to reduce wanton cross-subsystem #includes inside
the backend.  There's probably more that should be done along this line,
but this is a start anyway.

e5536e77

14 Aug, 2008 1 commit

Implement SEMI and ANTI joins in the planner and executor. (Semijoins replace · e006a24a

Tom Lane authored 16 years ago

the old JOIN_IN code, but antijoins are new functionality.) Teach the planner
to convert appropriate EXISTS and NOT EXISTS subqueries into semi and anti
joins respectively. Also, LEFT JOINs with suitable upper-level IS NULL
filters are recognized as being anti joins. Unify the InClauseInfo and
OuterJoinInfo infrastructure into "SpecialJoinInfo". With that change,
it becomes possible to associate a SpecialJoinInfo with every join attempt,
which permits some cleanup of join selectivity estimation. That needs to be
taken much further than this patch does, but the next step is to change the
API for oprjoin selectivity functions, which seems like material for a
separate patch. So for the moment the output size estimates for semi and
especially anti joins are quite bogus.

e006a24a
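
For readers unfamiliar with the join types, here is a minimal Python sketch of semi and anti join semantics. It ignores NULL handling and is not how the executor implements them; `semi_join`, `anti_join`, and the sample tables are hypothetical.

```python
def semi_join(outer, inner, outer_key, inner_key):
    """Rows of `outer` with at least one match in `inner` (EXISTS).

    Each outer row is emitted at most once, however many inner rows match.
    """
    inner_keys = {inner_key(r) for r in inner}
    return [r for r in outer if outer_key(r) in inner_keys]

def anti_join(outer, inner, outer_key, inner_key):
    """Rows of `outer` with no match in `inner` (NOT EXISTS)."""
    inner_keys = {inner_key(r) for r in inner}
    return [r for r in outer if outer_key(r) not in inner_keys]

customers = [(1, "ann"), (2, "bob"), (3, "cy")]
orders = [(10, 1), (11, 1), (12, 3)]  # (order_id, customer_id)

with_orders = semi_join(customers, orders, lambda c: c[0], lambda o: o[1])
without_orders = anti_join(customers, orders, lambda c: c[0], lambda o: o[1])
```

Customer 1 has two orders but appears once in `with_orders`; that is the difference from an ordinary inner join. `without_orders` is the same set a LEFT JOIN with an upper-level IS NULL filter on the inner side would return.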

07 Aug, 2008 3 commits

Improve INTERSECT/EXCEPT hashing by realizing that we don't need to make any · af95d7aa

Tom Lane authored 16 years ago

hashtable entries for tuples that are found only in the second input: they
can never contribute to the output. Furthermore, this implies that the
planner should endeavor to put first the smaller (in number of groups) input
relation for an INTERSECT. Implement that, and upgrade prepunion's estimation
of the number of rows returned by setops so that there's some amount of sanity
in the estimate of which one is smaller.

af95d7aa
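
The observation in this commit can be sketched in Python as follows. This is an illustrative model, not the executor's hash set-op node: `hashed_intersect` is a hypothetical helper that also returns the hash table size so the effect of input order is visible.

```python
def hashed_intersect(first, second):
    """INTERSECT (with duplicate elimination) using a hash table.

    Entries are created only for groups in `first`; rows found only in
    `second` can never be output, so they never get an entry.
    Returns (result_rows, number_of_hash_entries).
    """
    table = {}  # group -> True once the group has been seen in `second`
    for row in first:
        table.setdefault(row, False)
    for row in second:
        if row in table:
            table[row] = True
    return [row for row, matched in table.items() if matched], len(table)

small = [2, 3]
big = list(range(1000))

small_first = hashed_intersect(small, big)
big_first = hashed_intersect(big, small)
```

Both orders give the same rows, but putting the input with fewer groups first keeps the hash table at 2 entries instead of 1000, which is why the planner wants a sane estimate of which input is smaller.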

Support hashing for duplicate-elimination in INTERSECT and EXCEPT queries. · 368df304

Tom Lane authored 16 years ago

This completes my project of improving usage of hashing for duplicate
elimination (aggregate functions with DISTINCT remain undone, but that's
for some other day).

As with the previous patches, this means we can INTERSECT/EXCEPT on datatypes
that can hash but not sort, and it means that INTERSECT/EXCEPT without ORDER
BY are no longer certain to produce sorted output.

368df304

Teach the system how to use hashing for UNION. (INTERSECT/EXCEPT will follow, · 2d1d96b1

Tom Lane authored 16 years ago

but seem like a separate patch since most of the remaining work is on the
executor side.) I took the opportunity to push selection of the grouping
operators for set operations into the parser where it belongs. Otherwise this
is just a small exercise in making prepunion.c consider both alternatives.

As with the recent DISTINCT patch, this means we can UNION on datatypes that
can hash but not sort, and it means that UNION without ORDER BY is no longer
certain to produce sorted output.

2d1d96b1
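
Both consequences noted above (types that hash but do not sort, and unsorted output) can be seen in a small Python analogy. Python's `complex` type happens to be hashable but not orderable, which makes it a convenient stand-in; `sort_unique` and `hash_unique` are hypothetical helpers, not planner functions.

```python
def sort_unique(rows):
    """Duplicate elimination by sorting, then dropping adjacent equal rows."""
    out = []
    for row in sorted(rows):
        if not out or out[-1] != row:
            out.append(row)
    return out

def hash_unique(rows):
    """Duplicate elimination by hashing; output keeps first-seen order."""
    return list(dict.fromkeys(rows))

rows = [1 + 2j, 3 - 1j, 1 + 2j]  # complex numbers hash but have no ordering

try:
    sort_unique(rows)
    sort_failed = False
except TypeError:
    sort_failed = True           # the sort-based strategy cannot handle this type

deduped = hash_unique(rows)
unsorted_output = hash_unique([3, 1, 3, 2])
```

The sort-based path fails on the unorderable type while the hash-based path succeeds, and `unsorted_output` comes back in arrival order rather than sorted order, so callers that need ordering must ask for it explicitly.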

02 Aug, 2008 1 commit

Rearrange the querytree representation of ORDER BY/GROUP BY/DISTINCT items · 95113047

Tom Lane authored 16 years ago

as per my recent proposal:

1. Fold SortClause and GroupClause into a single node type SortGroupClause.
We were already relying on them to be struct-equivalent, so using two node
tags wasn't accomplishing much except to get in the way of comparing items
with equal().

2. Add an "eqop" field to SortGroupClause to carry the associated equality
operator. This is cheap for the parser to get at the same time it's looking
up the sort operator, and storing it eliminates the need for repeated
not-so-cheap lookups during planning. In future this will also let us
represent GROUP/DISTINCT operations on datatypes that have hash opclasses
but no btree opclasses (ie, they have equality but no natural sort order).
The previous representation simply didn't work for that, since its only
indicator of comparison semantics was a sort operator.

3. Add a hasDistinctOn boolean to struct Query to explicitly record whether
the distinctClause came from DISTINCT or DISTINCT ON. This allows removing
some complicated and not 100% bulletproof code that attempted to figure
that out from the distinctClause alone.

This patch doesn't in itself create any new capability, but it's necessary
infrastructure for future attempts to use hash-based grouping for DISTINCT
and UNION/INTERSECT/EXCEPT.

95113047

31 Jul, 2008 1 commit

Fix parser so that we don't modify the user-written ORDER BY list in order · 63247bec

Tom Lane authored 16 years ago

to represent DISTINCT or DISTINCT ON. This gets rid of a longstanding
annoyance that a view or rule using SELECT DISTINCT will be dumped out
with an overspecified ORDER BY list, and is one small step along the way
to decoupling DISTINCT and ORDER BY enough so that hash-based implementation
of DISTINCT will be possible. In passing, improve transformDistinctClause
so that it doesn't reject duplicate DISTINCT ON items, as was reported by
Steve Midgley a couple weeks ago.

63247bec

19 Jun, 2008 1 commit

Improve our #include situation by moving pointer types away from the · a3540b0f

Alvaro Herrera authored 16 years ago

corresponding struct definitions. This allows other headers to avoid including
certain highly-loaded headers such as rel.h and relscan.h, instead using just
relcache.h, heapam.h or genam.h, which are more lightweight and thus cause less
unnecessary dependencies.

a3540b0f

01 Jan, 2008 1 commit
- Update copyrights in source tree to 2008. · 9098ab9e
  Bruce Momjian authored 17 years ago
  
  9098ab9e
15 Nov, 2007 1 commit
- pgindent run for 8.3. · fdf5a5ef
  Bruce Momjian authored 17 years ago
  
  fdf5a5ef
22 Oct, 2007 1 commit
- Remove an Assert that's been obsoleted by recent changes in the parsetree · 88ae1bd3
  Tom Lane authored 17 years ago
  representation of DECLARE CURSOR.  Report and fix by Heikki.
  88ae1bd3
12 Jul, 2007 1 commit
- Fix mistaken Assert in adjust_appendrel_attr_needed, per Greg Stark. · bc8d164d
  Tom Lane authored 17 years ago
  
  bc8d164d
11 Jun, 2007 1 commit

Support UPDATE/DELETE WHERE CURRENT OF cursor_name, per SQL standard. · 6808f1b1

Tom Lane authored 17 years ago

Along the way, allow FOR UPDATE in non-WITH-HOLD cursors; there may once
have been a reason to disallow that, but it seems to work now, and it's
really rather necessary if you want to select a row via a cursor and then
update it in a concurrent-safe fashion.

Original patch by Arul Shaji, rather heavily editorialized by Tom Lane.

6808f1b1

21 Apr, 2007 1 commit

Tweak make_inh_translation_lists() to check the common case wherein parent and · 925ca9d7

Tom Lane authored 17 years ago

child attnums are the same, before it grovels through each and every child
column looking for a name match.  Saves some time in large inheritance trees,
per example from Greg.

925ca9d7
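
The fast path described above might look roughly like this in Python. It is a sketch of the idea only: `translate_attnums` is a hypothetical function, columns are represented by name lists, and positions are 0-based here for simplicity.

```python
def translate_attnums(parent_cols, child_cols):
    """Map each parent column position to the child's position for the same name."""
    mapping = []
    for pos, name in enumerate(parent_cols):
        # Fast path: the child has the same column at the same position.
        if pos < len(child_cols) and child_cols[pos] == name:
            mapping.append(pos)
            continue
        # Slow path: search every child column for a name match.
        mapping.append(child_cols.index(name))
    return mapping

parent = ["id", "ts", "val"]
same_layout = translate_attnums(parent, ["id", "ts", "val", "extra"])
shuffled = translate_attnums(parent, ["ts", "id", "val"])
```

With matching layouts every column takes the constant-time check, so the per-child cost is linear in the number of columns rather than quadratic.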

17 Mar, 2007 1 commit

Fix up the remaining places where the expression node structure would lose · 0f4ff460

Tom Lane authored 17 years ago

available information about the typmod of an expression; namely, Const,
ArrayRef, ArrayExpr, and EXPR and ARRAY SubLinks. In the ArrayExpr and
SubLink cases it wasn't really the data structure's fault, but exprTypmod()
being lazy. This seems like a good idea in view of the expected increase in
typmod usage from Teodor's work to allow user-defined types to have typmods.
In particular this responds to the concerns we had about eliminating the
special-purpose hack that exprTypmod() used to have for BPCHAR Consts.
We can now tell whether or not such a Const has been cast to a specific
length, and report or display properly if so.

initdb forced due to changes in stored rules.

0f4ff460

22 Feb, 2007 1 commit

Turn the rangetable used by the executor into a flat list, and avoid storing · eab6b8b2

Tom Lane authored 18 years ago

useless substructure for its RangeTblEntry nodes. (I chose to keep using the
same struct node type and just zero out the link fields for unneeded info,
rather than making a separate ExecRangeTblEntry type --- it seemed too
fragile to have two different rangetable representations.)

Along the way, put subplans into a list in the toplevel PlannedStmt node,
and have SubPlan nodes refer to them by list index instead of direct pointers.
Vadim wanted to do that years ago, but I never understood what he was on about
until now. It makes things a *whole* lot more robust, because we can stop
worrying about duplicate processing of subplans during expression tree
traversals. That's been a constant source of bugs, and it's finally gone.

There are some consequent simplifications yet to be made, like not using
a separate EState for subplans in the executor, but I'll tackle that later.

eab6b8b2
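
A minimal Python sketch of the "refer to subplans by list index" idea follows. The class and field names (`SubPlanRef`, `plan_index`, `finalize`) are hypothetical and 0-based; they are not the actual node definitions.

```python
from dataclasses import dataclass, field

@dataclass
class SubPlanRef:
    plan_index: int  # position in PlannedStmt.subplans, not a direct pointer

@dataclass
class PlannedStmt:
    expressions: list
    subplans: list = field(default_factory=list)

def finalize(stmt):
    """Process each subplan exactly once, however many expressions reference it."""
    return [f"finalized {plan}" for plan in stmt.subplans]

stmt = PlannedStmt(
    expressions=[SubPlanRef(0), SubPlanRef(0), SubPlanRef(1)],
    subplans=["scan_a", "scan_b"],
)
processed = finalize(stmt)
resolved = stmt.subplans[stmt.expressions[1].plan_index]
```

Two expressions reference subplan 0, but because the plans live in one top-level list, per-subplan work is a single pass over that list and an expression tree walker never has to descend into a subplan and risk handling it twice.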

19 Feb, 2007 1 commit

Get rid of some old and crufty global variables in the planner. When · 7c5e5439

Tom Lane authored 18 years ago

this code was last gone over, there wasn't really any alternative to
globals because we didn't have the PlannerInfo struct being passed all
through the planner code.  Now that we do, we can restructure things
to avoid non-reentrancy.  I'm fooling with this because otherwise I'd
have had to add another global variable for the planned compact
range table list.

7c5e5439

22 Jan, 2007 1 commit

Put back planner's ability to cache the results of mergejoinscansel(), · 4f06c688

Tom Lane authored 18 years ago

which I had removed in the first cut of the EquivalenceClass rewrite to
simplify that patch a little. But it's still important --- in a four-way
join problem mergejoinscansel() was eating about 40% of the planning time
according to gprof. Also, improve the EquivalenceClass code to re-use
join RestrictInfos rather than generating fresh ones for each join
considered. This saves some memory space but more importantly improves
the effectiveness of caching planning info in RestrictInfos.

4f06c688
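
Why re-using RestrictInfos matters for caching can be shown with a small memoization sketch in Python. Everything here is a stand-in: `fake_mergejoinscansel` returns made-up numbers, and the `RestrictInfo` class and its `scansel_cache` attribute are simplified assumptions rather than the real struct.

```python
call_log = []

def fake_mergejoinscansel(clause):
    """Stand-in for an expensive selectivity estimate (values are made up)."""
    call_log.append(clause)
    return (0.25, 1.0)

class RestrictInfo:
    def __init__(self, clause):
        self.clause = clause
        self.scansel_cache = None  # filled in on first use

def cached_scansel(rinfo):
    """Return the estimate, computing it only if this RestrictInfo has none yet."""
    if rinfo.scansel_cache is None:
        rinfo.scansel_cache = fake_mergejoinscansel(rinfo.clause)
    return rinfo.scansel_cache

# A fresh RestrictInfo per join attempt: the cache never gets a chance to hit.
for _ in range(3):
    cached_scansel(RestrictInfo("a.x = b.y"))
fresh_calls = len(call_log)

# One RestrictInfo re-used across join attempts: the estimate is computed once.
call_log.clear()
shared = RestrictInfo("a.x = b.y")
for _ in range(3):
    cached_scansel(shared)
shared_calls = len(call_log)
```

The cache lives on the object, so it only pays off when the same object is handed to each join attempt; regenerating the clause wrapper each time silently defeats it.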

20 Jan, 2007 1 commit

Refactor planner's pathkeys data structure to create a separate, explicit · f41803bb

Tom Lane authored 18 years ago

representation of equivalence classes of variables.  This is an extensive
rewrite, but it brings a number of benefits:
* planner no longer fails in the presence of "incomplete" operator families
that don't offer operators for every possible combination of datatypes.
* avoid generating and then discarding redundant equality clauses.
* remove bogus assumption that derived equalities always use operators
named "=".
* mergejoins can work with a variety of sort orders (e.g., descending) now,
instead of tying each mergejoinable operator to exactly one sort order.
* better recognition of redundant sort columns.
* can make use of equalities appearing underneath an outer join.

f41803bb
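
The notion of an equivalence class of variables can be sketched with a union-find structure in Python. Note that union-find is a technique swapped in here for illustration; the planner's own representation is not shown in this log, and the `EquivalenceClasses` API below is hypothetical.

```python
class EquivalenceClasses:
    """Tracks which expressions are known equal (union-find, for illustration)."""

    def __init__(self):
        self.parent = {}

    def find(self, x):
        self.parent.setdefault(x, x)
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]  # path halving
            x = self.parent[x]
        return x

    def add_equality(self, a, b):
        """Record the clause a = b by merging the two classes."""
        ra, rb = self.find(a), self.find(b)
        if ra != rb:
            self.parent[ra] = rb

    def same_class(self, a, b):
        return self.find(a) == self.find(b)

    def members(self, x):
        root = self.find(x)
        return sorted(m for m in list(self.parent) if self.find(m) == root)

ec = EquivalenceClasses()
ec.add_equality("a.x", "b.y")
ec.add_equality("b.y", "c.z")

implied = ec.same_class("a.x", "c.z")  # never written by the user, but known
group = ec.members("a.x")
unrelated = ec.same_class("a.x", "d.w")
```

Once `a.x`, `b.y`, and `c.z` share a class, a derived clause such as `a.x = c.z` can be produced on demand instead of being generated up front and discarded, and a sort on any one member is recognizably a sort on the others.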

05 Jan, 2007 1 commit
- Update CVS HEAD for 2007 copyright. Back branches are typically not · 29dccf5f
  Bruce Momjian authored 18 years ago
  back-stamped for this.
  29dccf5f
04 Oct, 2006 1 commit
- pgindent run for 8.2. · f99a569a
  Bruce Momjian authored 18 years ago
  
  f99a569a
10 Aug, 2006 1 commit

Fix UNION/INTERSECT/EXCEPT so that when two inputs being merged have · 0ee26100

Tom Lane authored 18 years ago

same data type and same typmod, we show that typmod as the output
typmod, rather than generic -1.  This responds to several complaints
over the past few years about UNIONs unexpectedly dropping length or
precision info.

0ee26100
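
The rule stated in this commit reduces to a few lines. The Python sketch below is an assumption-laden illustration: `union_output_typmod` is a hypothetical function, and the typmod integers are opaque example values rather than real encodings.

```python
def union_output_typmod(inputs):
    """inputs: list of (type_name, typmod) pairs, one per set-operation arm.

    If every arm has the same type and the same typmod, keep that typmod;
    otherwise fall back to the generic -1.
    """
    first_type, first_typmod = inputs[0]
    if all(t == first_type and m == first_typmod for t, m in inputs):
        return first_typmod
    return -1

kept = union_output_typmod([("varchar", 14), ("varchar", 14)])
dropped = union_output_typmod([("varchar", 14), ("varchar", 24)])
mixed = union_output_typmod([("varchar", 14), ("text", -1)])
```

Only the fully matching case preserves length or precision information; any disagreement between arms yields -1, as before the fix.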

30 Apr, 2006 1 commit

Improve the representation of FOR UPDATE/FOR SHARE so that we can · 986085a7

Tom Lane authored 18 years ago

support both FOR UPDATE and FOR SHARE in one command, as well as both
NOWAIT and normal WAIT behavior.  The more general code is actually
simpler and cleaner.

986085a7

05 Mar, 2006 1 commit
- Update copyright for 2006. Update scripts. · f2f5b056
  Bruce Momjian authored 18 years ago
  
  f2f5b056
03 Feb, 2006 1 commit

Teach planner to convert simple UNION ALL subqueries into append relations, · 8b109ebf

Tom Lane authored 19 years ago

thereby sharing code with the inheritance case.  This puts the UNION-ALL-view
approach to partitioned tables on par with inheritance, so far as constraint
exclusion is concerned: it works either way.  (Still need to update the docs
to say so.)  The definition of "simple UNION ALL" is a little simpler than
I would like --- basically the union arms can only be SELECT * FROM foo
--- but it's good enough for partitioned-table cases.

8b109ebf

31 Jan, 2006 1 commit

Restructure planner's handling of inheritance. Rather than processing · 8a1468af

Tom Lane authored 19 years ago

inheritance trees on-the-fly, which pretty well constrained us to considering
only one way of planning inheritance, expand inheritance sets during the
planner prep phase, and build a side data structure that can be consulted
later to find which RTEs are members of which inheritance sets. As proof of
concept, use the data structure to plan joins against inheritance sets more
efficiently: we can now use indexes on the set members in inner-indexscan
joins. (The generated plans could be improved further, but it'll take some
executor changes.) This data structure will also support handling UNION ALL
subqueries in the same way as inheritance sets, but that aspect of it isn't
finished yet.

8a1468af

22 Nov, 2005 1 commit

Re-run pgindent, fixing a problem where comment lines after a blank · 436a2956

Bruce Momjian authored 19 years ago

comment line were output as too long, and update typedefs for /lib
directory.  Also fix case where identifiers were used as variable names
in the backend, but as typedefs in ecpg (favor the backend for
indenting).

Backpatch to 8.1.X.

436a2956

15 Oct, 2005 1 commit
- Standard pgindent run for 8.1. · 1dc34982
  Bruce Momjian authored 19 years ago
  
  1dc34982
02 Aug, 2005 1 commit
- Prevent planner from including temp tables of other backends when expanding · 688784f6
  Tom Lane authored 19 years ago
  an inheritance tree.  Per recent discussions.
  688784f6
28 Jul, 2005 1 commit
- Make use of new list primitives list_append_unique and list_concat_unique · 5d27bf20
  Tom Lane authored 19 years ago
  where applicable.
  5d27bf20
10 Jun, 2005 1 commit

If a LIMIT is applied to a UNION ALL query, plan each UNION arm as · 3b167a40

Tom Lane authored 19 years ago

if the limit were directly applied to it. This does not actually
add a LIMIT plan node to the generated subqueries --- that would be
useless overhead --- but it does cause the planner to prefer fast-
start plans when the limit is small. After an idea from Phil Endecott.

3b167a40

09 Jun, 2005 1 commit

Simplify the planner's join clause management by storing join clauses · a31ad27f

Tom Lane authored 19 years ago

of a relation in a flat 'joininfo' list. The former arrangement grouped
the join clauses according to the set of unjoined relids used in each;
however, profiling on test cases involving lots of joins proves that
that data structure is a net loss. It takes more time to group the
join clauses together than is saved by avoiding duplicate tests later.
It doesn't help any that there are usually not more than one or two
clauses per group ...

a31ad27f

05 Jun, 2005 1 commit

Remove planner's private fields from Query struct, and put them into · 9ab4d981

Tom Lane authored 19 years ago

a new PlannerInfo struct, which is passed around instead of the bare
Query in all the planning code.  This commit is essentially just a
code-beautification exercise, but it does open the door to making
larger changes to the planner data structures without having to muck
with the widely-known Query struct.

9ab4d981

22 May, 2005 1 commit

Teach the planner to remove SubqueryScan nodes from the plan if they · e2159f38

Tom Lane authored 19 years ago

aren't doing anything useful (ie, neither selection nor projection).
Also, extend to SubqueryScan the hacks already in place to avoid
unnecessary ExecProject calls when the result would just be the same
tuple the subquery already delivered. This saves some overhead in
UNION and other set operations, as well as avoiding overhead for
unflatten-able subqueries. Per example from Sokolov Yura.

e2159f38
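
A simplified Python sketch of removing do-nothing SubqueryScan nodes is shown below. The `Plan` class, its `projects` flag, and `strip_trivial_subquery_scans` are hypothetical; a real plan node carries far more state, and this model only handles single-child plans.

```python
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class Plan:
    kind: str
    child: Optional["Plan"] = None
    quals: list = field(default_factory=list)  # selection conditions
    projects: bool = False                     # True if the node changes the target list

def strip_trivial_subquery_scans(plan):
    """Drop SubqueryScan nodes that neither filter nor project."""
    if plan is None:
        return None
    plan.child = strip_trivial_subquery_scans(plan.child)
    if plan.kind == "SubqueryScan" and not plan.quals and not plan.projects:
        return plan.child  # splice the node out of the tree
    return plan

trivial = Plan("Limit", Plan("SubqueryScan", Plan("SeqScan")))
filtering = Plan("Limit", Plan("SubqueryScan", Plan("SeqScan"), quals=["x > 0"]))

stripped = strip_trivial_subquery_scans(trivial)
kept = strip_trivial_subquery_scans(filtering)
```

The scan with a qual survives because it still does selection work; only the pass-through node is removed, saving one level of per-tuple overhead.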

06 Apr, 2005 1 commit

Merge Resdom nodes into TargetEntry nodes to simplify code and save a · ad161bcc

Tom Lane authored 19 years ago

few palloc's. I also chose to eliminate the restype and restypmod fields
entirely, since they are redundant with information stored in the node's
contained expression; re-examining the expression at need seems simpler
and more reliable than trying to keep restype/restypmod up to date.

initdb forced due to change in contents of stored rules.

ad161bcc