Commit e4106b25 authored by Robert Haas's avatar Robert Haas

postgres_fdw: Push down joins to remote servers.

If we've got a relatively straightforward join between two tables,
this pushes that join down to the remote server instead of fetching
the rows for each table and performing the join locally.  Some cases
are not handled yet, such as SEMI and ANTI joins.  Also, we don't
yet attempt to create presorted join paths or parameterized join
paths even though these options do get tried for a base relation
scan.  Nevertheless, this seems likely to be a very significant win
in many practical cases.

Shigeru Hanada and Ashutosh Bapat, reviewed by Robert Haas, with
additional review at various points by Tom Lane, Etsuro Fujita,
KaiGai Kohei, and Jeevan Chalke.
parent 7351e182
This diff is collapsed.
This diff is collapsed.
...@@ -26,7 +26,25 @@ ...@@ -26,7 +26,25 @@
*/ */
typedef struct PgFdwRelationInfo typedef struct PgFdwRelationInfo
{ {
/* baserestrictinfo clauses, broken down into safe and unsafe subsets. */ /*
* True means that the relation can be pushed down. Always true for simple
* foreign scan.
*/
bool pushdown_safe;
/*
* Restriction clauses, divided into safe and unsafe to pushdown subsets.
*
* For a base foreign relation this is a list of clauses along-with
* RestrictInfo wrapper. Keeping RestrictInfo wrapper helps while dividing
* scan_clauses in postgresGetForeignPlan into safe and unsafe subsets.
* Also it helps in estimating costs since RestrictInfo caches the
* selectivity and qual cost for the clause in it.
*
* For a join relation, however, they are part of otherclause list
* obtained from extract_actual_join_clauses, which strips RestrictInfo
* construct. So, for a join relation they are list of bare clauses.
*/
List *remote_conds; List *remote_conds;
List *local_conds; List *local_conds;
...@@ -37,11 +55,17 @@ typedef struct PgFdwRelationInfo ...@@ -37,11 +55,17 @@ typedef struct PgFdwRelationInfo
QualCost local_conds_cost; QualCost local_conds_cost;
Selectivity local_conds_sel; Selectivity local_conds_sel;
/* Estimated size and cost for a scan with baserestrictinfo quals. */ /* Selectivity of join conditions */
Selectivity joinclause_sel;
/* Estimated size and cost for a scan or join. */
double rows; double rows;
int width; int width;
Cost startup_cost; Cost startup_cost;
Cost total_cost; Cost total_cost;
/* Costs excluding costs for transferring data from the foreign server */
Cost rel_startup_cost;
Cost rel_total_cost;
/* Options extracted from catalogs. */ /* Options extracted from catalogs. */
bool use_remote_estimate; bool use_remote_estimate;
...@@ -55,6 +79,19 @@ typedef struct PgFdwRelationInfo ...@@ -55,6 +79,19 @@ typedef struct PgFdwRelationInfo
UserMapping *user; /* only set in use_remote_estimate mode */ UserMapping *user; /* only set in use_remote_estimate mode */
int fetch_size; /* fetch size for this remote table */ int fetch_size; /* fetch size for this remote table */
/*
* Name of the relation while EXPLAINing ForeignScan. It is used for join
* relations but is set for all relations. For join relation, the name
* indicates which foreign tables are being joined and the join type used.
*/
StringInfo relation_name;
/* Join information */
RelOptInfo *outerrel;
RelOptInfo *innerrel;
JoinType jointype;
List *joinclauses;
} PgFdwRelationInfo; } PgFdwRelationInfo;
/* in postgres_fdw.c */ /* in postgres_fdw.c */
...@@ -102,12 +139,15 @@ extern void deparseAnalyzeSql(StringInfo buf, Relation rel, ...@@ -102,12 +139,15 @@ extern void deparseAnalyzeSql(StringInfo buf, Relation rel,
List **retrieved_attrs); List **retrieved_attrs);
extern void deparseStringLiteral(StringInfo buf, const char *val); extern void deparseStringLiteral(StringInfo buf, const char *val);
extern Expr *find_em_expr_for_rel(EquivalenceClass *ec, RelOptInfo *rel); extern Expr *find_em_expr_for_rel(EquivalenceClass *ec, RelOptInfo *rel);
extern List *build_tlist_to_deparse(RelOptInfo *foreign_rel);
extern void deparseSelectStmtForRel(StringInfo buf, PlannerInfo *root, extern void deparseSelectStmtForRel(StringInfo buf, PlannerInfo *root,
RelOptInfo *baserel, List *remote_conds, List *pathkeys, RelOptInfo *foreignrel, List *tlist,
List *remote_conds, List *pathkeys,
List **retrieved_attrs, List **params_list); List **retrieved_attrs, List **params_list);
/* in shippable.c */ /* in shippable.c */
extern bool is_builtin(Oid objectId); extern bool is_builtin(Oid objectId);
extern bool is_shippable(Oid objectId, Oid classId, PgFdwRelationInfo *fpinfo); extern bool is_shippable(Oid objectId, Oid classId, PgFdwRelationInfo *fpinfo);
extern const char *get_jointype_name(JoinType jointype);
#endif /* POSTGRES_FDW_H */ #endif /* POSTGRES_FDW_H */
This diff is collapsed.
...@@ -486,6 +486,16 @@ ...@@ -486,6 +486,16 @@
be <literal>IMMUTABLE</> as well. be <literal>IMMUTABLE</> as well.
</para> </para>
<para>
When <filename>postgres_fdw</> encounters a join between foreign tables on
the same foreign server, it sends the entire join to the foreign server,
unless for some reason it believes that it will be more efficient to fetch
rows from each table individually, or unless the table references involved
are subject to different user mappings. While sending the <literal>JOIN</>
clauses, it takes the same precautions as mentioned above for the
<literal>WHERE</> clauses.
</para>
<para> <para>
The query that is actually sent to the remote server for execution can The query that is actually sent to the remote server for execution can
be examined using <command>EXPLAIN VERBOSE</>. be examined using <command>EXPLAIN VERBOSE</>.
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment