1. 21 Jul, 2019 4 commits
    • David Rowley's avatar
      Adjust overly strict Assert · e1a0f6a9
      David Rowley authored
      3373c715 changed how we determine EquivalenceClasses for relations and
      added an Assert to ensure all relations mentioned in each EC's ec_relids
      was a RELOPT_BASEREL.  However, the join removal code may remove a LEFT
      JOIN and since it does not clean up EC members belonging to the removed
      relations it can leave RELOPT_DEADREL rels in ec_relids.
      
      Fix this by adjusting the Assert to allow RELOPT_DEADREL rels too.
      
      Reported-by: sqlsmith via Andreas Seltenreich
      Discussion: https://postgr.es/m/87y30r8sls.fsf@ansel.ydns.eu
      e1a0f6a9
    • Tom Lane's avatar
      Remove no-longer-helpful reliance on fixed-size local array. · 330cafdf
      Tom Lane authored
      Coverity complained about this code, apparently because it uses a local
      array of size FUNC_MAX_ARGS without a guard that the input argument list
      is no longer than that.  (Not sure why it complained today, since this
      code's been the same for a long time; possibly it re-analyzed everything
      the List API change touched?)
      
      Rather than add a guard, though, let's just get rid of the local array
      altogether.  It was only there to avoid list_nth() calls, and those are
      no longer expensive.
      330cafdf
    • Michael Paquier's avatar
      Fix compilation warning of pg_basebackup with MinGW · 90317ab7
      Michael Paquier authored
      Several buildfarm members have been complaining about that with gcc,
      like jacana.  Weirdly enough, Visual Studio's compilers do not find this
      issue.
      
      Author: Michael Paquier
      Reviewed-by: Andrew Dunstan
      Discussion: https://postgr.es/m/20190719050830.GK1859@paquier.xyz
      90317ab7
    • David Rowley's avatar
      Speed up finding EquivalenceClasses for a given set of rels · 3373c715
      David Rowley authored
      Previously in order to determine which ECs a relation had members in, we
      had to loop over all ECs stored in PlannerInfo's eq_classes and check if
      ec_relids mentioned the relation.  For the most part, this was fine, as
      generally, unless queries were fairly complex, the overhead of performing
      the lookup would have not been that significant.  However, when queries
      contained large numbers of joins and ECs, the overhead to find the set of
      classes matching a given set of relations could become a significant
      portion of the overall planning effort.
      
      Here we allow a much more efficient method to access the ECs which match a
      given relation or set of relations.  A new Bitmapset field in RelOptInfo
      now exists to store the indexes into PlannerInfo's eq_classes list which
      each relation is mentioned in.  This allows very fast lookups to find all
      ECs belonging to a single relation.  When we need to lookup ECs belonging
      to a given pair of relations, we can simply bitwise-AND the Bitmapsets from
      each relation and use the result to perform the lookup.
      
      We also take the opportunity to write a new implementation of
      generate_join_implied_equalities which makes use of the new indexes.
      generate_join_implied_equalities_for_ecs must remain as is as it can be
      given a custom list of ECs, which we can't easily determine the indexes of.
      
      This was originally intended to fix the performance penalty of looking up
      foreign keys matching a join condition which was introduced by 100340e2.
      However, we're speeding up much more than just that here.
      
      Author: David Rowley, Tom Lane
      Reviewed-by: Tom Lane, Tomas Vondra
      Discussion: https://postgr.es/m/6970.1545327857@sss.pgh.pa.us
      3373c715
  2. 20 Jul, 2019 3 commits
    • Peter Geoghegan's avatar
      Don't rely on estimates for amcheck Bloom filters. · 894af78f
      Peter Geoghegan authored
      Solely relying on a relation's reltuples/relpages estimate to size the
      Bloom filters used by amcheck verification makes verification less
      effective when the estimates are very stale.  In extreme cases,
      verification options that use Bloom filters internally could be totally
      ineffective, without users receiving any clear indication that certain
      types of corruption might easily be missed.
      
      To fix, use RelationGetNumberOfBlocks() instead of relpages to size the
      downlink block Bloom filter.  Use the same RelationGetNumberOfBlocks()
      value to derive a minimum size for the heapallindexed Bloom filter,
      rather than completely trusting reltuples.  Verification will still be
      reasonably effective when the projected/estimated number of Bloom filter
      elements is at least 1/5 of the final number of elements, which is
      assured by the new sizing logic.
      
      Reported-By: Alexander Korotkov
      Discussion: https://postgr.es/m/CAH2-Wzk0ke2J42KrNYBKu0Xovjy-sU5ub7PWjgpbsKdAQcL4OA@mail.gmail.com
      Backpatch: 11-, where downlink/heapallindexed verification were added.
      894af78f
    • Tomas Vondra's avatar
      Use column collation for extended statistics · a63378a0
      Tomas Vondra authored
      The current extended statistics code was a bit confused which collation
      to use.  When building the statistics, the collations defined as default
      for the data types were used (since commit 5e092800).  The MCV code was
      however using the column collations for MCV serialization, and then
      DEFAULT_COLLATION_OID when computing estimates. So overall the code was
      using all three possible options, inconsistently.
      
      This uses the column colation everywhere - this makes it consistent with
      what 5e092800 did for regular stats.  We however do not track the
      collations in a catalog, because we can derive them from column-level
      information.  This may need to change in the future, e.g. after allowing
      statistics on expressions.
      
      Reviewed-by: Tom Lane
      Discussion: https://postgr.es/m/8736jdhbhc.fsf%40ansel.ydns.eu
      Backpatch-to: 12
      a63378a0
    • Tomas Vondra's avatar
      Rework examine_opclause_expression to use varonleft · e38a55ba
      Tomas Vondra authored
      The examine_opclause_expression function needs to return information on
      which side of the operator we found the Var, but the variable was called
      "isgt" which is rather misleading (it assumes the operator is either
      less-than or greater-than, but it may be equality or something else).
      Other places in the planner use a variable called "varonleft" for this
      purpose, so just adopt the same convention here.
      
      The code also assumed we don't care about this flag for equality, as
      (Var = Const) and (Const = Var) should be the same thing. But that does
      not work for cross-type operators, in which case we need to pass the
      parameters to the procedure in the right order. So just use the same
      code for all types of expressions.
      
      This means we don't need to care about the selectivity estimation
      function anymore, at least not in this code. We should only get the
      supported cases here (thanks to statext_is_compatible_clause).
      
      Reviewed-by: Tom Lane
      Discussion: https://postgr.es/m/8736jdhbhc.fsf%40ansel.ydns.eu
      Backpatch-to: 12
      e38a55ba
  3. 19 Jul, 2019 5 commits
  4. 18 Jul, 2019 10 commits
    • Michael Paquier's avatar
      Fix typo in mvdistinct.c · 70a33b21
      Michael Paquier authored
      Noticed while browsing the code.
      70a33b21
    • Jeff Davis's avatar
      Fix daterange canonicalization for +/- infinity. · e6feef57
      Jeff Davis authored
      The values 'infinity' and '-infinity' are a part of the DATE type
      itself, so a bound of the date 'infinity' is not the same as an
      unbounded/infinite range. However, it is still wrong to try to
      canonicalize such values, because adding or subtracting one has no
      effect. Fix by treating 'infinity' and '-infinity' the same as
      unbounded ranges for the purposes of canonicalization (but not other
      purposes).
      
      Backpatch to all versions because it is inconsistent with the
      documented behavior. Note that this could be an incompatibility for
      applications relying on the behavior contrary to the documentation.
      
      Author: Laurenz Albe
      Reviewed-by: Thomas Munro
      Discussion: https://postgr.es/m/77f24ea19ab802bc9bc60ddbb8977ee2d646aec1.camel%40cybertec.at
      Backpatch-through: 9.4
      e6feef57
    • Peter Geoghegan's avatar
      Fix nbtree metapage cache upgrade bug. · d004147e
      Peter Geoghegan authored
      Commit 857f9c36, which taught nbtree VACUUM to avoid unnecessary
      index scans, bumped the nbtree version number from 2 to 3, while adding
      the ability for nbtree indexes to be upgraded on-the-fly.  Various
      assertions that assumed that an nbtree index was always on version 2 had
      to be changed to accept any supported version (version 2 or 3 on
      Postgres 11).
      
      However, a few assertions were missed in the initial commit, all of
      which were in code paths that cache a local copy of the metapage
      metadata, where the index had been expected to be on the current version
      (no longer version 2) as a generic sanity check.  Rather than simply
      update the assertions, follow-up commit 0a64b451 intentionally made
      the metapage caching code update the per-backend cached metadata version
      without changing the on-disk version at the same time.  This could even
      happen when the planner needed to determine the height of a B-Tree for
      costing purposes.  The assertions only fail on Postgres v12 when
      upgrading from v10, because they were adjusted to use the authoritative
      shared memory metapage by v12's commit dd299df8.
      
      To fix, remove the cache-only upgrade mechanism entirely, and update the
      assertions themselves to accept any supported version (go back to using
      the cached version in v12).  The fix is almost a full revert of commit
      0a64b451 on the v11 branch.
      
      VACUUM only considers the authoritative metapage, and never bothers with
      a locally cached version, whereas everywhere else isn't interested in
      the metapage fields that were added by commit 857f9c36.  It seems
      unlikely that this bug has affected any user on v11.
      
      Reported-By: Christoph Berg
      Bug: #15896
      Discussion: https://postgr.es/m/15896-5b25e260fdb0b081%40postgresql.org
      Backpatch: 11-, where VACUUM was taught to avoid unnecessary index scans.
      d004147e
    • Tom Lane's avatar
      Further adjust SPITupleTable to provide a public row-count field. · bc8393cf
      Tom Lane authored
      Now that commit fec0778c drew a clear line between public and private
      fields in SPITupleTable, it seems pretty silly that the count of valid
      tuples isn't on the public side of that line.  The reason why not was
      that there wasn't such a count.  For reasons lost in the mists of time,
      spi.c preferred to keep a count of remaining free entries in the array.
      But that seems pretty pointless: it's unlike the way we handle similar
      code everywhere else, and it involves extra subtractions that surely
      outweigh having to do a comparison rather than test-for-zero to check
      for array-full.
      
      Hence, rearrange so that this code does the expansible array logic
      the same as everywhere else, with a count of valid entries alongside
      the allocated array length.  And document the count as public.
      
      I looked for core-code callers where it would make sense to start
      relying on tuptable->numvals rather than the separate SPI_processed
      variable.  Right now there don't seem to be places where it'd be
      a win to do so without more code restructuring than I care to
      undertake today.  In principle, though, having SPITupleTables be
      fully self-contained should be helpful down the line.
      
      Discussion: https://postgr.es/m/16852.1563395722@sss.pgh.pa.us
      bc8393cf
    • Tomas Vondra's avatar
      Simplify bitmap updates in multivariate MCV code · 7d24f6a4
      Tomas Vondra authored
      When evaluating clauses on a multivariate MCV list, we build a bitmap
      tracking how the clauses match each item of the MCV list.  When updating
      the bitmap we need to consider the current value (tracking how the item
      matches preceding clauses), match for the current clause and whether the
      clauses are connected by AND or OR.
      
      Until now the logic was copied on every place updating the bitmap, which
      was not quite readable.  So just move it to a separate function and call
      it where needed.
      
      Backpatch to 12, where the code was introduced. While not a bugfix, this
      should make maintenance and future backpatches easier.
      
      Discussion: https://postgr.es/m/8736jdhbhc.fsf%40ansel.ydns.eu
      7d24f6a4
    • Tomas Vondra's avatar
      Fix handling of NULLs in MCV items and constants · e4deae73
      Tomas Vondra authored
      There were two issues in how the extended statistics handled NULL values
      in opclauses. Firstly, the code was oblivious to the possibility that
      Const may be NULL (constisnull=true) in which case the constvalue is
      undefined. We need to treat this as a mismatch, and not call the proc.
      
      Secondly, the MCV item itself may contain NULL values too - the code
      already did check that, and updated the match bitmap accordingly, but
      failed to ensure we won't call the operator procedure anyway. It did
      work for AND-clauses, because in that case false in the bitmap stops
      evaluation of further clauses. But for OR-clauses ir was not easy to
      get incorrect estimates or even trigger a crash.
      
      This fixes both issues by extending the existing check so that it looks
      at constisnull too, and making sure it skips calling the procedure.
      
      Discussion: https://postgr.es/m/8736jdhbhc.fsf%40ansel.ydns.eu
      e4deae73
    • Tomas Vondra's avatar
      Fix handling of opclauses in extended statistics · e8b6ae21
      Tomas Vondra authored
      We expect opclauses to have exactly one Var and one Const, but the code
      was checking the Const by calling is_pseudo_constant_clause() which is
      incorrect - we need a proper constant.
      
      Fixed by using plain IsA(x,Const) to check type of the node. We need to
      do these checks in two places, so move it into a separate function that
      can be called in both places.
      
      Reported by Andreas Seltenreich, based on crash reported by sqlsmith.
      
      Backpatch to v12, where this code was introduced.
      
      Discussion: https://postgr.es/m/8736jdhbhc.fsf%40ansel.ydns.eu
      Backpatch-to: 12
      e8b6ae21
    • Tomas Vondra's avatar
      Remove unnecessary TYPECACHE_GT_OPR lookup · a4303a07
      Tomas Vondra authored
      The TYPECACHE_GT_OPR is not needed (it used to be in older version of
      the MCV code), but the compiler failed to detect this as the result was
      used in a fmgr_info() call, populating a FmgrInfo entry.
      
      Backpatch to v12, where this code was introduced.
      
      Discussion: https://postgr.es/m/8736jdhbhc.fsf%40ansel.ydns.eu
      Backpatch-to: 12
      a4303a07
    • Andres Freund's avatar
      21039555
    • Michael Paquier's avatar
      Simplify description of --data-checksums in documentation of initdb · 1c1602b8
      Michael Paquier authored
      The documentation mentioned that data checksums cannot be changed after
      initialization, which is not true as pg_checksums can do that with its
      --enable option introduced in v12.  This simply removes the sentence
      telling so.
      
      Reported-by: Basil Bourque
      Author: Michael Paquier
      Reviewed-by: Daniel Gustafsson
      Discussion: https://postgr.es/m/15909-e9d74271f1647472@postgresql.org
      Backpatch-through: 12
      1c1602b8
  5. 17 Jul, 2019 7 commits
    • Tom Lane's avatar
      Update time zone data files to tzdata release 2019b. · 93907478
      Tom Lane authored
      Brazil no longer observes DST.
      Historical corrections for Palestine, Hong Kong, and Italy.
      93907478
    • Tom Lane's avatar
      Sync our copy of the timezone library with IANA release tzcode2019b. · f285322f
      Tom Lane authored
      A large fraction of this diff is just due to upstream's somewhat
      random decision to rename a bunch of internal variables and struct
      fields.  However, there is an interesting new feature in zic:
      it's grown a "-b slim" option that emits zone files without 32-bit
      data and other backwards-compatibility hacks.  We should consider
      whether we wish to enable that.
      f285322f
    • Tom Lane's avatar
      Clarify the distinction between public and private SPITupleTable fields. · fec0778c
      Tom Lane authored
      The fields that we consider public are "tupdesc" and "vals", which
      historically are in the middle of the struct.  Move them to the front
      (this should be perfectly safe to do in HEAD) and add comments to make
      it quite clear which fields are public or not.
      
      Also adjust spi.sgml's documentation of the struct to match.
      That doc had bit-rotted somewhat, as it was missing some fields.
      (Arguably we should just remove all the private fields from the docs,
      but for now I refrained.)
      
      Daniel Gustafsson, reviewed by Fabien Coelho
      
      Discussion: https://postgr.es/m/0D19F836-B743-4340-B6A2-F148CA3DD1F0@yesql.se
      fec0778c
    • Tom Lane's avatar
      Doc: explain where to find Makefile used to build sepgsql-regtest.pp. · 860c095f
      Tom Lane authored
      At least on Fedora and RHEL, it's not in the same RPM that's needed
      for building sepgsql itself.  Today is the second or third time I've
      had to rediscover how to install that, so let's document it this time.
      860c095f
    • Tom Lane's avatar
      Fix sepgsql test results for commit d97b714a. · 82c8a3c5
      Tom Lane authored
      The aggregate-order difference explained in my previous commit
      turns out to also affect the order of log entries emitted in the
      contrib/sepgsql regression test.  Per buildfarm.
      
      Discussion: https://postgr.es/m/21272.1563318411@sss.pgh.pa.us
      82c8a3c5
    • Tom Lane's avatar
      Avoid using lcons and list_delete_first where it's easy to do so. · d97b714a
      Tom Lane authored
      Formerly, lcons was about the same speed as lappend, but with the new
      List implementation, that's not so; with a long List, data movement
      imposes an O(N) cost on lcons and list_delete_first, but not lappend.
      
      Hence, invent list_delete_last with semantics parallel to
      list_delete_first (but O(1) cost), and change various places to use
      lappend and list_delete_last where this can be done without much
      violence to the code logic.
      
      There are quite a few places that construct result lists using lcons not
      lappend.  Some have semantic rationales for that; I added comments about
      it to a couple that didn't have them already.  In many such places though,
      I think the coding is that way only because back in the dark ages lcons
      was faster than lappend.  Hence, switch to lappend where this can be done
      without causing semantic changes.
      
      In ExecInitExprRec(), this results in aggregates and window functions that
      are in the same plan node being executed in a different order than before.
      Generally, the executions of such functions ought to be independent of
      each other, so this shouldn't result in visibly different query results.
      But if you push it, as one regression test case does, you can show that
      the order is different.  The new order seems saner; it's closer to
      the order of the functions in the query text.  And we never documented
      or promised anything about this, anyway.
      
      Also, in gistfinishsplit(), don't bother building a reverse-order list;
      it's easy now to iterate backwards through the original list.
      
      It'd be possible to go further towards removing uses of lcons and
      list_delete_first, but it'd require more extensive logic changes,
      and I'm not convinced it's worth it.  Most of the remaining uses
      deal with queues that probably never get long enough to be worth
      sweating over.  (Actually, I doubt that any of the changes in this
      patch will have measurable performance effects either.  But better
      to have good examples than bad ones in the code base.)
      
      Patch by me, thanks to David Rowley and Daniel Gustafsson for review.
      
      Discussion: https://postgr.es/m/21272.1563318411@sss.pgh.pa.us
      d97b714a
    • Thomas Munro's avatar
      Move some md.c-specific logic from smgr.c to md.c. · dfd0121d
      Thomas Munro authored
      Potential future SMGR implementations may not want to create
      tablespace directories when creating an SMGR relation.  Move that
      logic to mdcreate().  Move the initialization of md-specific
      data structures from smgropen() to a new callback mdopen().
      
      Author: Thomas Munro
      Reviewed-by: Shawn Debnath (as part of an earlier patch set)
      Discussion: https://postgr.es/m/CA%2BhUKG%2BOZqOiOuDm5tC5DyQZtJ3FH4%2BFSVMqtdC4P1atpJ%2Bqhg%40mail.gmail.com
      dfd0121d
  6. 16 Jul, 2019 7 commits
    • Tom Lane's avatar
      Fix thinko in construction of old_conpfeqop list. · 3093eb2b
      Tom Lane authored
      This should lappend the OIDs, not lcons them; the existing code produced
      a list in reversed order.  This is harmless for single-key FKs or FKs
      where all the key columns are of the same type, which probably explains
      how it went unnoticed.  But if those conditions are not met,
      ATAddForeignKeyConstraint would make the wrong decision about whether an
      existing FK needs to be revalidated.  I think it would almost always err
      in the safe direction by revalidating a constraint that didn't need it.
      You could imagine scenarios where the pfeqop check was fooled by
      swapping the types of two FK columns in one ALTER TABLE, but that case
      would probably be rejected by other tests, so it might be impossible to
      get to the worst-case scenario where an FK should be revalidated and
      isn't.  (And even then, it's likely to be fine, unless there are weird
      inconsistencies in the equality behavior of the replacement types.)
      However, this is a performance bug at least.
      
      Noted while poking around to see whether lcons calls could be converted
      to lappend.
      
      This bug is old, dating to commit cb3a7c2b, so back-patch to all
      supported branches.
      3093eb2b
    • Tom Lane's avatar
      Remove lappend_cell...() family of List functions. · c2457769
      Tom Lane authored
      It seems worth getting rid of these functions because they require the
      caller to retain a ListCell pointer into a List that it's modifying,
      which is a dangerous practice with the new List implementation.
      (The only other List-modifying function that takes a ListCell pointer
      as input is list_delete_cell, which nowadays is preferentially used
      via the constrained API foreach_delete_current.)
      
      There was only one remaining caller of these functions after commit
      2f5b8eb5, and that was some fairly ugly GEQO code that can be much
      more clearly expressed using a list-index variable and list_insert_nth.
      Hence, rewrite that code, and remove the functions.
      
      Discussion: https://postgr.es/m/26193.1563228600@sss.pgh.pa.us
      c2457769
    • Tom Lane's avatar
      Clean up some ad-hoc code for sorting and de-duplicating Lists. · 2f5b8eb5
      Tom Lane authored
      heap.c and relcache.c contained nearly identical copies of logic
      to insert OIDs into an OID list while preserving the list's OID
      ordering (and rejecting duplicates, in one case but not the other).
      
      The comments argue that this is faster than qsort for small numbers
      of OIDs, which is at best unproven, and seems even less likely to be
      true now that lappend_cell_oid has to move data around.  In any case
      it's ugly and hard-to-follow code, and if we do have a lot of OIDs
      to consider, it's O(N^2).
      
      Hence, replace with simply lappend'ing OIDs to a List, then list_sort
      the completed List, then remove adjacent duplicates if necessary.
      This is demonstrably O(N log N) and it's much simpler for the
      callers.  It's possible that this would be somewhat inefficient
      if there were a very large number of duplicates, but that seems
      unlikely in the existing usage.
      
      This adds list_deduplicate_oid and list_oid_cmp infrastructure
      to list.c.  I didn't bother with equivalent functionality for
      integer or pointer Lists, but such could always be added later
      if we find a use for it.
      
      Discussion: https://postgr.es/m/26193.1563228600@sss.pgh.pa.us
      2f5b8eb5
    • Tom Lane's avatar
      Redesign the API for list sorting (list_qsort becomes list_sort). · 569ed7f4
      Tom Lane authored
      In the wake of commit 1cff1b95, the obvious way to sort a List
      is to apply qsort() directly to the array of ListCells.  list_qsort
      was building an intermediate array of pointers-to-ListCells, which
      we no longer need, but getting rid of it forces an API change:
      the comparator functions need to do one less level of indirection.
      
      Since we're having to touch the callers anyway, let's do two additional
      changes: sort the given list in-place rather than making a copy (as
      none of the existing callers have any use for the copying behavior),
      and rename list_qsort to list_sort.  It was argued that the old name
      exposes more about the implementation than it should, which I find
      pretty questionable, but a better reason to rename it is to be sure
      we get the attention of any external callers about the need to fix
      their comparator functions.
      
      While we're at it, change four existing callers of qsort() to use
      list_sort instead; previously, they all had local reinventions
      of list_qsort, ie build-an-array-from-a-List-and-qsort-it.
      (There are some other places where changing to list_sort perhaps
      would be worthwhile, but they're less obviously wins.)
      
      Discussion: https://postgr.es/m/29361.1563220190@sss.pgh.pa.us
      569ed7f4
    • Michael Paquier's avatar
      Fix inconsistencies and typos in the tree · 0896ae56
      Michael Paquier authored
      This is numbered take 7, and addresses a set of issues around:
      - Fixes for typos and incorrect reference names.
      - Removal of unneeded comments.
      - Removal of unreferenced functions and structures.
      - Fixes regarding variable name consistency.
      
      Author: Alexander Lakhin
      Discussion: https://postgr.es/m/10bfd4ac-3e7c-40ab-2b2e-355ed15495e8@gmail.com
      0896ae56
    • Tom Lane's avatar
      Remove dead code. · 4c3d05d8
      Tom Lane authored
      These memory context switches are useless in the wake of commit
      1cff1b95.  Noted by Jesper Pedersen.
      
      Discussion: https://postgr.es/m/f078ce63-9e04-0f3e-d200-d7ee66279abe@redhat.com
      4c3d05d8
    • Bruce Momjian's avatar
      doc: mention pg_reload_conf() for reloading the config file · c6bce6eb
      Bruce Momjian authored
      Reported-by: Ian Barwick
      
      Discussion: https://postgr.es/m/538950ec-b86a-1650-6078-beb7091c09c2@2ndquadrant.com
      
      Backpatch-through: 9.4
      c6bce6eb
  7. 15 Jul, 2019 4 commits