1. 03 May, 2017 4 commits
    • Alvaro Herrera's avatar
      pg_dump/t/002: append terminating semicolon to SQL commands · 698923d6
      Alvaro Herrera authored
      It's easy to overlook the need for one, and its lack is annoying for the
      next developer wanting to create a new test.  Rather than expect every
      individual command to add the semicolon, just append one automatically.
      
      Discussion: http://postgr.es/m/20170503172746.rwftidszir67sgk7@alvherre.pgsql
      698923d6
    • Heikki Linnakangas's avatar
      Add PQencryptPasswordConn function to libpq, use it in psql and createuser. · 8f8b9be5
      Heikki Linnakangas authored
      The new function supports creating SCRAM verifiers, in addition to md5
      hashes. The algorithm is chosen based on password_encryption, by default.
      
      This fixes the issue reported by Jeff Janes, that there was previously
      no way to create a SCRAM verifier with "\password".
      
      Michael Paquier and me
      
      Discussion: https://www.postgresql.org/message-id/CAMkU%3D1wfBgFPbfAMYZQE78p%3DVhZX7nN86aWkp0QcCp%3D%2BKxZ%3Dbg%40mail.gmail.com
      8f8b9be5
    • Tom Lane's avatar
      Improve performance of timezone loading, especially pg_timezone_names view. · af2c5aa8
      Tom Lane authored
      tzparse() would attempt to load the "posixrules" timezone database file on
      each call.  That might seem like it would only be an issue when selecting a
      POSIX-style zone name rather than a zone defined in the timezone database,
      but it turns out that each zone definition file contains a POSIX-style zone
      string and tzload() will call tzparse() to parse that.  Thus, when scanning
      the whole timezone file tree as we do in the pg_timezone_names view,
      "posixrules" was read repetitively for each zone definition file.  Fix
      that by caching the file on first use within any given process.  (We cache
      other zone definitions for the life of the process, so there seems little
      reason not to cache this one as well.)  This probably won't help much in
      processes that never run pg_timezone_names, but even one additional SET
      of the timezone GUC would come out ahead.
      
      An even worse problem for pg_timezone_names is that pg_open_tzfile()
      has an inefficient way of identifying the canonical case of a zone name:
      it basically re-descends the directory tree to the zone file.  That's not
      awful for an individual "SET timezone" operation, but it's pretty horrid
      when we're inspecting every zone in the database.  And it's pointless too
      because we already know the canonical spelling, having just read it from
      the filesystem.  Fix by teaching pg_open_tzfile() to avoid the directory
      search if it's not asked for the canonical name, and backfilling the
      proper result in pg_tzenumerate_next().
      
      In combination these changes seem to make the pg_timezone_names view
      about 3x faster to read, for me.  Since a scan of pg_timezone_names
      has up to now been one of the slowest queries in the regression tests,
      this should help some little bit for buildfarm cycle times.
      
      Back-patch to all supported branches, not so much because it's likely
      that users will care much about the view's performance as because
      tracking changes in the upstream IANA timezone code is really painful
      if we don't keep all the branches in sync.
      
      Discussion: https://postgr.es/m/27962.1493671706@sss.pgh.pa.us
      af2c5aa8
    • Tom Lane's avatar
      Remove create_singleton_array(), hard-coding the case in its sole caller. · 23c6eb03
      Tom Lane authored
      create_singleton_array() was not really as useful as we perhaps thought
      when we added it.  It had never accreted more than one call site, and is
      only saving a dozen lines of code at that one, which is considerably less
      bulk than the function itself.  Moreover, because of its insistence on
      using the caller's fn_extra cache space, it's arguably a coding hazard.
      text_to_array_internal() does not currently use fn_extra in any other way,
      but if it did it would be subtly broken, since the conflicting fn_extra
      uses could be needed within a single query, in the seldom-tested case that
      the field separator varies during the query.  The same objection seems
      likely to apply to any other potential caller.
      
      The replacement code is a bit uglier, because it hardwires knowledge of
      the storage parameters of type TEXT, but it's not like we haven't got
      dozens or hundreds of other places that do the same.  Uglier seems like
      a good tradeoff for smaller, faster, and safer.
      
      Per discussion with Neha Khatri.
      
      Discussion: https://postgr.es/m/CAFO0U+_fS5SRhzq6uPG+4fbERhoA9N2+nPrtvaC9mmeWivxbsA@mail.gmail.com
      23c6eb03
  2. 02 May, 2017 10 commits
  3. 01 May, 2017 8 commits
    • Tom Lane's avatar
      Improve function header comment for create_singleton_array(). · 54affb41
      Tom Lane authored
      Mentioning the caller is neither future-proof nor an adequate substitute
      for giving an API specification.  Per gripe from Neha Khatri, though
      I changed the patch around some.
      
      Discussion: https://postgr.es/m/CAFO0U+_fS5SRhzq6uPG+4fbERhoA9N2+nPrtvaC9mmeWivxbsA@mail.gmail.com
      54affb41
    • Tom Lane's avatar
      Reduce semijoins with unique inner relations to plain inner joins. · 92a43e48
      Tom Lane authored
      If the inner relation can be proven unique, that is it can have no more
      than one matching row for any row of the outer query, then we might as
      well implement the semijoin as a plain inner join, allowing substantially
      more freedom to the planner.  This is a form of outer join strength
      reduction, but it can't be implemented in reduce_outer_joins() because
      we don't have enough info about the individual relations at that stage.
      Instead do it much like remove_useless_joins(): once we've built base
      relations, we can make another pass over the SpecialJoinInfo list and
      get rid of any entries representing reducible semijoins.
      
      This is essentially a followon to the inner-unique patch (commit 9c7f5229)
      and makes use of the proof machinery that that patch created.  We need only
      minor refactoring of innerrel_is_unique's API to support this usage.
      
      Per performance complaint from Teodor Sigaev.
      
      Discussion: https://postgr.es/m/f994fc98-389f-4a46-d1bc-c42e05cb43ed@sigaev.ru
      92a43e48
    • Tom Lane's avatar
      Fix mis-optimization of semijoins with more than one LHS relation. · 2057a58d
      Tom Lane authored
      The inner-unique patch (commit 9c7f5229) supposed that if we're
      considering a JOIN_UNIQUE_INNER join path, we can always set inner_unique
      for the join, because the inner path produced by create_unique_path should
      be unique relative to the outer relation.  However, that's true only if
      we're considering joining to the whole outer relation --- otherwise we may
      be applying only some of the join quals, and so the inner path might be
      non-unique from the perspective of this join.  Adjust the test to only
      believe that we can set inner_unique if we have the whole semijoin LHS on
      the outer side.
      
      There is more that can be done in this area, but this commit is only
      intended to provide the minimal fix needed to get correct plans.
      
      Per report from Teodor Sigaev.  Thanks to David Rowley for preliminary
      investigation.
      
      Discussion: https://postgr.es/m/f994fc98-389f-4a46-d1bc-c42e05cb43ed@sigaev.ru
      2057a58d
    • Tom Lane's avatar
      Update time zone data files to tzdata release 2017b. · 74a20d0a
      Tom Lane authored
      DST law changes in Chile, Haiti, and Mongolia.  Historical corrections for
      Ecuador, Kazakhstan, Liberia, and Spain.
      
      The IANA crew continue their campaign to replace invented time zone
      abbrevations with numeric GMT offsets.  This update changes numerous zones
      in South America, the Pacific and Indian oceans, and some Asian and Middle
      Eastern zones.  I kept these abbreviations in the tznames/ data files,
      however, so that we will still accept them for input.  (We may want to
      start trimming those files someday, but I think we should wait for the
      upstream dust to settle before deciding what to do.)
      
      In passing, add MESZ (Mitteleuropaeische Sommerzeit) to the tznames lists;
      since we accept MEZ (Mitteleuropaeische Zeit) it seems rather strange not
      to take the other one.  And fix some incorrect, or at least obsolete,
      comments that certain abbreviations are not traceable to the IANA data.
      74a20d0a
    • Robert Haas's avatar
      libpq: Fix inadvertent change in .pgpass lookup behavior. · bdac9836
      Robert Haas authored
      Commit 274bb2b3 caused password file
      lookups to use the hostaddr in preference to the host, but that was
      not intended and the documented behavior is the opposite.
      
      Report and patch by Kyotaro Horiguchi.
      
      Discussion: http://postgr.es/m/20170428.165432.60857995.horiguchi.kyotaro@lab.ntt.co.jp
      bdac9836
    • Andrew Dunstan's avatar
      Allow vcregress.pl to run an arbitrary TAP test set · fed6df48
      Andrew Dunstan authored
      Currently only provision for running the bin checks in a single step is
      provided for. Now these tests can be run individually, as well as tests
      in other locations (e.g. src.test/recover).
      
      Also provide for suppressing unnecessary temp installs by setting the
      NO_TEMP_INSTALL environment variable just as the Makefiles do.
      
      Backpatch to 9.4.
      fed6df48
    • Peter Eisentraut's avatar
      Fix logical replication launcher wake up and reset · 9414e41e
      Peter Eisentraut authored
      After the logical replication launcher was told to wake up at
      commit (for example, by a CREATE SUBSCRIPTION command), the flag to wake
      up was not reset, so it would be woken up at every following commit as
      well.  So fix that by resetting the flag.
      
      Also, we don't need to wake up anything if the transaction was rolled
      back.  Just reset the flag in that case.
      
      Author: Masahiko Sawada <sawada.mshk@gmail.com>
      Reported-by: default avatarFujii Masao <masao.fujii@gmail.com>
      9414e41e
    • Robert Haas's avatar
      Fire per-statement triggers on partitioned tables. · e180c8aa
      Robert Haas authored
      Even though no actual tuples are ever inserted into a partitioned
      table (the actual tuples are in the partitions, not the partitioned
      table itself), we still need to have a ResultRelInfo for the
      partitioned table, or per-statement triggers won't get fired.
      
      Amit Langote, per a report from Rajkumar Raghuwanshi.  Reviewed by me.
      
      Discussion: http://postgr.es/m/CAKcux6%3DwYospCRY2J4XEFuVy0L41S%3Dfic7rmkbsU-GXhhSbmBg%40mail.gmail.com
      e180c8aa
  4. 30 Apr, 2017 3 commits
    • Tom Lane's avatar
      Sync our copy of the timezone library with IANA release tzcode2017b. · e18b2c48
      Tom Lane authored
      zic no longer mishandles some transitions in January 2038 when it
      attempts to work around Qt bug 53071.  This fixes a bug affecting
      Pacific/Tongatapu that was introduced in zic 2016e.  localtime.c
      now contains a workaround, useful when loading a file generated by
      a buggy zic.
      
      There are assorted cosmetic changes as well, notably relocation
      of a bunch of #defines.
      e18b2c48
    • Tom Lane's avatar
      Fix possible null pointer dereference or invalid warning message. · 12d11432
      Tom Lane authored
      Thinko in commit de438971: this warning message references the wrong
      "LogicalRepWorker *" variable.  This would often result in a core dump,
      but if it didn't, the message would show the wrong subscription OID.
      
      In passing, adjust the message text to format a subscription OID
      similarly to how that's done elsewhere in the function; and fix
      grammatical issues in some nearby messages.
      
      Per Coverity testing.
      12d11432
    • Tom Lane's avatar
      Micro-optimize some slower queries in the opr_sanity regression test. · c2384421
      Tom Lane authored
      Convert the binary_coercible() and physically_coercible() functions from
      SQL to plpgsql.  It's not that plpgsql is inherently better at doing
      queries; if you simply convert the previous single SQL query into one
      RETURN expression, it's no faster.  The problem with the existing code
      is that it fools the plancache into deciding that it's worth re-planning
      the query every time, since constant-folding with a concrete value for $2
      allows elimination of at least one sub-SELECT.  In reality that's using the
      planner to do the equivalent of a few runtime boolean tests, causing the
      function to run much slower than it should.  Splitting the AND/OR logic
      into separate plpgsql statements allows each if-expression to acquire a
      static plan.
      
      Also, get rid of some uses of obj_description() in favor of explicitly
      joining to pg_description, allowing the joins to be optimized better.
      (Someday we might improve the SQL-function-inlining logic enough that
      this happens automatically, but today is not that day.)
      
      Together, these changes reduce the runtime of the opr_sanity regression
      test by about a factor of two on one of my slower machines.  They don't
      seem to help as much on a fast machine, but this should at least benefit
      the buildfarm.
      c2384421
  5. 28 Apr, 2017 9 commits
  6. 27 Apr, 2017 6 commits
    • Andres Freund's avatar
      Don't build full initial logical decoding snapshot if NOEXPORT_SNAPSHOT. · ab9c4338
      Andres Freund authored
      Earlier commits (56e19d93 and 2bef06d5) make it cheaper to
      create a logical slot if not exporting the initial snapshot.  If
      NOEXPORT_SNAPSHOT is specified, we can skip the overhead, not just
      when creating a slot via sql (which can't export snapshots).  As
      NOEXPORT_SNAPSHOT has only recently been introduced, this shouldn't be
      backpatched.
      ab9c4338
    • Andres Freund's avatar
      Don't use on-disk snapshots for exported logical decoding snapshot. · 56e19d93
      Andres Freund authored
      Logical decoding stores historical snapshots on disk, so that logical
      decoding can restart without having to reconstruct a snapshot from
      scratch (for which the resources are not guaranteed to be present
      anymore).  These serialized snapshots were also used when creating a
      new slot via the walsender interface, which can export a "full"
      snapshot (i.e. one that can read all tables, not just catalog ones).
      
      The problem is that the serialized snapshots are only useful for
      catalogs and not for normal user tables.  Thus the use of such a
      serialized snapshot could result in an inconsistent snapshot being
      exported, which could lead to queries returning wrong data.  This
      would only happen if logical slots are created while another logical
      slot already exists.
      
      Author: Petr Jelinek
      Reviewed-By: Andres Freund
      Discussion: https://postgr.es/m/f37e975c-908f-858e-707f-058d3b1eb214@2ndquadrant.com
      Backport: 9.4, where logical decoding was introduced.
      56e19d93
    • Tom Lane's avatar
      Avoid slow shutdown of pg_basebackup. · 7834d20b
      Tom Lane authored
      pg_basebackup's child process did not pay any attention to the pipe
      from its parent while waiting for input from the source server.
      If no server data was arriving, it would only wake up and check the
      pipe every standby_message_timeout or so.  This creates a problem
      since the parent process might determine and send the desired stop
      position only after the server has reached end-of-WAL and stopped
      sending data.  In the src/test/recovery regression tests, the timing
      is repeatably such that it takes nearly 10 seconds for the child
      process to realize that it should shut down.  It's not clear how
      often that would happen in real-world cases, but it sure seems like
      a bug --- and if the user turns off standby_message_timeout or sets
      it very large, the delay could be a lot worse.
      
      To fix, expand the StreamCtl API to allow the pipe input FD to be
      passed down to the low-level wait routine, and watch both sockets
      when sleeping.
      
      (Note: AFAICS this issue doesn't affect the Windows port, since
      it doesn't rely on a pipe to transfer the stop position to the
      child thread.)
      
      Discussion: https://postgr.es/m/6456.1493263884@sss.pgh.pa.us
      7834d20b
    • Fujii Masao's avatar
      Fix bug so logical rep launcher saves correctly time of last startup of worker. · 9f11fcec
      Fujii Masao authored
      Previously the logical replication launcher stored the last timestamp
      when it started the worker, in the local variable "last_start_time",
      in order to check whether wal_retrive_retry_interval elapsed since
      the last startup of worker. If it has elapsed, the launcher sees
      pg_subscription and starts new worker if necessary. This is for
      limitting the startup of worker to once a wal_retrieve_retry_interval.
      
      The bug was that the variable "last_start_time" was defined and
      always initialized with 0 at the beginning of the launcher's main loop.
      So even if it's set to the last timestamp in later phase of the loop,
      it's always reset to 0. Therefore the launcher could not check
      correctly whether wal_retrieve_retry_interval elapsed since
      the last startup.
      
      This patch moves the variable "last_start_time" outside the main loop
      so that it will not be reset.
      
      Reviewed-by: Petr Jelinek
      Discussion: http://postgr.es/m/CAHGQGwGJrPO++XM4mFENAwpy1eGXKsGdguYv43GUgLgU-x8nTQ@mail.gmail.com
      9f11fcec
    • Tom Lane's avatar
      Cope with glibc too old to have epoll_create1(). · 82ebbeb0
      Tom Lane authored
      Commit fa31b6f4 supposed that we didn't have to worry about that
      anymore, but it seems that RHEL5 is like that, and that's still
      a supported platform.  Put back the prior coding under an #ifdef,
      adding an explicit fcntl() to retain the desired CLOEXEC property.
      
      Discussion: https://postgr.es/m/12307.1493325329@sss.pgh.pa.us
      82ebbeb0
    • Andres Freund's avatar
      Preserve required !catalog tuples while computing initial decoding snapshot. · 2bef06d5
      Andres Freund authored
      The logical decoding machinery already preserved all the required
      catalog tuples, which is sufficient in the course of normal logical
      decoding, but did not guarantee that non-catalog tuples were preserved
      during computation of the initial snapshot when creating a slot over
      the replication protocol.
      
      This could cause a corrupted initial snapshot being exported.  The
      time window for issues is usually not terribly large, but on a busy
      server it's perfectly possible to it hit it.  Ongoing decoding is not
      affected by this bug.
      
      To avoid increased overhead for the SQL API, only retain additional
      tuples when a logical slot is being created over the replication
      protocol.  To do so this commit changes the signature of
      CreateInitDecodingContext(), but it seems unlikely that it's being
      used in an extension, so that's probably ok.
      
      In a drive-by fix, fix handling of
      ReplicationSlotsComputeRequiredXmin's already_locked argument, which
      should only apply to ProcArrayLock, not ReplicationSlotControlLock.
      
      Reported-By: Erik Rijkers
      Analyzed-By: Petr Jelinek
      Author: Petr Jelinek, heavily editorialized by Andres Freund
      Reviewed-By: Andres Freund
      Discussion: https://postgr.es/m/9a897b86-46e1-9915-ee4c-da02e4ff6a95@2ndquadrant.com
      Backport: 9.4, where logical decoding was introduced.
      2bef06d5