1. 01 Jul, 2020 1 commit
    • David Rowley's avatar
      Further adjustments to Hashagg EXPLAIN ANALYZE output · 40efbf87
      David Rowley authored
      The "Disk Usage" and "HashAgg Batches" properties in the EXPLAIN ANALYZE
      output for HashAgg were previously only shown if the number of batches
      was greater than 0.  Here we change this so that these properties are
      always shown for EXPLAIN ANALYZE formats other than "text".  The idea here
      is that since the HashAgg could have spilled to disk if there had been
      more data or groups to aggregate, then it's relevant that we're clear in
      the EXPLAIN ANALYZE output when no spilling occurred in this particular
      execution of the given plan.
      
      For the "text" EXPLAIN format, we still hide these properties when no
      spilling occurs.  This EXPLAIN format is designed to be easy for humans
      to read.  To maintain the readability we have a higher threshold for which
      properties we display for this format.
      
      Discussion: https://postgr.es/m/CAApHDvo_dmNozQQTmN-2jGp1vT%3Ddxx7Q0vd%2BMvD1cGpv2HU%3DSg%40mail.gmail.com
      Backpatch-through: 13, where the hashagg spilling code was added.
      40efbf87
  2. 30 Jun, 2020 5 commits
  3. 29 Jun, 2020 8 commits
    • Peter Eisentraut's avatar
      pgstattuple: Have pgstattuple_approx accept TOAST tables · ee0202d5
      Peter Eisentraut authored
      TOAST tables have a visibility map and a free space map, so they can
      be supported by pgstattuple_approx just fine.
      
      Add test cases to show how various pgstattuple functions accept TOAST
      tables.  Also add similar tests to pg_visibility, which already
      accepted TOAST tables correctly but had no test coverage for them.
      Reviewed-by: default avatarLaurenz Albe <laurenz.albe@cybertec.at>
      Discussion: https://www.postgresql.org/message-id/flat/27c4496a-02b9-dc87-8f6f-bddbef54e0fe@2ndquadrant.com
      ee0202d5
    • Tom Lane's avatar
      Remove support for timezone "posixrules" file. · ea57e531
      Tom Lane authored
      The IANA tzcode library has a feature to read a time zone file named
      "posixrules" and apply the daylight-savings transition dates and times
      therein, when it is given a POSIX-style time zone specification that
      lacks an explicit transition rule.  However, there's a problem with
      that code: it doesn't work for dates past the Y2038 time_t rollover.
      (Effectively, all times beyond that point are treated as standard
      time.)  The IANA crew regard this feature as legacy, so their plan is
      to remove it not fix it.  The time frame in which that will happen
      is unclear, but presumably it'll happen well before 2038.
      
      Moreover, effective with the next IANA data update (probably this
      fall), the recommended default will be to not install a "posixrules"
      file in the first place.  The time frame in which tzdata packagers
      might adopt that suggestion is likewise unclear, but at least some
      platforms will probably do it in the next year or so.  While we could
      ignore that recommendation so far as PG-supplied tzdata trees are
      concerned, builds using --with-system-tzdata will be subject to
      whatever the platform's tzdata packager decides to do.
      
      Thus, whether or not we do anything, some increasing fraction of
      Postgres users will be exposed to the behavior observed when there
      is no "posixrules" file; and if we do nothing, we'll have essentially
      no control over the timing of that change.
      
      The best thing to do to ameliorate the uncertainty seems to be to
      proactively remove the posixrules-reading feature.  If we do that in
      a scheduled release then at least we can release-note the behavioral
      change, rather than having users be surprised by it after a routine
      tzdata update.
      
      The change in question is fairly minor anyway: to be affected,
      you have to be using a POSIX-style timezone spec, it has to not
      have an explicit rule, and it has to not be one of the four traditional
      continental-USA zone names (EST5EDT, CST6CDT, MST7MDT, or PST8PDT),
      as those are special-cased.  Since the default "posixrules" file
      provides USA DST rules, the number of people who are likely to find
      such a zone spec useful is probably quite small.  Moreover, the
      fallback behavior with no explicit rule and no "posixrules" file is to
      apply current USA rules, so the only thing that really breaks is the
      DST transitions in years before 2007 (and you get the countervailing
      fix that transitions after 2038 will be applied).
      
      Now, some installations might have replaced the "posixrules" file,
      allowing e.g. EU rules to be applied to a POSIX-style timezone spec.
      That won't work anymore.  But it's not exactly clear why this solution
      would be preferable to using a regular named zone.  In any case, given
      the Y2038 issue, we need to be pushing users to stop depending on this.
      
      Back-patch into v13; it hasn't been released yet, so it seems OK to
      change its behavior.  (Personally I think we ought to back-patch
      further, but I've been outvoted.)
      
      Discussion: https://postgr.es/m/1390.1562258309@sss.pgh.pa.us
      Discussion: https://postgr.es/m/20200621211855.6211-1-eggert@cs.ucla.edu
      ea57e531
    • Tom Lane's avatar
      Mop up some no-longer-necessary hacks around printf %.*s format. · c410af09
      Tom Lane authored
      Commit 54cd4f04 added some kluges to work around an old glibc bug,
      namely that %.*s could misbehave if glibc thought any characters in
      the supplied string were incorrectly encoded.  Now that we use our
      own snprintf.c implementation, we need not worry about that bug (even
      if it still exists in the wild).  Revert a couple of particularly
      ugly hacks, and remove or improve assorted comments.
      
      Note that there can still be encoding-related hazards here: blindly
      clipping at a fixed length risks producing wrongly-encoded output
      if the clip splits a multibyte character.  However, code that's
      doing correct multibyte-aware clipping doesn't really need a comment
      about that, while code that isn't needs an explanation why not,
      rather than a red-herring comment about an obsolete bug.
      
      Discussion: https://postgr.es/m/279428.1593373684@sss.pgh.pa.us
      c410af09
    • Peter Geoghegan's avatar
      nbtree: Correct inaccurate split location comment. · f7a476f0
      Peter Geoghegan authored
      Minor oversight in commit fab25024.
      f7a476f0
    • Tom Lane's avatar
      Avoid using %c printf format for potentially non-ASCII characters. · 16e3ad5d
      Tom Lane authored
      Since %c only passes a C "char" to printf, it's incapable of dealing
      with multibyte characters.  Passing just the first byte of such a
      character leads to an output string that is visibly not correctly
      encoded, resulting in undesirable behavior such as encoding conversion
      failures while sending error messages to clients.
      
      We've lived with this issue for a long time because it was inconvenient
      to avoid in a portable fashion.  However, now that we always use our own
      snprintf code, it's reasonable to use the %.*s format to print just one
      possibly-multibyte character in a string.  (We previously avoided that
      obvious-looking answer in order to work around glibc's bug #6530, cf
      commits 54cd4f04 and ed437e2b.)
      
      Hence, run around and fix a bunch of places that used %c to report
      a character found in a user-supplied string.  For simplicity, I did
      not touch places that were emitting non-user-facing debug messages,
      or reporting catalog data that should always be ASCII.  (It's also
      unclear how useful this approach could be in frontend code, where
      it's less certain that we know what encoding we're dealing with.)
      
      In passing, improve a couple of poorly-written error messages in
      pageinspect/heapfuncs.c.
      
      This is a longstanding issue, but I'm hesitant to back-patch because
      of the impact on translatable message strings.  In any case this fix
      would not work reliably before v12.
      
      Tom Lane and Quan Zongliang
      
      Discussion: https://postgr.es/m/a120087c-4c88-d9d4-1ec5-808d7a7f133d@gmail.com
      16e3ad5d
    • Peter Eisentraut's avatar
      Add current substring regular expression syntax · 78c88767
      Peter Eisentraut authored
      SQL:1999 had syntax
      
          SUBSTRING(text FROM pattern FOR escapechar)
      
      but this was replaced in SQL:2003 by the more clear
      
          SUBSTRING(text SIMILAR pattern ESCAPE escapechar)
      
      but this was never implemented in PostgreSQL.  This patch adds that
      new syntax as an alternative in the parser, and updates documentation
      and tests to indicate that this is the preferred alternative now.
      Reviewed-by: default avatarPavel Stehule <pavel.stehule@gmail.com>
      Reviewed-by: default avatarVik Fearing <vik@postgresfriends.org>
      Reviewed-by: default avatarFabien COELHO <coelho@cri.ensmp.fr>
      Discussion: https://www.postgresql.org/message-id/flat/a15db31c-d0f8-8ce0-9039-578a31758adb%402ndquadrant.com
      78c88767
    • Peter Eisentraut's avatar
      Clean up grammar a bit · aafefb4d
      Peter Eisentraut authored
      Simplify the grammar specification of substring() and overlay() a bit,
      simplify and update some comments.
      Reviewed-by: default avatarPavel Stehule <pavel.stehule@gmail.com>
      Reviewed-by: default avatarVik Fearing <vik@postgresfriends.org>
      Reviewed-by: default avatarFabien COELHO <coelho@cri.ensmp.fr>
      Discussion: https://www.postgresql.org/message-id/flat/a15db31c-d0f8-8ce0-9039-578a31758adb%402ndquadrant.com
      aafefb4d
    • Michael Paquier's avatar
      Refactor ObjectAddress field assignments for type dependencies · 68de1440
      Michael Paquier authored
      The logic used to build the set of dependencies needed for a type is
      rather repetitive with direct assignments for each ObjectAddress field.
      This refactors the code to use the macro ObjectAddressSet() instead, to
      do the same work.  There are more areas of the backend code that could
      use this macro, but these are left for a follow-up patch that will
      partially rework the way dependencies are recorded as well.  Type
      dependencies are left out of the follow-up patch, so they are refactored
      separately here.
      
      Extracted from a larger patch by the same author.
      
      Author: Daniel Gustafsson
      Discussion: https://potgr.es/m/20190213182737.mxn6hkdxwrzgxk35@alap3.anarazel.de
      68de1440
  4. 28 Jun, 2020 1 commit
    • Noah Misch's avatar
      Fix documentation of "must be vacuumed within" warning. · 96879a0e
      Noah Misch authored
      Warnings start 10M transactions before xidStopLimit, which is 11M
      transactions before wraparound.  The sample WARNING output showed a
      value greater than 11M, and its HINT message predated commit
      25ec228e.  Hence, the sample was
      impossible.  Back-patch to 9.5 (all supported versions).
      96879a0e
  5. 27 Jun, 2020 5 commits
  6. 26 Jun, 2020 1 commit
  7. 25 Jun, 2020 5 commits
  8. 24 Jun, 2020 5 commits
  9. 22 Jun, 2020 4 commits
    • Jeff Davis's avatar
      Doc fixup for hashagg_avoid_disk_plan GUC. · 7ce46156
      Jeff Davis authored
      Reported-by: Justin Pryzby
      Discussion: https://postgr.es/m/20200620220402.GZ17995@telsasoft.com
      Backport-through: 13
      7ce46156
    • Tom Lane's avatar
      Undo double-quoting of index names in non-text EXPLAIN output formats. · 63d2ac23
      Tom Lane authored
      explain_get_index_name() applied quote_identifier() to the index name.
      This is fine for text output, but the non-text output formats all have
      their own quoting conventions and would much rather start from the
      actual index name.  For example in JSON you'd get something like
      
             "Index Name": "\"My Index\"",
      
      which is surely not desirable, especially when the same does not
      happen for table names.  Hence, move the responsibility for applying
      quoting out to the callers, where it can go into already-existing
      special code paths for text format.
      
      This changes the API spec for users of explain_get_index_name_hook:
      before, they were supposed to apply quote_identifier() if necessary,
      now they should not.  Research suggests that the only publicly
      available user of the hook is hypopg, and it actually forgot to
      apply quoting anyway, so it's fine.  (In any case, there's no
      behavioral change for the output of a hook as seen in non-text
      EXPLAIN formats, so this won't break any case that programs should
      be relying on.)
      
      Digging in the commit logs, it appears that quoting was included in
      explain_get_index_name's duties when commit 604ffd28 invented it;
      and that was fine at the time because we only had text output format.
      This should have been rethought when non-text formats were invented,
      but it wasn't.
      
      This is a fairly clear bug for users of non-text EXPLAIN formats,
      so back-patch to all supported branches.
      
      Per bug #16502 from Maciek Sakrejda.  Patch by me (based on
      investigation by Euler Taveira); thanks to Julien Rouhaud for review.
      
      Discussion: https://postgr.es/m/16502-57bd1c9f913ed1d1@postgresql.org
      63d2ac23
    • Michael Paquier's avatar
      Fix inconsistent markups in catalogs.sgml · fe186b4c
      Michael Paquier authored
      Some fields related to pg_opclass and pg_opfamily were using incorrect
      markups, listing them as structname instead of structfield.
      
      Author: Fabien Coelho
      Discussion: https://postgr.es/m/alpine.DEB.2.22.394.2006210903560.859381@pseudo
      fe186b4c
    • Michael Paquier's avatar
      Add --no-index-cleanup and --no-truncate to vacuumdb. · 9550ea30
      Michael Paquier authored
      Both INDEX_CLEANUP and TRUNCATE have been available since v12, and are
      enabled by default except if respectively vacuum_index_cleanup and
      vacuum_truncate are disabled for a given relation.  This change adds
      support for disabling these options from vacuumdb.
      
      Author: Nathan Bossart
      Reviewed-by: Michael Paquier, Masahiko Sawada
      Discussion: https://postgr.es/m/6F7F17EF-B1F2-4681-8D03-BA96365717C0@amazon.com
      9550ea30
  10. 21 Jun, 2020 2 commits
  11. 20 Jun, 2020 3 commits