1. 04 Apr, 2020 4 commits
    • Add infrastructure to track WAL usage. · df3b1814
      Amit Kapila authored
      This allows gathering the WAL generation statistics for each statement
      execution.  The three statistics that we collect are the number of WAL
      records, the number of full page writes and the amount of WAL bytes
      generated.
      
      This helps the users who have write-intensive workload to see the impact
      of I/O due to WAL.  This further enables us to see approximately what
      percentage of overall WAL is due to full page writes.
      
      In the future, we can extend this functionality to allow us to compute
      the exact amount of WAL data due to full page writes.
      
      This patch in itself is just an infrastructure to compute WAL usage data.
      The upcoming patches will expose this data via explain, auto_explain,
      pg_stat_statements and verbose (auto)vacuum output.
      
      Author: Kirill Bychik, Julien Rouhaud
      Reviewed-by: Dilip Kumar, Fujii Masao and Amit Kapila
      Discussion: https://postgr.es/m/CAB-hujrP8ZfUkvL5OYETipQwA=e3n7oqHFU=4ZLxWS_Cza3kQQ@mail.gmail.com
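
Illustrative sketch (not part of the commit): the three counters described above can be modelled as a small record that is snapshotted before a statement and diffed afterwards. The class name, field names, and the 8 kB block size are assumptions made for this example. The share estimate treats every full page write as a whole block, so it is only a rough upper bound.

```python
from dataclasses import dataclass

BLOCK_SIZE = 8192  # assumed page size in bytes (hypothetical constant for this sketch)


@dataclass
class WalUsage:
    """Hypothetical counters mirroring the three statistics named in the commit."""
    wal_records: int = 0       # number of WAL records
    full_page_writes: int = 0  # number of full page writes
    wal_bytes: int = 0         # amount of WAL generated, in bytes

    def diff(self, start: "WalUsage") -> "WalUsage":
        """Usage accumulated since the snapshot `start`, e.g. for one statement."""
        return WalUsage(
            self.wal_records - start.wal_records,
            self.full_page_writes - start.full_page_writes,
            self.wal_bytes - start.wal_bytes,
        )

    def approx_fpw_share(self) -> float:
        """Rough upper-bound fraction of WAL bytes attributable to full page writes."""
        if self.wal_bytes == 0:
            return 0.0
        return min(1.0, self.full_page_writes * BLOCK_SIZE / self.wal_bytes)


# Snapshot before and after a (made-up) statement, then take the difference.
before = WalUsage(wal_records=100, full_page_writes=2, wal_bytes=20_000)
after = WalUsage(wal_records=160, full_page_writes=5, wal_bytes=70_000)
stmt = after.diff(before)
print(stmt, round(stmt.approx_fpw_share(), 3))
```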
    • Include chunk overhead in hash table entry size estimate. · 0588ee63
      Jeff Davis authored
      Don't try to be precise about it, just use a constant 16 bytes of
      chunk overhead. Being smarter would require knowing the memory context
      where the chunk will be allocated, which is not known by all callers.
      
      Discussion: https://postgr.es/m/20200325220936.il3ni2fj2j2b45y5@alap3.anarazel.de
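
Illustrative sketch (not part of the commit): one way to read "a constant 16 bytes of chunk overhead" is as a fixed term added to the aligned entry size. The 8-byte alignment and the split into header and payload sizes are assumptions for this example, not details taken from the patch.

```python
CHUNK_OVERHEAD = 16  # constant per-chunk overhead in bytes, per the commit message
ALIGNMENT = 8        # assumed maximum alignment (hypothetical for this sketch)


def max_align(size: int) -> int:
    """Round size up to the next multiple of ALIGNMENT."""
    return (size + ALIGNMENT - 1) // ALIGNMENT * ALIGNMENT


def estimate_entry_size(header_bytes: int, payload_bytes: int) -> int:
    """Estimated memory for one hash table entry, including chunk overhead."""
    return max_align(header_bytes) + max_align(payload_bytes) + CHUNK_OVERHEAD


print(estimate_entry_size(24, 37))
```

The constant keeps the estimate independent of the memory context, which matches the stated reason for not being more precise.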
    • Fix resource management bug with replication=database. · 3e0d80fd
      Robert Haas authored
      Commit 0d8c9c12 allowed BASE_BACKUP to
      acquire a ResourceOwner without a transaction so that the backup
      manifest functionality could use a BufFile, but it overlooked the fact
      that when a walsender is used with replication=database, it might have
      a transaction in progress, because in that mode, SQL and replication
      commands can be mixed.  Try to fix things up so that the two cleanup
      mechanisms don't conflict.
      
      Per buildfarm member serinus, which triggered the problem when
      CREATE_REPLICATION_SLOT failed from inside a transaction.  It passed
      on the subsequent run, so evidently the failure doesn't happen every
      time.
    • Be more careful about time_t vs. pg_time_t in basebackup.c. · db1531ca
      Robert Haas authored
      lapwing is complaining about a call to pg_gmtime, saying that
      it "expected 'const pg_time_t *' but argument is of type 'time_t *'".
      I at first thought that the problem had something to do with const,
      but Thomas Munro suggested that it might be just because time_t
      and pg_time_t are different identifiers. lapwing is i686 rather than
      x86_64, and pg_time_t is always int64, so that seems like a good
      guess.
      
      There is other code that just casts time_t to pg_time_t without
      any conversion function, so try that approach here.
      
      Introduced in commit 0d8c9c12.
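
Illustrative sketch (not part of the commit): assuming a platform where time_t is 32 bits wide while pg_time_t is 64 bits, reading a 32-bit value through a 64-bit pointer picks up unrelated neighbouring bytes, whereas converting the value itself is safe. The byte layout below is invented for the demonstration.

```python
import struct

t32 = 1585958400  # a timestamp that fits in a signed 32-bit integer

# Memory holding a 32-bit time_t followed by four unrelated bytes (made up).
memory = struct.pack("<iI", t32, 0xDEADBEEF)

# Reinterpreting the same address as a 64-bit integer reads all eight bytes.
(misread,) = struct.unpack("<q", memory)

# Reading the 32-bit value and widening it preserves the timestamp.
(value,) = struct.unpack_from("<i", memory)
widened = int(value)

print(misread == t32, widened == t32)
```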
  2. 03 Apr, 2020 19 commits
  3. 02 Apr, 2020 14 commits
    • Improve stability fix for partition_aggregate test. · 7cb0a423
      Tom Lane authored
      Instead of disabling autovacuum on these test tables, adjust the
      partition boundaries so that the child partitions are not all the
      same size.  That should cause the planner to use a predictable
      ordering of the per-partition scan nodes even in cases where
      autovacuum causes the rowcount estimates to be off a bit.
      Moreover, this also lets these tests show that the planner does
      properly order the tables in descending size order, something
      that wasn't being proven before.
      
      The pagg_tab1 and pagg_tab2 partitions are still all the same
      size, but that should be fine, because those tables are so small
      that (1) autovacuum won't fire on them, and (2) even if it did,
      it couldn't change the reltuples value --- with only one page,
      it can't see just part of the relation.
      
      Discussion: https://postgr.es/m/24467.1585838693@sss.pgh.pa.us
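
Illustrative sketch (not part of the commit): a toy model of why unequal partition sizes give a predictable ordering. Children are sorted by descending estimated row count, which is a simplification of the planner's behaviour. The partition names and row counts are made up.

```python
from typing import Dict, List


def append_order(row_estimates: Dict[str, float]) -> List[str]:
    """Order children by descending estimated size; ties keep their input order."""
    return sorted(row_estimates, key=lambda name: -row_estimates[name])


# Equal-sized partitions: a small drift in one estimate reorders the children.
equal = {"p1": 1000, "p2": 1000, "p3": 1000}
equal_drifted = {**equal, "p1": 990}

# Clearly different sizes: the same relative drift leaves the order unchanged.
unequal = {"p1": 3000, "p2": 2000, "p3": 1000}
unequal_drifted = {**unequal, "p1": 2970}

print(append_order(equal), append_order(equal_drifted))
print(append_order(unequal), append_order(unequal_drifted))
```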
    • doc: remove unnecessary INNER keyword · 8da1538b
      Bruce Momjian authored
      A join added in commit 9b2009c4 did not use the INNER keyword, but
      the existing query did.  It was cleaner to remove the existing INNER
      keyword.
      
      Reported-by: Peter Eisentraut
      
      Discussion: https://postgr.es/m/a1ffbfda-59d2-5732-e5fb-3df8582b6434@2ndquadrant.com
      
      Backpatch-through: 9.5
    • doc: remove comma, related to commit 92d31085 · c713dc2f
      Bruce Momjian authored
      Reported-by: Peter Eisentraut
      
      Discussion: https://postgr.es/m/750b8832-d123-7f9b-931e-43ce8321b2d7@2ndquadrant.com
      
      Backpatch-through: 9.5
    • Improve user control over truncation of logged bind-parameter values. · 0b34e7d3
      Tom Lane authored
      This patch replaces the boolean GUC log_parameters_on_error introduced
      by commit ba79cb5d with an integer log_parameter_max_length_on_error,
      adding the ability to specify how many bytes to trim each logged
      parameter value to.  (The previous coding hard-wired that choice at
      64 bytes.)
      
      In addition, add a new parameter log_parameter_max_length that provides
      similar control over truncation of query parameters that are logged in
      response to statement-logging options, as opposed to errors.  Previous
      releases always logged such parameters in full, possibly causing log
      bloat.
      
      For backwards compatibility with prior releases,
      log_parameter_max_length defaults to -1 (log in full), while
      log_parameter_max_length_on_error defaults to 0 (no logging).
      
      Per discussion, log_parameter_max_length is SUSET since the DBA should
      control routine logging behavior, but log_parameter_max_length_on_error
      is USERSET because it also affects errcontext data sent back to the
      client.
      
      Alexey Bashtanov, editorialized a little by me
      
      Discussion: https://postgr.es/m/b10493cc-a399-a03a-67c7-068f2791ee50@imap.cc
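
Illustrative sketch (not part of the commit): the three cases described for the new settings (-1 logs in full, 0 logs nothing, a positive value trims to that many bytes) can be modelled as below. The function name, the "..." marker for trimmed values, and the handling of multibyte characters are assumptions for this example.

```python
from typing import Optional


def clip_parameter(value: str, max_length: int) -> Optional[str]:
    """Return the text to log for one bind parameter, or None if not logged."""
    if max_length == 0:
        return None   # 0: parameter values are not logged
    raw = value.encode("utf-8")
    if max_length < 0 or len(raw) <= max_length:
        return value  # -1, or short enough: logged in full
    # Trim to max_length bytes, dropping any multibyte character cut in half.
    clipped = raw[:max_length].decode("utf-8", errors="ignore")
    return clipped + "..."  # assumed marker showing that the value was trimmed


print(clip_parameter("abcdefghij", 4))
print(clip_parameter("abcdefghij", -1))
print(clip_parameter("abcdefghij", 0))
```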
    • Attempt to stabilize partitionwise_aggregate test · cefb82d4
      David Rowley authored
      In b07642db, we added code to trigger autovacuums based on the number of
      INSERTs into a table. This seems to have caused some destabilization of
      the regression tests. Likely this is due to an autovacuum triggering
      mid-test and (per theory from Tom Lane) one of the test's queries causes
      autovacuum to skip some number of pages, resulting in the reltuples
      estimate changing.
      
      The failure that this is attempting to fix is around the order of subnodes
      in an Append. Since the planner orders these according to the subnode
      cost, then it's possible that a small change in the reltuples value changes
      the subnode's cost enough that it swaps position with one of its fellow
      subnodes.
      
      The failure here only seems to occur on slower buildfarm machines. In this
      case, lousyjack, which seems to have taken over 8 minutes to run just
      the partitionwise_aggregate test. Such a slow run would increase the
      chances that the autovacuum launcher would trigger a vacuum mid-test.
      Faster machines run this test in sub-second time, so have a much smaller
      window for an autovacuum to trigger.
      
      Here we fix this by disabling autovacuum on all tables created in the test.
      
      Additionally, this reverts the change made in the
      partitionwise_aggregate test in 2dc16efe.
      
      Discussion: https://postgr.es/m/22297.1585797192@sss.pgh.pa.us
    • Add SQL functions for Unicode normalization · 2991ac5f
      Peter Eisentraut authored
      This adds SQL expressions NORMALIZE() and IS NORMALIZED to convert and
      check Unicode normal forms, per SQL standard.
      
      To support fast IS NORMALIZED tests, we pull in a new data file
      DerivedNormalizationProps.txt from Unicode and build a lookup table
      from that, using techniques similar to ones already used for other
      Unicode data.  make update-unicode will keep it up to date.  We only
      build and use these tables for the NFC and NFKC forms, because they
      are too big for NFD and NFKD and the improvement is not significant
      enough there.
      
      Reviewed-by: Daniel Verite <daniel@manitou-mail.org>
      Reviewed-by: Andreas Karlsson <andreas@proxel.se>
      Discussion: https://www.postgresql.org/message-id/flat/c1909f27-c269-2ed9-12f8-3ab72c8caf7a@2ndquadrant.com
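
Illustrative sketch (not part of the commit): the two operations, converting to a normal form and checking whether a string is already in one, can be tried out with Python's standard unicodedata module. This shows the Unicode concepts only, not PostgreSQL's implementation. `unicodedata.is_normalized` requires Python 3.8 or later.

```python
import unicodedata

decomposed = "e\u0301"  # "e" followed by COMBINING ACUTE ACCENT
composed = unicodedata.normalize("NFC", decomposed)  # single code point U+00E9

# Checking is separate from converting, analogous to IS NORMALIZED vs NORMALIZE().
already_nfc = unicodedata.is_normalized("NFC", decomposed)
already_nfd = unicodedata.is_normalized("NFD", decomposed)

# Compatibility forms also fold characters such as the "fi" ligature.
ligature_nfkc = unicodedata.normalize("NFKC", "\ufb01")

print(len(decomposed), len(composed), already_nfc, already_nfd, ligature_nfkc)
```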
    • Fix whitespace · 070c3d39
      Peter Eisentraut authored
    • doc: Update for Unix-domain sockets on Windows · 580a446c
      Peter Eisentraut authored
      Update the documentation to reflect that Unix-domain sockets are now
      usable on Windows.
    • Add some comments to some SQL features · c6e0edad
      Peter Eisentraut authored
      Otherwise, it could be confusing to a reader that some of these
      well-publicized features are simply listed as unsupported without
      further explanation.
    • Add maintenance_io_concurrency to postgresql.conf.sample. · 37b3794d
      Thomas Munro authored
      New GUC from commit fc34b0d9.
    • Allow parallel vacuum to accumulate buffer usage. · 3a5e2213
      Amit Kapila authored
      Commit 40d964ec allowed the VACUUM command to process indexes in parallel
      but forgot to accumulate the buffer usage stats of parallel workers.  This
      commit allows the leader backend to accumulate the buffer usage stats of
      all the parallel workers.
      
      Reported-by: Julien Rouhaud
      Author: Sawada Masahiko
      Reviewed-by: Dilip Kumar, Amit Kapila and Julien Rouhaud
      Discussion: https://postgr.es/m/20200328151721.GB12854@nol
    • Allow pg_stat_statements to track planning statistics. · 17e03282
      Fujii Masao authored
      This commit makes pg_stat_statements support new GUC
      pg_stat_statements.track_planning. If this option is enabled,
      pg_stat_statements tracks the planning statistics of the statements,
      e.g., the number of times the statement was planned, the total time
      spent planning the statement, etc. This feature is useful for finding
      statements that take a long time to plan. Previously,
      pg_stat_statements tracked only execution statistics, so it could
      not be used for that purpose.
      
      The planning and execution statistics are stored separately, at the
      end of each phase, so there is not always a one-to-one relationship
      between them. For example, if a statement is successfully planned
      but fails in the execution phase, only its planning statistics are stored.
      As a result, users may see pg_stat_statements results that differ
      from those of previous versions. To avoid this,
      pg_stat_statements.track_planning needs to be disabled.
      
      This commit bumps the version of pg_stat_statements to 1.8
      since it changes the definition of pg_stat_statements function.
      
      Author: Julien Rouhaud, Pascal Legrand, Thomas Munro, Fujii Masao
      Reviewed-by: Sergei Kornilov, Tomas Vondra, Yoshikazu Imai, Haribabu Kommi, Tom Lane
      Discussion: https://postgr.es/m/CAHGQGwFx_=DO-Gu-MfPW3VQ4qC7TfVdH2zHmvZfrGv6fQ3D-Tw@mail.gmail.com
      Discussion: https://postgr.es/m/CAEepm=0e59Y_6Q_YXYCTHZkqOc6H2pJ54C_Xe=VFu50Aqqp_sA@mail.gmail.com
      Discussion: https://postgr.es/m/DB6PR0301MB21352F6210E3B11934B0DCC790B00@DB6PR0301MB2135.eurprd03.prod.outlook.com
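
Illustrative sketch (not part of the commit): a toy tracker showing why planning and execution counts can diverge when each phase is recorded separately at its end. The class and method names are hypothetical, and the counter names are chosen for the example only.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class StatementStats:
    plans: int = 0
    total_plan_time: float = 0.0
    calls: int = 0
    total_exec_time: float = 0.0


class Tracker:
    """Hypothetical model: each phase stores its own statistics when it ends."""

    def __init__(self, track_planning: bool = True) -> None:
        self.track_planning = track_planning
        self.entries: Dict[str, StatementStats] = {}

    def end_of_planning(self, query: str, elapsed_ms: float) -> None:
        if not self.track_planning:
            return  # planning statistics are ignored when the option is off
        entry = self.entries.setdefault(query, StatementStats())
        entry.plans += 1
        entry.total_plan_time += elapsed_ms

    def end_of_execution(self, query: str, elapsed_ms: float) -> None:
        entry = self.entries.setdefault(query, StatementStats())
        entry.calls += 1
        entry.total_exec_time += elapsed_ms


tracker = Tracker(track_planning=True)
tracker.end_of_planning("SELECT 1/x FROM t", 0.4)
tracker.end_of_execution("SELECT 1/x FROM t", 2.0)
# Second run: planning succeeds, execution fails, so nothing is recorded for it.
tracker.end_of_planning("SELECT 1/x FROM t", 0.3)
stats = tracker.entries["SELECT 1/x FROM t"]
print(stats.plans, stats.calls)

# With planning tracking off, a statement that only ever fails leaves no entry.
legacy = Tracker(track_planning=False)
legacy.end_of_planning("SELECT 1/0", 0.2)
print(legacy.entries)
```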
    • Collect statistics about SLRU caches · 28cac71b
      Tomas Vondra authored
      There are a number of SLRU caches used to access important data like clog,
      commit timestamps, multixact, asynchronous notifications, etc. Until now
      we had no easy way to monitor these shared caches, compute hit ratios,
      number of reads/writes etc.
      
      This commit extends the statistics collector to track this information
      for a predefined list of SLRUs, and also introduces a new system view
      pg_stat_slru displaying the data.
      
      The list of built-in SLRUs is fixed, but additional SLRUs may be defined
      in extensions. Unfortunately, there's no suitable registry of SLRUs, so
      this patch simply defines a fixed list of SLRUs with entries for the
      built-in ones and one entry for all additional SLRUs. Extensions adding
      their own SLRU are fairly rare, so this seems acceptable.
      
      This patch only allows monitoring of SLRUs, not tuning. The SLRU sizes
      are still fixed (hard-coded in the code) and it's not entirely clear
      which of the SLRUs might need a GUC to tune size. In a way, allowing us
      to determine that is one of the goals of this patch.
      
      Bump catversion as the patch introduces new functions and a new system view.
      
      Author: Tomas Vondra
      Reviewed-by: Alvaro Herrera
      Discussion: https://www.postgresql.org/message-id/flat/20200119143707.gyinppnigokesjok@development
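
Illustrative sketch (not part of the commit): a fixed list of named counters with one shared catch-all entry, plus a hit ratio computed from them. The SLRU names, the counter names, and the hit-ratio formula are assumptions for this example. The real list and view columns may differ.

```python
from typing import Dict, Optional

# Assumed names for the built-in caches; anything else shares the "other" entry.
KNOWN_SLRUS = ("clog", "commit_timestamp", "multixact", "async")
OTHER = "other"


class SlruStats:
    def __init__(self) -> None:
        self.counters: Dict[str, Dict[str, int]] = {
            name: {"hits": 0, "reads": 0, "writes": 0}
            for name in KNOWN_SLRUS + (OTHER,)
        }

    def _slot(self, name: str) -> Dict[str, int]:
        """Map an SLRU name to its entry, falling back to the shared one."""
        return self.counters[name if name in KNOWN_SLRUS else OTHER]

    def count(self, name: str, event: str) -> None:
        self._slot(name)[event] += 1

    def hit_ratio(self, name: str) -> Optional[float]:
        slot = self._slot(name)
        total = slot["hits"] + slot["reads"]
        return slot["hits"] / total if total else None


slru_stats = SlruStats()
for _ in range(3):
    slru_stats.count("clog", "hits")
slru_stats.count("clog", "reads")
slru_stats.count("my_extension_slru", "reads")  # hypothetical extension-defined SLRU
print(slru_stats.hit_ratio("clog"), slru_stats.counters[OTHER])
```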
  4. 01 Apr, 2020 3 commits
    • Clean up parsing of ltree and lquery some more. · 17ca0679
      Tom Lane authored
      Fix lquery parsing to handle repeated flag characters correctly,
      and to enforce the max label length correctly in some cases where
      it did not before, and to detect empty labels in some cases where
      it did not before.
      
      In a more cosmetic vein, use a switch rather than if-then chains to
      handle the different states, and avoid unnecessary checks on charlen
      when looking for ASCII characters, and factor out multiple copies of
      the label length checking code.
      
      Tom Lane and Dmitry Belyavsky
      
      Discussion: https://postgr.es/m/CADqLbzLVkBuPX0812o+z=c3i6honszsZZ6VQOSKR3VPbB56P3w@mail.gmail.com
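
Illustrative sketch (not part of the commit): a much-simplified, two-state parser for dot-separated label paths, showing the kinds of checks mentioned above (empty labels, maximum label length). It does not model lquery syntax or flag characters. The 255-character limit and the allowed character set are assumptions for this example.

```python
from typing import List

MAX_LABEL_LEN = 255  # assumed maximum label length for this sketch


def is_label_char(ch: str) -> bool:
    return ch.isalnum() or ch == "_"


def parse_path(text: str) -> List[str]:
    """Split a dot-separated path into labels, rejecting malformed input."""
    labels: List[str] = []
    current: List[str] = []
    state = "WAIT_LABEL"  # the other state is "IN_LABEL"
    for pos, ch in enumerate(text):
        if state == "WAIT_LABEL":
            if not is_label_char(ch):
                raise ValueError(f"empty label or bad character at position {pos}")
            current = [ch]
            state = "IN_LABEL"
        else:  # IN_LABEL
            if ch == ".":
                labels.append("".join(current))
                state = "WAIT_LABEL"
            elif is_label_char(ch):
                current.append(ch)
                if len(current) > MAX_LABEL_LEN:
                    raise ValueError(f"label too long at position {pos}")
            else:
                raise ValueError(f"bad character at position {pos}")
    if state == "IN_LABEL":
        labels.append("".join(current))
    elif text:
        # Non-empty input that ends while waiting for a label: trailing dot.
        raise ValueError("empty label at end of input")
    return labels


print(parse_path("Top.Science.Astronomy"))
```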
    • Add support for binary I/O of ltree, lquery, and ltxtquery types. · 949a9f04
      Tom Lane authored
      Not much to say here --- does what it says on the tin.  The "binary"
      representation in each case is really just the same as the text format,
      though we prefix a version-number byte in case anyone ever feels
      motivated to change that.  Thus, there's not any expectation of improved
      speed or reduced space; the point here is just to allow clients to use
      binary format for all columns of a query result or COPY data.
      
      This makes use of the recently added ALTER TYPE support to add binary
      I/O functions to an existing data type.  As in commit a8081860,
      we can piggy-back on there already being a new-for-v13 version of the
      ltree extension, so we don't need a new update script file.
      
      Nino Floris, reviewed by Alexander Korotkov and myself
      
      Discussion: https://postgr.es/m/CANmj9Vxx50jOo1L7iSRxd142NyTz6Bdcgg7u9P3Z8o0=HGkYyQ@mail.gmail.com
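
Illustrative sketch (not part of the commit): a "binary" format that is just the text form prefixed by a version byte can be modelled as below. The version value 1, the UTF-8 encoding, and the function names are assumptions for this example.

```python
FORMAT_VERSION = 1  # assumed version number for this sketch


def send_binary(text: str) -> bytes:
    """Encode a value as one version byte followed by its text representation."""
    return bytes([FORMAT_VERSION]) + text.encode("utf-8")


def recv_binary(payload: bytes) -> str:
    """Decode a value, rejecting payloads with a missing or unknown version."""
    if not payload or payload[0] != FORMAT_VERSION:
        raise ValueError("unsupported binary format version")
    return payload[1:].decode("utf-8")


wire = send_binary("Top.Science.Astronomy")
print(len(wire), recv_binary(wire))
```

The payload is one byte longer than the text form, consistent with the commit's remark that no space or speed improvement is expected.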
    • Check equality semantics for unique indexes on partitioned tables. · 501b0187
      Tom Lane authored
      We require the partition key to be a subset of the set of columns
      being made unique, so that physically-separate indexes on the different
      partitions are sufficient to enforce the uniqueness constraint.
      
      The existing code checked that the listed columns appear, but did not
      inquire into the index semantics, which is a serious oversight given
      that different index opclasses might enforce completely different
      notions of uniqueness.
      
      Ideally, perhaps, we'd just match the partition key opfamily to the
      index opfamily.  But hash partitioning uses hash opfamilies which we
      can't directly match to btree opfamilies.  Hence, look up the equality
      operator in each family, and accept if it's the same operator.  This
      should be okay in a fairly general sense, since the equality operator
      ought to precisely represent the opfamily's notion of uniqueness.
      
      A remaining weak spot is that we don't have a cross-index-AM notion of
      which opfamily member is "equality".  But we know which one to use for
      hash and btree AMs, and those are the only two that are relevant here
      at present.  (Any non-core AMs that know how to enforce equality are
      out of luck, for now.)
      
      Back-patch to v11 where this feature was introduced.
      
      Guancheng Luo, revised a bit by me
      
      Discussion: https://postgr.es/m/D9C3CEF7-04E8-47A1-8300-CA1DCD5ED40D@gmail.com
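
Illustrative sketch (not part of the commit): the check described above has two parts. Every partition key column must appear in the unique index, and the equality operator used for partitioning must be the same one the index uses for that column. The dictionaries and operator names are hypothetical stand-ins for catalog lookups.

```python
from typing import Dict


def unique_index_is_enforceable(
    partition_key_eq: Dict[str, str],
    index_column_eq: Dict[str, str],
) -> bool:
    """Both arguments map a column name to the name of its equality operator."""
    for column, partition_eq in partition_key_eq.items():
        if column not in index_column_eq:
            return False  # partition key column missing from the index
        if index_column_eq[column] != partition_eq:
            return False  # same column, but a different notion of equality
    return True


partition_key = {"region": "text_eq"}
same_semantics = {"region": "text_eq", "id": "int4_eq"}
other_semantics = {"region": "text_pattern_eq", "id": "int4_eq"}  # made-up operator
missing_column = {"id": "int4_eq"}

print(
    unique_index_is_enforceable(partition_key, same_semantics),
    unique_index_is_enforceable(partition_key, other_semantics),
    unique_index_is_enforceable(partition_key, missing_column),
)
```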