1. 04 Apr, 2019 1 commit
    • Noah Misch's avatar
      Consistently test for in-use shared memory. · 2f932f71
      Noah Misch authored
      postmaster startup scrutinizes any shared memory segment recorded in
      postmaster.pid, exiting if that segment matches the current data
      directory and has an attached process.  When the postmaster.pid file was
      missing, a starting postmaster used weaker checks.  Change to use the
      same checks in both scenarios.  This increases the chance of a startup
      failure, in lieu of data corruption, if the DBA does "kill -9 `head -n1
      postmaster.pid` && rm postmaster.pid && pg_ctl -w start".  A postmaster
      will no longer recycle segments pertaining to other data directories.
      That's good for production, but it's bad for integration tests that
      crash a postmaster and immediately delete its data directory.  Such a
      test now leaks a segment indefinitely.  No "make check-world" test does
      that.  win32_shmem.c already avoided all these problems.  In 9.6 and
      later, enhance PostgresNode to facilitate testing.  Back-patch to 9.4
      (all supported versions).
      
      Reviewed by Daniel Gustafsson and Kyotaro HORIGUCHI.
      
      Discussion: https://postgr.es/m/20130911033341.GD225735@tornado.leadboat.com
      2f932f71
  2. 03 Apr, 2019 11 commits
    • Tom Lane's avatar
    • Tomas Vondra's avatar
      Add SETTINGS option to EXPLAIN, to print modified settings. · ea569d64
      Tomas Vondra authored
      Query planning is affected by a number of configuration options, and it
      may be crucial to know which of those options were set to non-default
      values.  With this patch you can say EXPLAIN (SETTINGS ON) to include
      that information in the query plan.  Only options affecting planning,
      with values different from the built-in default are printed.
      
      This patch also adds auto_explain.log_settings option, providing the
      same capability in auto_explain module.
      
      Author: Tomas Vondra
      Reviewed-by: Rafia Sabih, John Naylor
      Discussion: https://postgr.es/m/e1791b4c-df9c-be02-edc5-7c8874944be0@2ndquadrant.com
      ea569d64
    • Alvaro Herrera's avatar
      Tweak docs for log_statement_sample_rate · d1f04b96
      Alvaro Herrera authored
      Author: Justin Pryzby, partly after a suggestion from Masahiko Sawada
      Discussion: https://postgr.es/m/20190328135918.GA27808@telsasoft.com
      Discussion: https://postgr.es/m/CAD21AoB9+y8N4+Fan-ne-_7J5yTybPttxeVKfwUocKp4zT1vNQ@mail.gmail.com
      d1f04b96
    • Alvaro Herrera's avatar
      Log all statements from a sample of transactions · 799e2203
      Alvaro Herrera authored
      This is useful to obtain a view of the different transaction types in an
      application, regardless of the durations of the statements each runs.
      
      Author: Adrien Nayrat
      Reviewed-by: Masahiko Sawada, Hayato Kuroda, Andres Freund
      799e2203
    • Tom Lane's avatar
      Remove now-unnecessary thread pointer arguments in pgbench. · d8c0bd9f
      Tom Lane authored
      Not required after nuking the zipfian thread-local cache.
      
      Also add a comment about hazardous pointer punning in threadRun(),
      and avoid using "thread" to refer to the threads array as a whole.
      
      Fabien Coelho and Tom Lane, per suggestion from Alvaro Herrera
      
      Discussion: https://postgr.es/m/alpine.DEB.2.21.1904032126060.7997@lancre
      d8c0bd9f
    • Tomas Vondra's avatar
      Reduce overhead of pg_mcv_list (de)serialization · c50b3158
      Tomas Vondra authored
      Commit ea4e1c0e resolved issues with memory alignment in serialized
      pg_mcv_list values, but it required copying data to/from the varlena
      buffer during serialization and deserialization.  As the MCV lits may
      be fairly large, the overhead (memory consumption, CPU usage) can get
      rather significant too.
      
      This change tweaks the serialization format so that the alignment is
      correct with respect to the varlena value, and so the parts may be
      accessed directly without copying the data.
      
      Catversion bump, as it affects existing pg_statistic_ext data.
      c50b3158
    • Stephen Frost's avatar
      GSSAPI encryption support · b0b39f72
      Stephen Frost authored
      On both the frontend and backend, prepare for GSSAPI encryption
      support by moving common code for error handling into a separate file.
      Fix a TODO for handling multiple status messages in the process.
      Eliminate the OIDs, which have not been needed for some time.
      
      Add frontend and backend encryption support functions.  Keep the
      context initiation for authentication-only separate on both the
      frontend and backend in order to avoid concerns about changing the
      requested flags to include encryption support.
      
      In postmaster, pull GSSAPI authorization checking into a shared
      function.  Also share the initiator name between the encryption and
      non-encryption codepaths.
      
      For HBA, add "hostgssenc" and "hostnogssenc" entries that behave
      similarly to their SSL counterparts.  "hostgssenc" requires either
      "gss", "trust", or "reject" for its authentication.
      
      Similarly, add a "gssencmode" parameter to libpq.  Supported values are
      "disable", "require", and "prefer".  Notably, negotiation will only be
      attempted if credentials can be acquired.  Move credential acquisition
      into its own function to support this behavior.
      
      Add a simple pg_stat_gssapi view similar to pg_stat_ssl, for monitoring
      if GSSAPI authentication was used, what principal was used, and if
      encryption is being used on the connection.
      
      Finally, add documentation for everything new, and update existing
      documentation on connection security.
      
      Thanks to Michael Paquier for the Windows fixes.
      
      Author: Robbie Harwood, with changes to the read/write functions by me.
      Reviewed in various forms and at different times by: Michael Paquier,
         Andres Freund, David Steele.
      Discussion: https://www.postgresql.org/message-id/flat/jlg1tgq1ktm.fsf@thriss.redhat.com
      b0b39f72
    • Alvaro Herrera's avatar
      Copy name when cloning FKs recurses to partitions · 5f6fc34a
      Alvaro Herrera authored
      We were passing a string owned by a syscache entry, which was released
      before recursing.  Fix by pstrdup'ing the string.
      
      Per buildfarm member prion.
      5f6fc34a
    • Alvaro Herrera's avatar
      Support foreign keys that reference partitioned tables · f56f8f8d
      Alvaro Herrera authored
      Previously, while primary keys could be made on partitioned tables, it
      was not possible to define foreign keys that reference those primary
      keys.  Now it is possible to do that.
      
      Author: Álvaro Herrera
      Reviewed-by: Amit Langote, Jesper Pedersen
      Discussion: https://postgr.es/m/20181102234158.735b3fevta63msbj@alvherre.pgsql
      f56f8f8d
    • Heikki Linnakangas's avatar
      Generate less WAL during GiST, GIN and SP-GiST index build. · 9155580f
      Heikki Linnakangas authored
      Instead of WAL-logging every modification during the build separately,
      first build the index without any WAL-logging, and make a separate pass
      through the index at the end, to write all pages to the WAL. This
      significantly reduces the amount of WAL generated, and is usually also
      faster, despite the extra I/O needed for the extra scan through the index.
      WAL generated this way is also faster to replay.
      
      For GiST, the LSN-NSN interlock makes this a little tricky. All pages must
      be marked with a valid (i.e. non-zero) LSN, so that the parent-child
      LSN-NSN interlock works correctly. We now use magic value 1 for that during
      index build. Change the fake LSN counter to begin from 1000, so that 1 is
      safely smaller than any real or fake LSN. 2 would've been enough for our
      purposes, but let's reserve a bigger range, in case we need more special
      values in the future.
      
      Author: Anastasia Lubennikova, Andrey V. Lepikhov
      Reviewed-by: Heikki Linnakangas, Dmitry Dolgov
      9155580f
    • Alvaro Herrera's avatar
      Correctly initialize newly added struct member · 5f768045
      Alvaro Herrera authored
      Valgrind was rightly complaining that IndexVacuumInfo->report_progress
      (added by commit ab0dfc96) was not being initialized in some code
      paths.  Repair.
      
      Per buildfarm member lousyjack.
      5f768045
  3. 02 Apr, 2019 11 commits
  4. 01 Apr, 2019 17 commits
    • Andres Freund's avatar
      Only allow heap in a number of contrib modules. · 4b826641
      Andres Freund authored
      Contrib modules pgrowlocks, pgstattuple and some functionality in
      pageinspect currently only supports the heap table AM. As they are all
      concerned with low-level details that aren't reasonably exposed via
      tableam, error out if invoked on a non heap relation.
      
      Author: Andres Freund
      Discussion: https://postgr.es/m/20180703070645.wchpu5muyto5n647@alap3.anarazel.de
      4b826641
    • Andres Freund's avatar
      tableam: Add table_finish_bulk_insert(). · d45e4015
      Andres Freund authored
      This replaces the previous calls of heap_sync() in places using
      bulk-insert. By passing in the flags used for bulk-insert the AM can
      decide (first at insert time and then during the finish call) which of
      the optimizations apply to it, and what operations are necessary to
      finish a bulk insert operation.
      
      Also change HEAP_INSERT_* flags to TABLE_INSERT, and rename hi_options
      to ti_options.
      
      These changes are made even in copy.c, which hasn't yet been converted
      to tableam. There's no harm in doing so.
      
      Author: Andres Freund
      Discussion: https://postgr.es/m/20180703070645.wchpu5muyto5n647@alap3.anarazel.de
      d45e4015
    • Tom Lane's avatar
      Restrict pgbench's zipfian parameter to ensure good performance. · 26a76cb6
      Tom Lane authored
      Remove the code that supported zipfian distribution parameters less
      than 1.0, as it had undocumented performance hazards, and it's not
      clear that the case is useful enough to justify either fixing or
      documenting those hazards.
      
      Also, since the code path for parameter > 1.0 could perform badly
      for values very close to 1.0, establish a minimum allowed value
      of 1.001.  This solution seems superior to the previous vague
      documentation warning about small values not performing well.
      
      Fabien Coelho, per a gripe from Tomas Vondra
      
      Discussion: https://postgr.es/m/b5e172e9-ad22-48a3-86a3-589afa20e8f7@2ndquadrant.com
      26a76cb6
    • Thomas Munro's avatar
      Fix deadlock in heap_compute_xid_horizon_for_tuples(). · 4fd05bb5
      Thomas Munro authored
      We can't call code that uses syscache while we hold buffer locks
      on a catalog relation.  If passed such a relation, just fall back
      to the general effective_io_concurrency GUC rather than trying to
      look up the containing tablespace's IO concurrency setting.
      
      We might find a better way to control prefetching in follow-up
      work, but for now this is enough to avoid the deadlock introduced
      by commit 558a9165.
      
      Reviewed-by: Andres Freund
      Diagnosed-by: Peter Geoghegan
      Discussion: https://postgr.es/m/CA%2BhUKGLCwPF0S4Mk7S8qw%2BDK0Bq65LueN9rofAA3HHSYikW-Zw%40mail.gmail.com
      Discussion: https://postgr.es/m/962831d8-c18d-180d-75fb-8b842e3a2742%40chrullrich.net
      4fd05bb5
    • Tom Lane's avatar
      Improve documentation about our XML functionality. · 12d46ac3
      Tom Lane authored
      Add a section explaining how our XML features depart from current
      versions of the SQL standard.  Update and clarify the descriptions
      of some XML functions.
      
      Chapman Flack, reviewed by Ryan Lambert
      
      Discussion: https://postgr.es/m/5BD1284C.1010305@anastigmatix.net
      Discussion: https://postgr.es/m/5C81F8C0.6090901@anastigmatix.net
      Discussion: https://postgr.es/m/CAN-V+g-6JqUQEQZ55Q3toXEN6d5Ez5uvzL4VR+8KtvJKj31taw@mail.gmail.com
      12d46ac3
    • Tom Lane's avatar
    • Peter Eisentraut's avatar
      Unified logging system for command-line programs · cc8d4151
      Peter Eisentraut authored
      This unifies the various ad hoc logging (message printing, error
      printing) systems used throughout the command-line programs.
      
      Features:
      
      - Program name is automatically prefixed.
      
      - Message string does not end with newline.  This removes a common
        source of inconsistencies and omissions.
      
      - Additionally, a final newline is automatically stripped, simplifying
        use of PQerrorMessage() etc., another common source of mistakes.
      
      - I converted error message strings to use %m where possible.
      
      - As a result of the above several points, more translatable message
        strings can be shared between different components and between
        frontends and backend, without gratuitous punctuation or whitespace
        differences.
      
      - There is support for setting a "log level".  This is not meant to be
        user-facing, but can be used internally to implement debug or
        verbose modes.
      
      - Lazy argument evaluation, so no significant overhead if logging at
        some level is disabled.
      
      - Some color in the messages, similar to gcc and clang.  Set
        PG_COLOR=auto to try it out.  Some colors are predefined, but can be
        customized by setting PG_COLORS.
      
      - Common files (common/, fe_utils/, etc.) can handle logging much more
        simply by just using one API without worrying too much about the
        context of the calling program, requiring callbacks, or having to
        pass "progname" around everywhere.
      
      - Some programs called setvbuf() to make sure that stderr is
        unbuffered, even on Windows.  But not all programs did that.  This
        is now done centrally.
      
      Soft goals:
      
      - Reduces vertical space use and visual complexity of error reporting
        in the source code.
      
      - Encourages more deliberate classification of messages.  For example,
        in some cases it wasn't clear without analyzing the surrounding code
        whether a message was meant as an error or just an info.
      
      - Concepts and terms are vaguely aligned with popular logging
        frameworks such as log4j and Python logging.
      
      This is all just about printing stuff out.  Nothing affects program
      flow (e.g., fatal exits).  The uses are just too varied to do that.
      Some existing code had wrappers that do some kind of print-and-exit,
      and I adapted those.
      
      I tried to keep the output mostly the same, but there is a lot of
      historical baggage to unwind and special cases to consider, and I
      might not always have succeeded.  One significant change is that
      pg_rewind used to write all error messages to stdout.  That is now
      changed to stderr.
      Reviewed-by: default avatarDonald Dong <xdong@csumb.edu>
      Reviewed-by: default avatarArthur Zakirov <a.zakirov@postgrespro.ru>
      Discussion: https://www.postgresql.org/message-id/flat/6a609b43-4f57-7348-6480-bd022f924310@2ndquadrant.com
      cc8d4151
    • Alexander Korotkov's avatar
      Throw error in jsonb_path_match() when result is not single boolean · b4cc19ab
      Alexander Korotkov authored
      jsonb_path_match() checks if jsonb document matches jsonpath query.  Therefore,
      jsonpath query should return single boolean.  Currently, if result of jsonpath
      is not a single boolean, NULL is returned independently whether silent mode
      is on or off.  But that appears to be wrong when silent mode is off.  This
      commit makes jsonb_path_match() throw an error in this case.
      
      Author: Nikita Glukhov
      b4cc19ab
    • Alexander Korotkov's avatar
      Restrict some cases in parsing numerics in jsonpath · 2e643501
      Alexander Korotkov authored
      Jsonpath now accepts integers with leading zeroes and floats starting with
      a dot.  However, SQL standard requires to follow JSON specification, which
      doesn't allow none of these cases.  Our json[b] datatypes also restrict that.
      So, restrict it in jsonpath altogether.
      
      Author: Nikita Glukhov
      2e643501
    • Alexander Korotkov's avatar
      GIN support for @@ and @? jsonpath operators · 0a02e2ae
      Alexander Korotkov authored
      This commit makes existing GIN operator classes jsonb_ops and json_path_ops
      support "jsonb @@ jsonpath" and "jsonb @? jsonpath" operators.  Basic idea is
      to extract statements of following form out of jsonpath.
      
       key1.key2. ... .keyN = const
      
      The rest of jsonpath is rechecked from heap.
      
      Catversion is bumped.
      
      Discussion: https://postgr.es/m/fcc6fc6a-b497-f39a-923d-aa34d0c588e8%402ndQuadrant.com
      Author: Nikita Glukhov, Alexander Korotkov
      Reviewed-by: Jonathan Katz, Pavel Stehule
      0a02e2ae
    • Peter Eisentraut's avatar
      Catch syntax error in generated column definition · 72419117
      Peter Eisentraut authored
      The syntax
      
          GENERATED BY DEFAULT AS (expr)
      
      is not allowed but we have to accept it in the grammar to avoid
      shift/reduce conflicts because of the similar syntax for identity
      columns.  The existing code just ignored this, incorrectly.  Add an
      explicit error check and a bespoke error message.
      Reported-by: default avatarJustin Pryzby <pryzby@telsasoft.com>
      72419117
    • Michael Paquier's avatar
      Fix thinko in allocation call during MVC list deserialization · 4ae7f02b
      Michael Paquier authored
      Spotted by Coverity.
      4ae7f02b
    • Noah Misch's avatar
      Update HINT for pre-existing shared memory block. · 5a907404
      Noah Misch authored
      One should almost always terminate an old process, not use a manual
      removal tool like ipcrm.  Removal of the ipcclean script eleven years
      ago (39627b1a) and its non-replacement
      corroborate that manual shm removal is now a niche goal.  Back-patch to
      9.4 (all supported versions).
      
      Reviewed by Daniel Gustafsson and Kyotaro HORIGUCHI.
      
      Discussion: https://postgr.es/m/20180812064815.GB2301738@rfd.leadboat.com
      5a907404
    • Andres Freund's avatar
      tableam: bitmap table scan. · bfbcad47
      Andres Freund authored
      This moves bitmap heap scan support to below an optional tableam
      callback. It's optional as the whole concept of bitmap heapscans is
      fairly block specific.
      
      This basically moves the work previously done in bitgetpage() into the
      new scan_bitmap_next_block callback, and the direct poking into the
      buffer done in BitmapHeapNext() into the new scan_bitmap_next_tuple()
      callback.
      
      The abstraction is currently somewhat leaky because
      nodeBitmapHeapscan.c's prefetching and visibilitymap based logic
      remains - it's likely that we'll later have to move more into the
      AM. But it's not trivial to do so without introducing a significant
      amount of code duplication between the AMs, so that's a project for
      later.
      
      Note that now nodeBitmapHeapscan.c and the associated node types are a
      bit misnamed. But it's not clear whether renaming wouldn't be a cure
      worse than the disease. Either way, that'd be best done in a separate
      commit.
      
      Author: Andres Freund
      Reviewed-By: Robert Haas (in an older version)
      Discussion: https://postgr.es/m/20180703070645.wchpu5muyto5n647@alap3.anarazel.de
      bfbcad47
    • Andres Freund's avatar
      tableam: sample scan. · 73c954d2
      Andres Freund authored
      This moves sample scan support to below tableam. It's not optional as
      there is, in contrast to e.g. bitmap heap scans, no alternative way to
      perform tablesample queries. If an AM can't deal with the block based
      API, it will have to throw an ERROR.
      
      The tableam callbacks for this are block based, but given the current
      TsmRoutine interface, that seems to be required.
      
      The new interface doesn't require TsmRoutines to perform visibility
      checks anymore - that requires the TsmRoutine to know details about
      the AM, which we want to avoid.  To continue to allow taking the
      returned number of tuples account SampleScanState now has a donetuples
      field (which previously e.g. existed in SystemRowsSamplerData), which
      is only incremented after the visibility check succeeds.
      
      Author: Andres Freund
      Discussion: https://postgr.es/m/20180703070645.wchpu5muyto5n647@alap3.anarazel.de
      73c954d2
    • Andres Freund's avatar
      tableam: Formatting and other minor cleanups. · 4bb50236
      Andres Freund authored
      The superflous heapam_xlog.h includes were reported by Peter
      Geoghegan.
      4bb50236
    • Peter Geoghegan's avatar
      Fix nbtree high key "continuescan" row compare bug. · 76a39f22
      Peter Geoghegan authored
      Commit 29b64d1d mishandled skipping over truncated high key attributes
      during row comparisons.  The row comparison key matching loop would loop
      forever when a truncated attribute was encountered for a row compare
      subkey.  Fix by following the example of other code in the loop: advance
      the current subkey, or break out of the loop when the last subkey is
      reached.
      
      Add test coverage for the relevant _bt_check_rowcompare() code path.
      The new test case is somewhat tied to nbtree implementation details,
      which isn't ideal, but seems unavoidable.
      76a39f22