1. 19 Jan, 2016 3 commits
    • Alvaro Herrera's avatar
      Add two HyperLogLog functions · 948c9795
      Alvaro Herrera authored
      New functions initHyperLogLogError() and freeHyperLogLog() simplify
      using this module from elsewhere.
      
      Author: Tomáš Vondra
      Review: Peter Geoghegan
      948c9795
    • Tom Lane's avatar
      Fix assorted inconsistencies in GiST opclass support function declarations. · 9ff60273
      Tom Lane authored
      The conventions specified by the GiST SGML documentation were widely
      ignored.  For example, the strategy-number argument for "consistent" and
      "distance" functions is specified to be a smallint, but most of the
      built-in support functions declared it as an integer, and for that matter
      the core code passed it using Int32GetDatum not Int16GetDatum.  None of
      that makes any real difference at runtime, but it's quite confusing for
      newcomers to the code, and it makes it very hard to write an amvalidate()
      function that checks support function signatures.  So let's try to instill
      some consistency here.
      
      Another similar issue is that the "query" argument is not of a single
      well-defined type, but could have different types depending on the strategy
      (corresponding to search operators with different righthand-side argument
      types).  Some of the functions threw up their hands and declared the query
      argument as being of "internal" type, which surely isn't right ("any" would
      have been more appropriate); but the majority position seemed to be to
      declare it as being of the indexed data type, corresponding to a search
      operator with both input types the same.  So I've specified a convention
      that that's what to do always.
      
      Also, the result of the "union" support function actually must be of the
      index's storage type, but the documentation suggested declaring it to
      return "internal", and some of the functions followed that.  Standardize
      on telling the truth, instead.
      
      Similarly, standardize on declaring the "same" function's inputs as
      being of the storage type, not "internal".
      
      Also, somebody had forgotten to add the "recheck" argument to both
      the documentation of the "distance" support function and all of their
      SQL declarations, even though the C code was happily using that argument.
      Clean that up too.
      
      Fix up some other omissions in the docs too, such as documenting that
      union's second input argument is vestigial.
      
      So far as the errors in core function declarations go, we can just fix
      pg_proc.h and bump catversion.  Adjusting the erroneous declarations in
      contrib modules is more debatable: in principle any change in those
      scripts should involve an extension version bump, which is a pain.
      However, since these changes are purely cosmetic and make no functional
      difference, I think we can get away without doing that.
      9ff60273
    • Andrew Dunstan's avatar
      Remove Cygwin-specific code from pg_ctl · 53c949c1
      Andrew Dunstan authored
      This code has been there for a long time, but it's never really been
      needed. Cygwin has its own utility for registering, unregistering,
      stopping and starting Windows services, and that's what's used in the
      Cygwin postgres packages. So now pg_ctl for Cygwin looks like it is for
      any Unix platform.
      
      Michael Paquier and me
      53c949c1
  2. 18 Jan, 2016 4 commits
    • Tatsuo Ishii's avatar
      Fix typo. · 85f22281
      Tatsuo Ishii authored
      Reported by KOIZUMI Satoru.
      85f22281
    • Tom Lane's avatar
      Add explicit cast to amcostestimate call. · 49b49506
      Tom Lane authored
      My compiler doesn't complain here, but David Rowley's does ...
      49b49506
    • Tom Lane's avatar
      Restructure index access method API to hide most of it at the C level. · 65c5fcd3
      Tom Lane authored
      This patch reduces pg_am to just two columns, a name and a handler
      function.  All the data formerly obtained from pg_am is now provided
      in a C struct returned by the handler function.  This is similar to
      the designs we've adopted for FDWs and tablesample methods.  There
      are multiple advantages.  For one, the index AM's support functions
      are now simple C functions, making them faster to call and much less
      error-prone, since the C compiler can now check function signatures.
      For another, this will make it far more practical to define index access
      methods in installable extensions.
      
      A disadvantage is that SQL-level code can no longer see attributes
      of index AMs; in particular, some of the crosschecks in the opr_sanity
      regression test are no longer possible from SQL.  We've addressed that
      by adding a facility for the index AM to perform such checks instead.
      (Much more could be done in that line, but for now we're content if the
      amvalidate functions more or less replace what opr_sanity used to do.)
      We might also want to expose some sort of reporting functionality, but
      this patch doesn't do that.
      
      Alexander Korotkov, reviewed by Petr Jelínek, and rather heavily
      editorialized on by me.
      65c5fcd3
    • Tom Lane's avatar
      Re-pgindent a few files. · 8d290c8e
      Tom Lane authored
      In preparation for landing index AM interface changes.
      8d290c8e
  3. 17 Jan, 2016 2 commits
    • Tom Lane's avatar
      Remove dead code in pg_dump. · 57ce9acc
      Tom Lane authored
      Coverity quite reasonably complained that this check for fout==NULL
      occurred after we'd already dereferenced fout.  However, the check
      is just dead code since there is no code path by which CreateArchive
      can return a null pointer.  Errors such as can't-open-that-file are
      reported down inside CreateArchive, and control doesn't return.
      So let's silence the warning by removing the dead code, rather than
      continuing to pretend it does something.
      
      Coverity didn't complain about this before 5b5fea2a, so back-patch
      to 9.5 like that patch.
      57ce9acc
    • Peter Eisentraut's avatar
      psql: Add completion support for DROP INDEX CONCURRENTLY · 4189e3d6
      Peter Eisentraut authored
      based on patch by Kyotaro Horiguchi
      4189e3d6
  4. 15 Jan, 2016 2 commits
  5. 14 Jan, 2016 2 commits
    • Tom Lane's avatar
      Fix build_grouping_chain() to not clobber its input lists. · a923af38
      Tom Lane authored
      There's no good reason for stomping on the input data; it makes the logic
      in this function no simpler, in fact probably the reverse.  And it makes
      it impossible to separate path generation from plan generation, as I'm
      working towards doing; that will require more than one traversal of these
      lists.
      a923af38
    • Magnus Hagander's avatar
      Properly close token in sspi authentication · 6a61d1ff
      Magnus Hagander authored
      We can never leak more than one token, but we shouldn't do that. We
      don't bother closing it in the error paths since the process will
      exit shortly anyway.
      
      Christian Ullrich
      6a61d1ff
  6. 13 Jan, 2016 6 commits
    • Tom Lane's avatar
      Handle extension members when first setting object dump flags in pg_dump. · e72d7d85
      Tom Lane authored
      pg_dump's original approach to handling extension member objects was to
      run around and clear (or set) their dump flags rather late in its data
      collection process.  Unfortunately, quite a lot of code expects those flags
      to be valid before that; which was an entirely reasonable expectation
      before we added extensions.  In particular, this explains Karsten Hilbert's
      recent report of pg_upgrade failing on a database in which an extension
      has been installed into the pg_catalog schema.  Its objects are initially
      marked as not-to-be-dumped on the strength of their schema, and later we
      change them to must-dump because we're doing a binary upgrade of their
      extension; but we've already skipped essential tasks like making associated
      DO_SHELL_TYPE objects.
      
      To fix, collect extension membership data first, and incorporate it in the
      initial setting of the dump flags, so that those are once again correct
      from the get-go.  This has the undesirable side effect of slightly
      lengthening the time taken before pg_dump acquires table locks, but testing
      suggests that the increase in that window is not very much.
      
      Along the way, get rid of ugly special-case logic for deciding whether
      to dump procedural languages, FDWs, and foreign servers; dump decisions
      for those are now correct up-front, too.
      
      In 9.3 and up, this also fixes erroneous logic about when to dump event
      triggers (basically, they were *always* dumped before).  In 9.5 and up,
      transform objects had that problem too.
      
      Since this problem came in with extensions, back-patch to all supported
      versions.
      e72d7d85
    • Tom Lane's avatar
      Access pg_dump's options structs through Archive struct, not directly. · 5b5fea2a
      Tom Lane authored
      Rather than passing around DumpOptions and RestoreOptions as separate
      arguments, add fields to struct Archive to carry pointers to these objects,
      and access them through those fields when needed.  There already was a
      RestoreOptions pointer in Archive, though for no obvious reason it was part
      of the "private" struct rather than out where pg_dump.c could see it.
      
      Doing this allows reversion of quite a lot of parameter-addition changes
      made in commit 0eea8047, which is a good thing IMO because this will
      reduce the code delta between 9.4 and 9.5, probably easing a few future
      back-patch efforts.  Moreover, the previous commit only added a DumpOptions
      argument to functions that had to have it at the time, which means we could
      anticipate still more code churn (and more back-patch hazard) as the
      requirement spread further.  I'd hit exactly that problem in my upcoming
      patch to fix extension membership marking, which is what motivated me to
      do this.
      5b5fea2a
    • Tom Lane's avatar
      Run pgindent on src/bin/pg_dump/* · 26905e00
      Tom Lane authored
      To ease doing indent fixups on a couple of patches I have in progress.
      26905e00
    • Peter Eisentraut's avatar
      psql: Improve CREATE INDEX CONCURRENTLY tab completion · b1bfb28b
      Peter Eisentraut authored
      The completion of CREATE INDEX CONCURRENTLY was lacking in several ways
      compared to a plain CREATE INDEX command:
      
      - CREATE INDEX <name> ON completes table names, but didn't with
        CONCURRENTLY.
      
      - CREATE INDEX completes ON and existing index names, but with
        CONCURRENTLY it only completed ON.
      
      - CREATE INDEX <name> completes ON, but didn't with CONCURRENTLY.
      
      These are now all fixed.
      b1bfb28b
    • Peter Eisentraut's avatar
      psql: Fix CREATE INDEX tab completion · bc56d589
      Peter Eisentraut authored
      The previous code supported a syntax like CREATE INDEX name
      CONCURRENTLY, which never existed.  Mistake introduced in commit
      37ec19a1.  Remove the addition of
      CONCURRENTLY at that point.
      bc56d589
    • Peter Eisentraut's avatar
      psql: Update tab completion comment · 70327030
      Peter Eisentraut authored
      This just updates a comment to match the code.
      
      from Michael Paquier
      70327030
  7. 12 Jan, 2016 5 commits
    • Simon Riggs's avatar
      Add new user fn pg_current_xlog_flush_location() · e63bb454
      Simon Riggs authored
      Tomas Vondra, reviewed by Michael Paquier and Amit Kapila
      Minor edits by me
      e63bb454
    • Simon Riggs's avatar
      Maintain local LogwrtResult consistently · 1e29e632
      Simon Riggs authored
      Teach GetFlushRecPtr() to update LogwrtResult cache as performed by all other
      functions in xlog.c
      1e29e632
    • Tom Lane's avatar
      Remove no-longer-needed old-style check for incompatible plpythons. · 796d1e88
      Tom Lane authored
      Commit 866566a6 introduced a new mechanism for incompatible
      plpythons to detect each other.  I left the old mechanism in place,
      because it seems possible that a plpython predating that commit might be
      used with one postdating it.  (This would require updating plpython3 but
      not plpython2 or vice versa, but that seems well within the realm of
      possibility.)  However, surely it will not be able to happen in 9.6 or
      later, so we can delete the old mechanism in HEAD.
      796d1e88
    • Tom Lane's avatar
      Use LOAD not actual code execution to pull in plpython library. · fb6fcbd3
      Tom Lane authored
      Commit 866566a6 is insufficient to prevent dump/reload failures
      when using transform modules in a database with both plpython2 and
      plpython3 installed.  The reason is that the transform extension scripts
      use DO blocks as a mechanism to pull in the libpython library before
      creating the transform function.  It's necessary to preload the library
      because the dynamic loader won't do it for us on every platform, leading
      to "unresolved symbol" failures when the transform library is loaded.
      But it's *not* necessary to execute Python code, and doing so will
      provoke a multiple-Pythons-are-loaded error even after the preceding
      commit.
      
      To fix, use LOAD instead of a DO block.  That requires superuser privilege,
      but creation of a C function does anyway.  It also embeds knowledge of
      the underlying library name for each PL language; but that's wired into
      the initdb-time contents of pg_pltemplate too, so that doesn't seem like
      a large problem either.  Note that CREATE TRANSFORM as such doesn't call
      the language module at all.
      
      Per a report from Paul Jones.  Back-patch to 9.5 where transform modules
      were introduced.
      fb6fcbd3
    • Tom Lane's avatar
      Avoid dump/reload problems when using both plpython2 and plpython3. · 866566a6
      Tom Lane authored
      Commit 80371601 installed a safeguard against loading plpython2
      and plpython3 at the same time, but asserted that both could still be
      used in the same database, just not in the same session.  However, that's
      not actually all that practical because dumping and reloading will fail
      (since both libraries necessarily get loaded into the restoring session).
      pg_upgrade is even worse, because it checks for missing libraries by
      loading every .so library mentioned in the entire installation into one
      session, so that you can have only one across the whole cluster.
      
      We can improve matters by not throwing the error immediately in _PG_init,
      but only when and if we're asked to do something that requires calling
      into libpython.  This ameliorates both of the above situations, since
      while execution of CREATE LANGUAGE, CREATE FUNCTION, etc will result in
      loading plpython, it isn't asked to do anything interesting (at least
      not if check_function_bodies is off, as it will be during a restore).
      
      It's possible that this opens some corner-case holes in which a crash
      could be provoked with sufficient effort.  However, since plpython
      only exists as an untrusted language, any such crash would require
      superuser privileges, making it "don't do that" not a security issue.
      To reduce the hazards in this area, the error is still FATAL when it
      does get thrown.
      
      Per a report from Paul Jones.  Back-patch to 9.2, which is as far back
      as the patch applies without work.  (It could be made to work in 9.1,
      but given the lack of previous complaints, I'm disinclined to expend
      effort so far back.  We've been pretty desultory about support for
      Python 3 in 9.1 anyway.)
      866566a6
  8. 11 Jan, 2016 2 commits
  9. 09 Jan, 2016 7 commits
    • Tom Lane's avatar
      Remove a useless PG_GETARG_DATUM() call from jsonb_build_array. · 820bdccc
      Tom Lane authored
      This loop uselessly fetched the argument after the one it's currently
      looking at.  No real harm is done since we couldn't possibly fetch off
      the end of memory, but it's confusing to the reader.
      
      Also remove a duplicate (and therefore confusing) PG_ARGISNULL check in
      jsonb_build_object.
      
      I happened to notice these things while trolling for missed null-arg
      checks earlier today.  Back-patch to 9.5, not because there is any
      real bug, but just because 9.5 and HEAD are still in sync in this
      file and we might as well keep them so.
      
      In passing, re-pgindent.
      820bdccc
    • Tom Lane's avatar
      Add some checks on "char"-type columns to type_sanity and opr_sanity. · 3ef16c46
      Tom Lane authored
      I noticed that the sanity checks in the regression tests omitted to
      check a couple of "poor man's enum" columns that you'd reasonably
      expect them to check.
      
      There are other "char"-type columns in system catalogs that are not
      covered by either type_sanity or opr_sanity, e.g. pg_rewrite.ev_type.
      However, those catalogs are not populated with any manually-created
      data during bootstrap, so it seems less necessary to check them
      this way.
      3ef16c46
    • Tom Lane's avatar
      Clean up some lack-of-STRICT issues in the core code, too. · 26d538dc
      Tom Lane authored
      A scan for missed proisstrict markings in the core code turned up
      these functions:
      
      brin_summarize_new_values
      pg_stat_reset_single_table_counters
      pg_stat_reset_single_function_counters
      pg_create_logical_replication_slot
      pg_create_physical_replication_slot
      pg_drop_replication_slot
      
      The first three of these take OID, so a null argument will normally look
      like a zero to them, resulting in "ERROR: could not open relation with OID
      0" for brin_summarize_new_values, and no action for the pg_stat_reset_XXX
      functions.  The other three will dump core on a null argument, though this
      is mitigated by the fact that they won't do so until after checking that
      the caller is superuser or has rolreplication privilege.
      
      In addition, the pg_logical_slot_get/peek[_binary]_changes family was
      intentionally marked nonstrict, but failed to make nullness checks on all
      the arguments; so again a null-pointer-dereference crash is possible but
      only for superusers and rolreplication users.
      
      Add the missing ARGISNULL checks to the latter functions, and mark the
      former functions as strict in pg_proc.  Make that change in the back
      branches too, even though we can't force initdb there, just so that
      installations initdb'd in future won't have the issue.  Since none of these
      bugs rise to the level of security issues (and indeed the pg_stat_reset_XXX
      functions hardly misbehave at all), it seems sufficient to do this.
      
      In addition, fix some order-of-operations oddities in the slot_get_changes
      family, mostly cosmetic, but not the part that moves the function's last
      few operations into the PG_TRY block.  As it stood, there was significant
      risk for an error to exit without clearing historical information from
      the system caches.
      
      The slot_get_changes bugs go back to 9.4 where that code was introduced.
      Back-patch appropriate subsets of the pg_proc changes into all active
      branches, as well.
      26d538dc
    • Tom Lane's avatar
      Clean up code for widget_in() and widget_out(). · 1cb63c79
      Tom Lane authored
      Given syntactically wrong input, widget_in() could call atof() with an
      indeterminate pointer argument, typically leading to a crash; or if it
      didn't do that, it might return a NULL pointer, which again would lead
      to a crash since old-style C functions aren't supposed to do things
      that way.  Fix that by correcting the off-by-one syntax test and
      throwing a proper error rather than just returning NULL.
      
      Also, since widget_in and widget_out have been marked STRICT for a
      long time, their tests for null inputs are just dead code; remove 'em.
      In the oldest branches, also improve widget_out to use snprintf not
      sprintf, just to be sure.
      
      In passing, get rid of a long-since-useless sprintf into a local buffer
      that nothing further is done with, and make some other minor coding
      style cleanups.
      
      In the intended regression-testing usage of these functions, none of
      this is very significant; but if the regression test database were
      left around in a production installation, these bugs could amount
      to a minor security hazard.
      
      Piotr Stefaniak, Michael Paquier, and Tom Lane
      1cb63c79
    • Simon Riggs's avatar
      Revoke change to rmgr desc of btree vacuum · b6028426
      Simon Riggs authored
      Per discussion with Andres Freund
      b6028426
    • Tom Lane's avatar
      Add STRICT to some C functions created by the regression tests. · 529baf6a
      Tom Lane authored
      These functions readily crash when passed a NULL input value.  The tests
      themselves do not pass NULL values to them; but when the regression
      database is used as a basis for fuzz testing, they cause a lot of noise.
      Also, if someone were to leave a regression database lying about in a
      production installation, these would create a minor security hazard.
      
      Andreas Seltenreich
      529baf6a
    • Simon Riggs's avatar
      Avoid pin scan for replay of XLOG_BTREE_VACUUM · 687f2cd7
      Simon Riggs authored
      Replay of XLOG_BTREE_VACUUM during Hot Standby was previously thought to require
      complex interlocking that matched the requirements on the master. This required
      an O(N) operation that became a significant problem with large indexes, causing
      replication delays of seconds or in some cases minutes while the
      XLOG_BTREE_VACUUM was replayed.
      
      This commit skips the “pin scan” that was previously required, by observing in
      detail when and how it is safe to do so, with full documentation. The pin scan
      is skipped only in replay; the VACUUM code path on master is not touched here.
      
      The current commit still performs the pin scan for toast indexes, though this
      can also be avoided if we recheck scans on toast indexes. Later patch will
      address this.
      
      No tests included. Manual tests using an additional patch to view WAL records
      and their timing have shown the change in WAL records and their handling has
      successfully reduced replication delay.
      687f2cd7
  10. 08 Jan, 2016 6 commits
    • Alvaro Herrera's avatar
      Revert "Blind attempt at a Cygwin fix" · 46317211
      Alvaro Herrera authored
      This reverts commit e9282e95, which blew
      up in a pretty spectacular way.  Re-introduce the original code while we
      search for a real fix.
      46317211
    • Alvaro Herrera's avatar
      Blind attempt at a Cygwin fix · e9282e95
      Alvaro Herrera authored
      Further portability fix for a9676139.  Mingw- and MSVC-based builds
      appear to be working fine, but Cygwin needs an extra tweak whereby the
      new win32security.c file is explicitely added to the list of files to
      build in pgport, per Cygwin members brolga and lorikeet.
      
      Author: Michael Paquier
      e9282e95
    • Magnus Hagander's avatar
      Fix typo in comment · 2650486e
      Magnus Hagander authored
      Tatsuro Yamada
      2650486e
    • Magnus Hagander's avatar
      Remove reundand include of TestLib · c662ef1d
      Magnus Hagander authored
      Kyotaro HORIGUCHI
      c662ef1d
    • Tom Lane's avatar
      Marginal cleanup of GROUPING SETS code in grouping_planner(). · a54676ac
      Tom Lane authored
      Improve comments and make it a shade less messy.  I think we might want
      to move all of this somewhere else later, but it needs to be more
      readable first.
      
      In passing, re-pgindent the file, affecting some recently-added comments
      concerning parallel query planning.
      a54676ac
    • Tom Lane's avatar
      Delay creation of subplan tlist until after create_plan(). · c44d0138
      Tom Lane authored
      Once upon a time it was necessary for grouping_planner() to determine
      the tlist it wanted from the scan/join plan subtree before it called
      query_planner(), because query_planner() would actually make a Plan using
      that.  But we refactored things a long time ago to delay construction of
      the Plan tree till later, so there's no need to build that tlist until
      (and indeed unless) we're ready to plaster it onto the Plan.  The only
      thing query_planner() cares about is what Vars are going to be needed for
      the tlist, and it can perfectly well get that by looking at the real tlist
      rather than some masticated version.
      
      Well, actually, there is one minor glitch in that argument, which is that
      make_subplanTargetList also adds Vars appearing only in HAVING to the
      tlist it produces.  So now we have to account for HAVING explicitly in
      build_base_rel_tlists.  But that just adds a few lines of code, and
      I doubt it moves the needle much on processing time; we might be doing
      pull_var_clause() twice on the havingQual, but before we had it scanning
      dummy tlist entries instead.
      
      This is a very small down payment on rationalizing grouping_planner
      enough so it can be refactored.
      c44d0138
  11. 07 Jan, 2016 1 commit