1. 22 Mar, 2018 26 commits
    • Andres Freund's avatar
      Add FIELDNO_* macro designating offset into structs required for JIT. · 7ced1d12
      Andres Freund authored
      For any interesting JIT target, fields inside structs need to be
      accessed. b96d550e contains infrastructure for syncing the definition
      of types between postgres C code and runtime code generation with
      LLVM. But that doesn't sync the number or names of fields inside
      structs, just the types (including padding etc).
      
      One option would be to hardcode the offset numbers in the JIT code,
      but that'd be hard to keep in sync. Instead add macros indicating the
      field offset to the fields that need to be accessed. Not pretty, but
      manageable.
      
      Author: Andres Freund
      Discussion: https://postgr.es/m/20170901064131.tazjxwus3k2w3ybh@alap3.anarazel.de
      7ced1d12
    • Andres Freund's avatar
    • Tom Lane's avatar
      Improve style guideline compliance of assorted error-report messages. · feb82545
      Tom Lane authored
      Per the project style guide, details and hints should have leading
      capitalization and end with a period.  On the other hand, errcontext should
      not be capitalized and should not end with a period.  To support well
      formatted error contexts in dblink, extend dblink_res_error() to take a
      format+arguments rather than a hardcoded string.
      
      Daniel Gustafsson
      
      Discussion: https://postgr.es/m/B3C002C8-21A0-4F53-A06E-8CAB29FCF295@yesql.se
      feb82545
    • Robert Haas's avatar
      Consider Parallel Append of partial paths for UNION [ALL]. · 88ba0ae2
      Robert Haas authored
      Without this patch, we can implement a UNION or UNION ALL as an
      Append where Gather appears beneath one or more of the Append
      branches, but this lets us put the Gather node on top, with
      a partial path for each relation underneath.
      
      There is considerably more work that could be done to improve
      planning in this area, but that will probably need to wait
      for a future release.
      
      Patch by me, reviewed and tested by Ashutosh Bapat and Rajkumar
      Raghuwanshi.
      
      Discussion: http://postgr.es/m/CA+TgmoaLRAOqHmMZx=ESM3VDEPceg+-XXZsRXQ8GtFJO_zbMSw@mail.gmail.com
      88ba0ae2
    • Tom Lane's avatar
      Sync up our various ways of estimating pg_class.reltuples. · 7c91a036
      Tom Lane authored
      VACUUM thought that reltuples represents the total number of tuples in
      the relation, while ANALYZE counted only live tuples.  This can cause
      "flapping" in the value when background vacuums and analyzes happen
      separately.  The planner's use of reltuples essentially assumes that
      it's the count of live (visible) tuples, so let's standardize on having
      it mean live tuples.
      
      Another issue is that the definition of "live tuple" isn't totally clear;
      what should be done with INSERT_IN_PROGRESS or DELETE_IN_PROGRESS tuples?
      ANALYZE's choices in this regard are made on the assumption that if the
      originating transaction commits at all, it will happen after ANALYZE
      finishes, so we should ignore the effects of the in-progress transaction
      --- unless it is our own transaction, and then we should count it.
      Let's propagate this definition into VACUUM, too.
      
      Likewise propagate this definition into CREATE INDEX, and into
      contrib/pgstattuple's pgstattuple_approx() function.
      
      Tomas Vondra, reviewed by Haribabu Kommi, some corrections by me
      
      Discussion: https://postgr.es/m/16db4468-edfa-830a-f921-39a50498e77e@2ndquadrant.com
      7c91a036
    • Andres Freund's avatar
      Basic planner and executor integration for JIT. · cc415a56
      Andres Freund authored
      This adds simple cost based plan time decision about whether JIT
      should be performed. jit_above_cost, jit_optimize_above_cost are
      compared with the total cost of a plan, and if the cost is above them
      JIT is performed / optimization is performed respectively.
      
      For that PlannedStmt and EState have a jitFlags (es_jit_flags) field
      that stores information about what JIT operations should be performed.
      
      EState now also has a new es_jit field, which can store a
      JitContext. When there are no errors the context is released in
      standard_ExecutorEnd().
      
      It is likely that the default values for jit_[optimize_]above_cost
      will need to be adapted further, but in my test these values seem to
      work reasonably.
      
      Author: Andres Freund, with feedback by Peter Eisentraut
      Discussion: https://postgr.es/m/20170901064131.tazjxwus3k2w3ybh@alap3.anarazel.de
      cc415a56
    • Andres Freund's avatar
      Add helpers for emitting LLVM IR. · 7ec0d80c
      Andres Freund authored
      These basically just help to make code a bit more concise and pgindent
      proof.
      
      Author: Andres Freund
      Discussion: https://postgr.es/m/20170901064131.tazjxwus3k2w3ybh@alap3.anarazel.de
      7ec0d80c
    • Andres Freund's avatar
      Debugging and profiling support for LLVM JIT provider. · 250bca7f
      Andres Freund authored
      This currently requires patches to the LLVM codebase to be
      effective (submitted upstream), the GUCs are available without those
      patches however.
      
      Author: Andres Freund
      Discussion: https://postgr.es/m/20170901064131.tazjxwus3k2w3ybh@alap3.anarazel.de
      250bca7f
    • Andres Freund's avatar
      Support for optimizing and emitting code in LLVM JIT provider. · b96d550e
      Andres Freund authored
      This commit introduces the ability to actually generate code using
      LLVM. In particular, this adds:
      
      - Ability to emit code both in heavily optimized and largely
        unoptimized fashion
      - Batching facility to allow functions to be defined in small
        increments, but optimized and emitted in executable form in larger
        batches (for performance and memory efficiency)
      - Type and function declaration synchronization between runtime
        generated code and normal postgres code. This is critical to be able
        to access struct fields etc.
      - Developer oriented jit_dump_bitcode GUC, for inspecting / debugging
        the generated code.
      - per JitContext statistics of number of functions, time spent
        generating code, optimizing, and emitting it.  This will later be
        employed for EXPLAIN support.
      
      This commit doesn't yet contain any code actually generating
      functions. That'll follow in later commits.
      
      Documentation for GUCs added, and for JIT in general, will be added in
      later commits.
      
      Author: Andres Freund, with contributions by Pierre Ducroquet
      Testing-By: Thomas Munro, Peter Eisentraut
      Discussion: https://postgr.es/m/20170901064131.tazjxwus3k2w3ybh@alap3.anarazel.de
      b96d550e
    • Robert Haas's avatar
    • Robert Haas's avatar
      Fix typo in comment. · 8a8c4f3b
      Robert Haas authored
      Michael Paquier
      
      Discussion: http://postgr.es/m/20180205071404.GB17337@paquier.xyz
      8a8c4f3b
    • Robert Haas's avatar
    • Tom Lane's avatar
      Fix tuple counting in SP-GiST index build. · 649f1792
      Tom Lane authored
      Count the number of tuples in the index honestly, instead of assuming
      that it's the same as the number of tuples in the heap.  (It might be
      different if the index is partial.)
      
      Back-patch to all supported versions.
      
      Tomas Vondra
      
      Discussion: https://postgr.es/m/3b3d8eac-c709-0d25-088e-b98339a1b28a@2ndquadrant.com
      649f1792
    • Robert Haas's avatar
      Call pgstat_report_activity() in parallel CREATE INDEX workers. · 7de4a1bc
      Robert Haas authored
      Also set debug_query_string.
      
      Oversight in commit 9da0cc35
      
      Peter Geoghegan, per a report by Phil Florent.
      
      Discussion: https://postgr.es/m/CAH2-Wzmf-34hD4n40uTuE-ZY9P5c%2BmvhFbCdQfN%3DKrKiVm3j3A%40mail.gmail.com
      7de4a1bc
    • Tom Lane's avatar
      Fix errors in contrib/bloom index build. · c35b4728
      Tom Lane authored
      Count the number of tuples in the index honestly, instead of assuming
      that it's the same as the number of tuples in the heap.  (It might be
      different if the index is partial.)
      
      Fix counting of tuples in current index page, too.  This error would
      have led to failing to write out the final page of the index if it
      contained exactly one tuple, so that the last tuple of the relation
      would not get indexed.
      
      Back-patch to 9.6 where contrib/bloom was added.
      
      Tomas Vondra and Tom Lane
      
      Discussion: https://postgr.es/m/3b3d8eac-c709-0d25-088e-b98339a1b28a@2ndquadrant.com
      c35b4728
    • Robert Haas's avatar
      Implement partition-wise grouping/aggregation. · e2f1eb0e
      Robert Haas authored
      If the partition keys of input relation are part of the GROUP BY
      clause, all the rows belonging to a given group come from a single
      partition.  This allows aggregation/grouping over a partitioned
      relation to be broken down * into aggregation/grouping on each
      partition.  This should be no worse, and often better, than the normal
      approach.
      
      If the GROUP BY clause does not contain all the partition keys, we can
      still perform partial aggregation for each partition and then finalize
      aggregation after appending the partial results.  This is less certain
      to be a win, but it's still useful.
      
      Jeevan Chalke, Ashutosh Bapat, Robert Haas.  The larger patch series
      of which this patch is a part was also reviewed and tested by Antonin
      Houska, Rajkumar Raghuwanshi, David Rowley, Dilip Kumar, Konstantin
      Knizhnik, Pascal Legrand, and Rafia Sabih.
      
      Discussion: http://postgr.es/m/CAM2+6=V64_xhstVHie0Rz=KPEQnLJMZt_e314P0jaT_oJ9MR8A@mail.gmail.com
      e2f1eb0e
    • Teodor Sigaev's avatar
      Add conditional.c to libpgfeutils for MSVC build · 2058d6a2
      Teodor Sigaev authored
      conditional.c was moved in f67b113a commit
      but forgotten to add to Windows build system.
      
      I don't have a Windows box, so blind attempt.
      2058d6a2
    • Teodor Sigaev's avatar
      UINT64CONST'fy long constants in pgbench · 2216fded
      Teodor Sigaev authored
      In commit e51a0484 it was missed 64-bit
      constants, wrap them with UINT64CONST().
      
      Per buildfarm member dromedary and gripe from Tom Lane
      2216fded
    • Teodor Sigaev's avatar
      Add \if support to pgbench · f67b113a
      Teodor Sigaev authored
      Patch adds \if to pgbench as it done for psql. Implementation shares condition
      stack code with psql, so, this code is moved to fe_utils directory.
      
      Author: Fabien COELHO with minor editorization by me
      Review by: Vik Fearing, Fedor Sigaev
      Discussion: https://www.postgresql.org/message-id/flat/alpine.DEB.2.20.1711252200190.28523@lancre
      f67b113a
    • Dean Rasheed's avatar
      Improve ANALYZE's strategy for finding MCVs. · b5db1d93
      Dean Rasheed authored
      Previously, a value was included in the MCV list if its frequency was
      25% larger than the estimated average frequency of all nonnull values
      in the table.  For uniform distributions, that can lead to values
      being included in the MCV list and significantly overestimated on the
      basis of relatively few (sometimes just 2) instances being seen in the
      sample.  For non-uniform distributions, it can lead to too few values
      being included in the MCV list, since the overall average frequency
      may be dominated by a small number of very common values, while the
      remaining values may still have a large spread of frequencies, causing
      both substantial overestimation and underestimation of the remaining
      values.  Furthermore, increasing the statistics target may have little
      effect because the overall average frequency will remain relatively
      unchanged.
      
      Instead, populate the MCV list with the largest set of common values
      that are statistically significantly more common than the average
      frequency of the remaining values.  This takes into account the
      variance of the sample counts, which depends on the counts themselves
      and on the proportion of the table that was sampled.  As a result, it
      constrains the relative standard error of estimates based on the
      frequencies of values in the list, reducing the chances of too many
      values being included.  At the same time, it allows more values to be
      included, since the MCVs need only be more common than the remaining
      non-MCVs, rather than the overall average.  Thus it tends to produce
      fewer MCVs than the previous code for uniform distributions, and more
      for non-uniform distributions, reducing estimation errors in both
      cases.  In addition, the algorithm responds better to increasing the
      statistics target, allowing more values to be included in the MCV list
      when more of the table is sampled.
      
      Jeff Janes, substantially modified by me. Reviewed by John Naylor and
      Tomas Vondra.
      
      Discussion: https://postgr.es/m/CAMkU=1yvdGvW9TmiLAhz2erFnvnPFYHbOZuO+a=4DVkzpuQ2tw@mail.gmail.com
      b5db1d93
    • Andres Freund's avatar
    • Andres Freund's avatar
      Basic JIT provider and error handling infrastructure. · 432bb9e0
      Andres Freund authored
      This commit introduces:
      
      1) JIT provider abstraction, which allows JIT functionality to be
         implemented in separate shared libraries. That's desirable because
         it allows to install JIT support as a separate package, and because
         it allows experimentation with different forms of JITing.
      2) JITContexts which can be, using functions introduced in follow up
         commits, used to emit JITed functions, and have them be cleaned up
         on error.
      3) The outline of a LLVM JIT provider, which will be fleshed out in
         subsequent commits.
      
      Documentation for GUCs added, and for JIT in general, will be added in
      later commits.
      
      Author: Andres Freund, with architectural input from Jeff Davis
      Discussion: https://postgr.es/m/20170901064131.tazjxwus3k2w3ybh@alap3.anarazel.de
      432bb9e0
    • Andres Freund's avatar
      Fix typo in BITCODE_CXXFLAGS assignment. · 4317cc68
      Andres Freund authored
      Typoed-In: 5b2526c8
      Reported-By: Catalin Iacob
      4317cc68
    • Andres Freund's avatar
      Empty CXXFLAGS inherited from autoconf. · a02671cf
      Andres Freund authored
      We do the same for CFLAGS.  This was an omission in 6869b4f2.
      
      Reported-By: Catalin Iacob
      a02671cf
    • Tom Lane's avatar
      Prevent extensions from creating custom GUCs that are GUC_LIST_QUOTE. · 846b5a52
      Tom Lane authored
      Pending some solution for the problems noted in commit 74286994,
      disallow dynamic creation of GUC_LIST_QUOTE variables.
      
      If there are any extensions out there using this feature, they'd not
      be happy for us to start enforcing this rule in minor releases, so
      this is a HEAD-only change.  The previous commit didn't make things
      any worse than they already were for such cases.
      
      Discussion: https://postgr.es/m/20180111064900.GA51030@paquier.xyz
      846b5a52
    • Tom Lane's avatar
      Fix mishandling of quoted-list GUC values in pg_dump and ruleutils.c. · 74286994
      Tom Lane authored
      Code that prints out the contents of setconfig or proconfig arrays in
      SQL format needs to handle GUC_LIST_QUOTE variables differently from
      other ones, because for those variables, flatten_set_variable_args()
      already applied a layer of quoting.  The value can therefore safely
      be printed as-is, and indeed must be, or flatten_set_variable_args()
      will muck it up completely on reload.  For all other GUC variables,
      it's necessary and sufficient to quote the value as a SQL literal.
      
      We'd recognized the need for this long ago, but mis-analyzed the
      need slightly, thinking that all GUC_LIST_INPUT variables needed
      the special treatment.  That's actually wrong, since a valid value
      of a LIST variable might include characters that need quoting,
      although no existing variables accept such values.
      
      More to the point, we hadn't made any particular effort to keep the
      various places that deal with this up-to-date with the set of variables
      that actually need special treatment, meaning that we'd do the wrong
      thing with, for example, temp_tablespaces values.  This affects dumping
      of SET clauses attached to functions, as well as ALTER DATABASE/ROLE SET
      commands.
      
      In ruleutils.c we can fix it reasonably honestly by exporting a guc.c
      function that allows discovering the flags for a given GUC variable.
      But pg_dump doesn't have easy access to that, so continue the old method
      of having a hard-wired list of affected variable names.  At least we can
      fix it to have just one list not two, and update the list to match
      current reality.
      
      A remaining problem with this is that it only works for built-in
      GUC variables.  pg_dump's list obvious knows nothing of third-party
      extensions, and even the "ask guc.c" method isn't bulletproof since
      the relevant extension might not be loaded.  There's no obvious
      solution to that, so for now, we'll just have to discourage extension
      authors from inventing custom GUCs that need GUC_LIST_QUOTE.
      
      This has been busted for a long time, so back-patch to all supported
      branches.
      
      Michael Paquier and Tom Lane, reviewed by Kyotaro Horiguchi and
      Pavel Stehule
      
      Discussion: https://postgr.es/m/20180111064900.GA51030@paquier.xyz
      74286994
  2. 21 Mar, 2018 14 commits