1. 02 Apr, 2016 2 commits
  2. 01 Apr, 2016 13 commits
    • Alvaro Herrera's avatar
      f07d18b6
    • Tom Lane's avatar
      Omit null rows when setting the threshold for what's a most-common value. · 3d3bf62f
      Tom Lane authored
      As with the previous patch, large numbers of null rows could skew this
      calculation unfavorably, causing us to discard values that have a
      legitimate claim to be MCVs, since our definition of MCV is that it's
      most common among the non-null population of the column.  Hence, make
      the numerator of avgcount be the number of non-null sample values not
      the number of sample rows; likewise for maxmincount in the
      compute_scalar_stats variant.
      
      Also, make the denominator be the number of distinct values actually
      observed in the sample, rather than reversing it back out of the computed
      stadistinct.  This avoids depending on the accuracy of the Haas-Stokes
      approximation, and really it's what we want anyway; the threshold should
      depend only on what we see in the sample, not on what we extrapolate
      about the contents of the whole column.
      
      Alex Shulgin, reviewed by Tomas Vondra and myself
      3d3bf62f
    • Alvaro Herrera's avatar
      pgbench: Remove unused parameter · 5cb88267
      Alvaro Herrera authored
      For some reason this parameter was introduced as unused in 3da0dfb4,
      and has never been used for anything.  Remove it.
      
      Author: Fabien Coelho
      5cb88267
    • Tom Lane's avatar
      Omit null rows when applying the Haas-Stokes estimator for ndistinct. · be4b4dc7
      Tom Lane authored
      Previously, we included null rows in the values of n and N that went
      into the formula, which amounts to considering null as a value in its
      own right; but the d and f1 values do not include nulls.  This is
      inconsistent, and it contributes to significant underestimation of
      ndistinct when the column is mostly nulls.  In any case stadistinct
      is defined as the number of distinct non-null values, so we should
      exclude nulls when doing this computation.
      
      This is an aboriginal bug in our application of the Haas-Stokes formula,
      but we'll refrain from back-patching for fear of destabilizing plan
      choices in released branches.
      
      While at it, make the code a bit more readable by omitting unnecessary
      casts and intermediate variables.
      
      Observation and original patch by Tomas Vondra, adjusted to fix both
      uses of the formula by Alex Shulgin, cosmetic improvements by me
      be4b4dc7
    • Alvaro Herrera's avatar
      Fix logical_decoding_timelines test crashes · 82c83b33
      Alvaro Herrera authored
      In the test_slot_timelines test module, we were abusing passing NULL
      values which was received as zeroes in x86, but this breaks in ARM
      (buildfarm member hamster) by crashing instead.  Fix the breakage by
      marking these functions as STRICT; the InvalidXid value that was
      previously implicit in NULL values (on x86 at least) can now be passed
      as 0.  Failing to follow the fmgr protocol to check for NULLs beforehand
      was causing ARM to fail, as evidenced by segmentation faults in
      buildfarm member hamster.
      
      In order to use the new functionality in the test script, use COALESCE
      in the right spot to avoid forwarding NULL values.
      
      This was diagnosed from the hamster crash by Craig Ringer, who also
      proposed a different patch (checking for NULL values explicitely in the
      C function code, and keeping the non-strictness in the C functions).
      I decided to go with this approach instead.
      82c83b33
    • Teodor Sigaev's avatar
      Fixes in bloom contrib module missed during review · 27f3bbfa
      Teodor Sigaev authored
      - macroses llike (var & FLAG) are changed to ((var & FLAG) != 0)
      - do not copy uninitialized part of notFullPage array to page
      27f3bbfa
    • Alvaro Herrera's avatar
      Type names should not be quoted · f402b995
      Alvaro Herrera authored
      Our actual convention, contrary to what I said in 59a2111b, is not to
      quote type names, as evidenced by unquoted use of format_type_be()
      result value in error messages.  Remove quotes from recently tweaked
      messages accordingly.
      
      Per note from Tom Lane
      f402b995
    • Tom Lane's avatar
      Get rid of minus zero in box regression test. · a067b504
      Tom Lane authored
      Commit acdf2a8b added a test case involving minus zero as a box endpoint.
      This is not very portable, as evidenced by the several older buildfarm
      members that are failing on the test because they print minus zero as
      just "0".  If there were any significant reason to test this behavior,
      we could consider carrying a separate expected-file; but it doesn't look
      to me like there's adequate justification to accept such a maintenance
      burden.  Just change the test to use plain zero, instead.
      a067b504
    • Tom Lane's avatar
      Fix oversight in getParamDescriptions(), and improve comments. · 23066960
      Tom Lane authored
      When getParamDescriptions was changed to handle out-of-memory better
      by cribbing error recovery logic from getRowDescriptions/getAnotherTuple,
      somebody omitted to copy the stanza about checking for excess data in
      the message.  But you need to do that, since continue'ing out of the
      switch in pqParseInput3 means no such check gets applied there anymore.
      Noted while looking at Michael Paquier's patch that made yet another
      copy of this advance_and_error logic.
      
      (This whole business desperately needs refactoring, because I sure don't
      want to see a dozen copies of this code, but that's where we seem to be
      headed.  What's more, the "suspend parsing on EOF return" convention is a
      holdover from protocol 2 and shouldn't exist at all in protocol 3, because
      we don't process partial messages anymore.  But for now, just fix the
      obvious bug.)
      
      Also, fix some wrong/missing comments about what the API spec is
      for these three functions.
      
      This doesn't seem worthy of back-patching, even though it's a bug;
      the case shouldn't ever arise in the field.
      23066960
    • Teodor Sigaev's avatar
      Fix English in bloom module documentation · a361c22e
      Teodor Sigaev authored
      Author: Erik Rijkers
      a361c22e
    • Teodor Sigaev's avatar
      Bloom index contrib module · 9ee014fc
      Teodor Sigaev authored
      Module provides new access method. It is actually a simple Bloom filter
      implemented as pgsql's index. It could give some benefits on search
      with large number of columns.
      
      Module is a single way to test generic WAL interface committed earlier.
      
      Author: Teodor Sigaev, Alexander Korotkov
      Reviewers: Aleksander Alekseev, Michael Paquier, Jim Nasby
      9ee014fc
    • Teodor Sigaev's avatar
      Fix typo in generic wal docs · 4e56e5a6
      Teodor Sigaev authored
      Markus Nullmeier
      4e56e5a6
    • Teodor Sigaev's avatar
      Add Generic WAL interface · 65578341
      Teodor Sigaev authored
      This interface is designed to give an access to WAL for extensions which
      could implement new access method, for example. Previously it was
      impossible because restoring from custom WAL would need to access system
      catalog to find a redo custom function. This patch suggests generic way
      to describe changes on page with standart layout.
      
      Bump XLOG_PAGE_MAGIC because of new record type.
      
      Author: Alexander Korotkov with a help of Petr Jelinek, Markus Nullmeier and
      	minor editorization by my
      Reviewers: Petr Jelinek, Alvaro Herrera, Teodor Sigaev, Jim Nasby,
      	Michael Paquier
      65578341
  3. 31 Mar, 2016 6 commits
    • Tom Lane's avatar
      Another zic portability fix. · c202ecf9
      Tom Lane authored
      I should have remembered that we can't use INT64_MODIFIER with sscanf():
      configure chooses that to work with snprintf(), but it might be for our
      src/port/snprintf.c implementation and so not compatible with the
      platform's sscanf().  This appears to be the explanation for buildfarm
      member frogmouth's continuing unhappiness with the tzcode update.
      
      Fortunately, in all of the places where zic is attempting to read into
      an int64 variable, it's reading a year which certainly will fit just fine
      into an int.  So make it read into an int with %d, and then cast or copy
      as necessary.
      c202ecf9
    • Alvaro Herrera's avatar
      Fix recovery_min_apply_delay test · 61608d38
      Alvaro Herrera authored
      Previously this test was relying too much on WAL replay to occur in the
      exact configured interval, which was unreliable on slow or overly busy
      servers.  Use a custom loop instead of poll_query_until, which is
      hopefully more reliable.
      
      Per continued failures on buildfarm member hamster (which is probably
      the only one running this test suite)
      
      Author: Michaël Paquier
      61608d38
    • Tom Lane's avatar
      Support using index-only scans with partial indexes in more cases. · f9aefcb9
      Tom Lane authored
      Previously, the planner would reject an index-only scan if any restriction
      clause for its table used a column not available from the index, even
      if that restriction clause would later be dropped from the plan entirely
      because it's implied by the index's predicate.  This is a fairly common
      situation for partial indexes because predicates using columns not included
      in the index are often the most useful kind of predicate, and we have to
      duplicate (or at least imply) the predicate in the WHERE clause in order
      to get the index to be considered at all.  So index-only scans were
      essentially unavailable with such partial indexes.
      
      To fix, we have to do detection of implied-by-predicate clauses much
      earlier in the planner.  This patch puts it in check_index_predicates
      (nee check_partial_indexes), meaning it gets done for every partial index,
      whereas we previously only considered this issue at createplan time,
      so that the work was only done for an index actually selected for use.
      That could result in a noticeable planning slowdown for queries against
      tables with many partial indexes.  However, testing suggested that there
      isn't really a significant cost, especially not with reasonable numbers
      of partial indexes.  We do get a small additional benefit, which is that
      cost_index is more accurate since it correctly discounts the evaluation
      cost of clauses that will be removed.  We can also avoid considering such
      clauses as potential indexquals, which saves useless matching cycles in
      the case where the predicate columns aren't in the index, and prevents
      generating bogus plans that double-count the clause's selectivity when
      the columns are in the index.
      
      Tomas Vondra and Kyotaro Horiguchi, reviewed by Kevin Grittner and
      Konstantin Knizhnik, and whacked around a little by me
      f9aefcb9
    • Alvaro Herrera's avatar
      Fix broken variable declaration · 3501f71c
      Alvaro Herrera authored
      Author: Konstantin Knizhnik
      3501f71c
    • Alvaro Herrera's avatar
      Blind attempt at fixing Win32 issue on 24c5f1a1 · 3dd0792a
      Alvaro Herrera authored
      As best as I can tell, MyReplicationSlot needs to be PGDLLIMPORT in
      order for the new test_slot_timelines test module to compile.
      
      Per buildfarm
      3dd0792a
    • Fujii Masao's avatar
      Use proper format specifier %X/%X for LSN. · cee31f5f
      Fujii Masao authored
      cee31f5f
  4. 30 Mar, 2016 13 commits
    • Alvaro Herrera's avatar
      I forgot the alternate expected file in previous commit · 3a3b3090
      Alvaro Herrera authored
      Without this, the test_slot_timelines modules fails "make installcheck"
      because the required feature is not enabled in a stock server.
      
      Per buildfarm
      3a3b3090
    • Alvaro Herrera's avatar
      Enable logical slots to follow timeline switches · 24c5f1a1
      Alvaro Herrera authored
      When decoding from a logical slot, it's necessary for xlog reading to be
      able to read xlog from historical (i.e. not current) timelines;
      otherwise, decoding fails after failover, because the archives are in
      the historical timeline.  This is required to make "failover logical
      slots" possible; it currently has no other use, although theoretically
      it could be used by an extension that creates a slot on a standby and
      continues to replay from the slot when the standby is promoted.
      
      This commit includes a module in src/test/modules with functions to
      manipulate the slots (which is not otherwise possible in SQL code) in
      order to enable testing, and a new test in src/test/recovery to ensure
      that the behavior is as expected.
      
      Author: Craig Ringer
      Reviewed-By: Oleksii Kliukin, Andres Freund, Petr Jelínek
      24c5f1a1
    • Alvaro Herrera's avatar
      XLogReader general code cleanup · 3b02ea4f
      Alvaro Herrera authored
      Some minor tweaks and comment additions, for cleanliness sake and to
      avoid having the upcoming timeline-following patch be polluted with
      unrelated cleanup.
      
      Extracted from a larger patch by Craig Ringer, reviewed by Andres
      Freund, with some additions by myself.
      3b02ea4f
    • Tom Lane's avatar
      Improve portability of I/O behavior for the geometric types. · 50861cd6
      Tom Lane authored
      Formerly, the geometric I/O routines such as box_in and point_out relied
      directly on strtod() and sprintf() for conversion of the float8 component
      values of their data types.  However, the behavior of those functions is
      pretty platform-dependent, especially for edge-case values such as
      infinities and NaNs.  This was exposed by commit acdf2a8b, which
      added test cases involving boxes with infinity endpoints, and immediately
      failed on Windows and AIX buildfarm members.  We solved these problems
      years ago in the main float8in and float8out functions, so let's fix it
      by making the geometric types use that code instead of depending directly
      on the platform-supplied functions.
      
      To do this, refactor the float8in code so that it can be used to parse
      just part of a string, and as a convenience make the guts of float8out
      usable without going through DirectFunctionCall.
      
      While at it, get rid of geo_ops.c's fairly shaky assumptions about the
      maximum output string length for a double, by having it build results in
      StringInfo buffers instead of fixed-length strings.
      
      In passing, convert all the "invalid input syntax for type foo" messages
      in this area of the code into "invalid input syntax for type %s" to reduce
      the number of distinct translatable strings, per recent discussion.
      We would have needed a fair number of the latter anyway for code-sharing
      reasons, so we might as well just go whole hog.
      
      Note: this patch is by no means intended to guarantee that the geometric
      types uniformly behave sanely for infinity or NaN component values.
      But any bugs we have in that line were there all along, they were just
      harder to reach in a platform-independent way.
      50861cd6
    • Tom Lane's avatar
      Suppress uninitialized-variable warnings. · 818e5937
      Tom Lane authored
      My compiler doesn't like the lack of initialization of "flag", and
      I think it's right: if there were zero keys we'd have an undefined
      result.  The AND of zero items is TRUE, so initialize to TRUE.
      818e5937
    • Teodor Sigaev's avatar
      Bump catalog version, forget in acdf2a8b · 2d02a856
      Teodor Sigaev authored
      2d02a856
    • Teodor Sigaev's avatar
      Introduce SP-GiST operator class over box. · acdf2a8b
      Teodor Sigaev authored
      Patch implements quad-tree over boxes, naive approach of 2D quad tree will not
      work for any non-point objects because splitting space on node is not
      efficient. The idea of pathc is treating 2D boxes as 4D points, so,
      object will not overlap (in 4D space).
      
      The performance tests reveal that this technique especially beneficial
      with too much overlapping objects, so called "spaghetti data".
      
      Author: Alexander Lebedev with editorization by Emre Hasegeli and me
      acdf2a8b
    • Teodor Sigaev's avatar
      Use traversalValue in SP-GiST range opclass. · 87545f54
      Teodor Sigaev authored
      Author: Alexander Lebedev
      87545f54
    • Teodor Sigaev's avatar
      Introduce traversalValue for SP-GiST scan · ccd6eb49
      Teodor Sigaev authored
      During scan sometimes it would be very helpful to know some information about
      parent node or all 	ancestor nodes. Right now reconstructedValue could be used
      but it's not a right usage of it (range opclass uses that).
      
      traversalValue is arbitrary piece of memory in separate MemoryContext while
      reconstructedVale should have the same type as indexed column.
      
      Subsequent patches for range opclass and quad4d tree will use it.
      
      Author: Alexander Lebedev, Teodor Sigaev
      ccd6eb49
    • Magnus Hagander's avatar
      Add missing gss option to msvc config template · 3063e7a8
      Magnus Hagander authored
      Michael Paquier
      3063e7a8
    • Tom Lane's avatar
      Remove just-added tests for to_timestamp(float8) with out-of-range inputs. · c53ab8a3
      Tom Lane authored
      Reporting the specific out-of-range input value produces platform-dependent
      results.  We could skip reporting the value, but that's contrary to our
      message style guidelines and unhelpful to users.  Or we could add a
      separate expected-output file for Windows, but that would be a substantial
      maintenance burden, and these test cases seem unlikely to be worth it.
      
      Per buildfarm.
      c53ab8a3
    • Tom Lane's avatar
      Remove TZ environment-variable entry from postgres reference page. · c3834ef9
      Tom Lane authored
      The server hasn't paid attention to the TZ environment variable since
      commit ca4af308, but that commit missed removing this documentation
      reference, as did commit d883b916 which added the reference where
      it now belongs (initdb).
      
      Back-patch to 9.2 where the behavior changed.  Also back-patch
      d883b916 as needed.
      
      Matthew Somerville
      c3834ef9
    • Robert Haas's avatar
      Add new replication mode synchronous_commit = 'remote_apply'. · 314cbfc5
      Robert Haas authored
      In this mode, the master waits for the transaction to be applied on
      the remote side, not just written to disk.  That means that you can
      count on a transaction started on the standby to see all commits
      previously acknowledged by the master.
      
      To make this work, the standby sends a reply after replaying each
      commit record generated with synchronous_commit >= 'remote_apply'.
      This introduces a small inefficiency: the extra replies will be sent
      even by standbys that aren't the current synchronous standby.  But
      previously-existing synchronous_commit levels make no attempt at all
      to optimize which replies are sent based on what the primary cares
      about, so this is no worse, and at least avoids any extra replies for
      people not using the feature at all.
      
      Thomas Munro, reviewed by Michael Paquier and by me.  Some additional
      tweaks by me.
      314cbfc5
  5. 29 Mar, 2016 6 commits
    • Tom Lane's avatar
      Fix interval_mul() to not produce insane results. · a898b409
      Tom Lane authored
      interval_mul() attempts to prevent its calculations from producing silly
      results, but it forgot that zero times infinity yields NaN in IEEE
      arithmetic.  Hence, a case like '1 second'::interval * 'infinity'::float8
      produced a NaN for the months product, which didn't trigger the range
      check, resulting in bogus and possibly platform-dependent output.
      
      This isn't terribly obvious to the naked eye because if you try that
      exact case, you get "interval out of range" which is what you expect
      --- but if you look closer, the error is coming from interval_out not
      interval_mul.  interval_mul has allowed a bogus value into the system.
      
      Fix by adding isnan tests.
      
      Noted while testing Vitaly Burovoy's fix for infinity input to
      to_timestamp().  Given the lack of field complaints, I doubt this
      is worth a back-patch.
      a898b409
    • Tom Lane's avatar
      Allow to_timestamp(float8) to convert float infinity to timestamp infinity. · e511d878
      Tom Lane authored
      With the original SQL-function implementation, such cases failed because
      we don't support infinite intervals.  Converting the function to C lets
      us bypass the interval representation, which should be a bit faster as
      well as more flexible.
      
      Vitaly Burovoy, reviewed by Anastasia Lubennikova
      e511d878
    • Robert Haas's avatar
      Fix bug in aggregate (de)serialization commit. · 96f8373c
      Robert Haas authored
      resulttypeLen and resulttypeByVal must be set correctly when serializing
      aggregates, not just when finalizing them.  This was in David's final
      patch but I downloaded the wrong version by mistake and failed to spot
      the error.
      
      David Rowley
      96f8373c
    • Robert Haas's avatar
      Allow aggregate transition states to be serialized and deserialized. · 5fe5a2ce
      Robert Haas authored
      This is necessary infrastructure for supporting parallel aggregation
      for aggregates whose transition type is "internal".  Such values
      can't be passed between cooperating processes, because they are
      just pointers.
      
      David Rowley, reviewed by Tomas Vondra and by me.
      5fe5a2ce
    • Robert Haas's avatar
      Improve pgbench docs regarding per-transaction logging. · 7f0a2c85
      Robert Haas authored
      The old documentation didn't know about the new -b flag, only about -f.
      
      Fabien Coelho
      7f0a2c85
    • Robert Haas's avatar
      Fix pgbench documentation error. · d797bf7d
      Robert Haas authored
      The description of what the per-transaction log file says for skipped
      transactions is just plain wrong.
      
      Report and patch by Tomas Vondra, reviewed by Fabien Coelho and
      modified by me.
      d797bf7d