1. 09 May, 2014 8 commits
    • Heikki Linnakangas's avatar
      More jsonb cleanup. · d9daff0e
      Heikki Linnakangas authored
      Fix JSONB_MAX_ELEMS and JSONB_MAX_PAIRS macros to use CB_MASK in the
      calculation. JENTRY_POSMASK happens to have the same value at the moment,
      but that's just coincidental.
      
      Refactor jsonb iterator functions, for readability.
      
      Get rid of the JENTRY_ISFIRST flag. Whenever we handle JEntrys, we have
      access to the whole array and have enough context information to know
      which entry is the first. This frees up one bit in the JEntry header for
      future use. While we're at it, shuffle the JEntry bits so that boolean
      true and false go together, for aesthetic reasons.
      
      Bump catalog version as this changes the on-disk format slightly.
      d9daff0e
    • Tom Lane's avatar
      Improve key representation for GIN jsonb_ops, and fix existence-search bug. · 46dddf76
      Tom Lane authored
      Change the key representation so that values that would exceed 127 bytes
      are hashed into short strings, and so that the original JSON datatype of
      each value is recorded in the index.  The hashing rule eliminates the major
      objection to having this opclass be the default for jsonb, namely that it
      could fail for plausible input data (due to GIN's restrictions on maximum
      key length).  Preserving datatype information doesn't really buy us much
      right now, but it requires no extra space compared to the previous way,
      and it might be useful later.
      
      Also, change the consistency-checking functions to request recheck for
      exists (jsonb ? text) and related operators.  The original analysis that
      this is an exactly checkable query was incorrect, since the index does
      not preserve information about whether a key appears at top level in
      the indexed JSON object.  Add a test case demonstrating the problem.
      
      Make some other, mostly cosmetic improvements to the code in jsonb_gin.c
      as well.
      
      catversion bump due to on-disk data format change in jsonb_ops indexes.
      46dddf76
    • Heikki Linnakangas's avatar
      Minor cleanup of jsonb_util.c · ff7bbb01
      Heikki Linnakangas authored
      Move the functions around to group related functions together. Remove
      binequal argument from lengthCompareJsonbStringValue, moving that
      responsibility to lengthCompareJsonbPair. Fix typo in comment.
      ff7bbb01
    • Heikki Linnakangas's avatar
      Avoid some pnstrdup()s when constructing jsonb · d3c72e23
      Heikki Linnakangas authored
      This speeds up text to jsonb parsing and hstore to jsonb conversions
      somewhat.
      d3c72e23
    • Tom Lane's avatar
      Fix missing dependencies in ecpg's test Makefiles. · 14d309cc
      Tom Lane authored
      Ensure that ecpg preprocessor output files are rebuilt when re-testing
      after a change in the ecpg preprocessor itself, or a change in any of
      several include files that get copied verbatim into the output files.
      The lack of these dependencies was what created problems for Kevin Grittner
      after the recent pgindent run.  There's no way for --enable-depend to
      discover these dependencies automatically, so we've gotta put them into
      the Makefiles by hand.
      
      While at it, reduce the amount of duplication in the ecpg invocations.
      14d309cc
    • Tom Lane's avatar
      Document permissions needed for pg_database_size and pg_tablespace_size. · fb1974cc
      Tom Lane authored
      Back in 8.3, we installed permissions checks in these functions (see
      commits 8bc225e7 and cc26599b).  But we forgot to document that
      anywhere in the user-facing docs; it did get mentioned in the 8.3 release
      notes, but nobody's looking at that any more.  Per gripe from Suya Huang.
      fb1974cc
    • Tom Lane's avatar
      Increase the default value of effective_cache_size to 4GB. · b910d7ea
      Tom Lane authored
      Per discussion, the old value of 128MB is ridiculously small on modern
      machines; in fact, it's not even any larger than the default value of
      shared_buffers, which it certainly should be.  Increase to 4GB, which
      is unlikely to be any worse than the old default for anyone, and should
      be noticeably better for most.  Eventually we might have an autotuning
      scheme for this setting, but the recent attempt crashed and burned,
      so for now just do this.
      b910d7ea
    • Tom Lane's avatar
      Revert "Auto-tune effective_cache size to be 4x shared buffers" · a16d421c
      Tom Lane authored
      This reverts commit ee1e5662, as well as
      a remarkably large number of followup commits, which were mostly concerned
      with the fact that the implementation didn't work terribly well.  It still
      doesn't: we probably need some rather basic work in the GUC infrastructure
      if we want to fully support GUCs whose default varies depending on the
      value of another GUC.  Meanwhile, it also emerged that there wasn't really
      consensus in favor of the definition the patch tried to implement (ie,
      effective_cache_size should default to 4 times shared_buffers).  So whack
      it all back to where it was.  In a followup commit, I'll do what was
      recently agreed to, which is to simply change the default to a higher
      value.
      a16d421c
  2. 08 May, 2014 6 commits
    • Noah Misch's avatar
      Un-break ecpg test suite under --disable-integer-datetimes. · 08c8e896
      Noah Misch authored
      Commit 4318daec broke it.  The change in
      sub-second precision at extreme dates is normal.  The inconsistent
      truncation vs. rounding is essentially a bug, albeit a longstanding one.
      Back-patch to 8.4, like the causative commit.
      08c8e896
    • Tom Lane's avatar
      Fix comment. · 1e81f846
      Tom Lane authored
      Previous commit was confused about the case we're handling: actually,
      what the patch is dealing with is platforms that have optreset, *and*
      have <getopt.h>, but the latter fails to declare the former.  Because
      we use a linking probe to set HAVE_INT_OPTRESET, we need to be sure we
      have a declaration even if <getopt.h> doesn't think it exists.
      1e81f846
    • Tom Lane's avatar
      Allow for platforms that have optreset but not <getopt.h>. · 0c15a524
      Tom Lane authored
      Reportedly, some versions of mingw are like that, and it seems plausible
      in general that older platforms might be that way.  However, we'd
      determined experimentally that just doing "extern int" conflicts with
      the way Cygwin declares these variables, so explicitly exclude Cygwin.
      
      Michael Paquier, tweaked by me to hopefully not break Cygwin
      0c15a524
    • Heikki Linnakangas's avatar
      Protect against torn pages when deleting GIN list pages. · 4f7bb4b2
      Heikki Linnakangas authored
      To-be-deleted list pages contain no useful information, as they are being
      deleted, but we must still protect the writes from being torn by a crash
      after a partial write. To do that, re-initialize the pages on WAL replay.
      
      Jeff Janes caught this with a test program to test partial writes.
      Backpatch to all supported versions.
      4f7bb4b2
    • Heikki Linnakangas's avatar
      Include files copied from libpqport in .gitignore · 02c9a938
      Heikki Linnakangas authored
      Michael Paquier
      02c9a938
    • Tom Lane's avatar
      Avoid buffer bloat in libpq when server is consistently faster than client. · 2f557167
      Tom Lane authored
      If the server sends a long stream of data, and the server + network are
      consistently fast enough to force the recv() loop in pqReadData() to
      iterate until libpq's input buffer is full, then upon processing the last
      incomplete message in each bufferload we'd usually double the buffer size,
      due to supposing that we didn't have enough room in the buffer to finish
      collecting that message.  After filling the newly-enlarged buffer, the
      cycle repeats, eventually resulting in an out-of-memory situation (which
      would be reported misleadingly as "lost synchronization with server").
      Of course, we should not enlarge the buffer unless we still need room
      after discarding already-processed messages.
      
      This bug dates back quite a long time: pqParseInput3 has had the behavior
      since perhaps 2003, getCopyDataMessage at least since commit 70066eb1
      in 2008.  Probably the reason it's not been isolated before is that in
      common environments the recv() loop would always be faster than the server
      (if on the same machine) or faster than the network (if not); or at least
      it wouldn't be slower consistently enough to let the buffer ramp up to a
      problematic size.  The reported cases involve Windows, which perhaps has
      different timing behavior than other platforms.
      
      Per bug #7914 from Shin-ichi Morita, though this is different from his
      proposed solution.  Back-patch to all supported branches.
      2f557167
  3. 07 May, 2014 15 commits
    • Robert Haas's avatar
      When a background worker exists with code 0, unregister it. · be755816
      Robert Haas authored
      The previous behavior was to restart immediately, which was generally
      viewed as less useful.
      
      Petr Jelinek, with some adjustments by me.
      be755816
    • Heikki Linnakangas's avatar
      Fix build after removing JsonbValue.estSize field. · 7572b773
      Heikki Linnakangas authored
      Oops, I didn't realize that contrib/hstore refers to jsonb stuff.
      7572b773
    • Robert Haas's avatar
      When a bgworker exits, always call ReleasePostmasterChildSlot. · eee6cf1f
      Robert Haas authored
      Commit e2ce9aa2 was insufficiently
      well thought out.  Repair.
      eee6cf1f
    • Robert Haas's avatar
      Restart bgworkers immediately after a crash-and-restart cycle. · 970d1f76
      Robert Haas authored
      Just as we would start bgworkers immediately after an initial startup
      of the server, we should restart them immediately when reinitializing.
      
      Petr Jelinek and Robert Haas
      970d1f76
    • Heikki Linnakangas's avatar
      Clean up jsonb code. · 364ddc3e
      Heikki Linnakangas authored
      The main target of this cleanup is the convertJsonb() function, but I also
      touched a lot of other things that I spotted into in the process.
      
      The new convertToJsonb() function uses an output buffer that's resized on
      demand, so the code to estimate of the size of JsonbValue is removed.
      
      The on-disk format was not changed, even though I refactored the structs
      used to handle it. The term "superheader" is replaced with "container".
      
      The jsonb_exists_any and jsonb_exists_all functions no longer sort the input
      array. That was a premature optimization, the idea being that if there are
      duplicates in the input array, you only need to check them once. Also,
      sorting the array saves some effort in the binary search used to find a key
      within an object. But there were drawbacks too: the sorting and
      deduplicating obviously isn't free, and in the typical case there are no
      duplicates to remove, and the gain in the binary search was minimal. Remove
      all that, which makes the code simpler too.
      
      This includes a bug-fix; the total length of the elements in a jsonb array
      or object mustn't exceed 2^28. That is now checked.
      364ddc3e
    • Robert Haas's avatar
      Detach shared memory from bgworkers without shmem access. · 4d155d8b
      Robert Haas authored
      Since the postmaster won't perform a crash-and-restart sequence
      for background workers which don't request shared memory access,
      we'd better make sure that they can't corrupt shared memory.
      
      Patch by me, review by Tom Lane.
      4d155d8b
    • Tom Lane's avatar
      Fix failure to set ActiveSnapshot while rewinding a cursor. · 04e5025b
      Tom Lane authored
      ActiveSnapshot needs to be set when we call ExecutorRewind because some
      plan node types may execute user-defined functions during their ReScan
      calls (nodeLimit.c does so, at least).  The wisdom of that is somewhat
      debatable, perhaps, but for now the simplest fix is to make sure the
      required context is valid.  Failure to do this typically led to a
      null-pointer-dereference core dump, though it's possible that in more
      complex cases a function could be executed with the wrong snapshot
      leading to very subtle misbehavior.
      
      Per report from Leif Jensen.  It's been broken for a long time, so
      back-patch to all active branches.
      04e5025b
    • Robert Haas's avatar
      Never crash-and-restart for bgworkers without shared memory access. · e2ce9aa2
      Robert Haas authored
      The motivation for a crash and restart cycle when a backend dies is
      that it might have corrupted shared memory on the way down; and we
      can't recover reliably except by reinitializing everything.  But that
      doesn't apply to processes that don't touch shared memory.  Currently,
      there's nothing to prevent a background worker that doesn't request
      shared memory access from touching shared memory anyway, but that's a
      separate bug.
      
      Previous to this commit, the coding in postmaster.c was inconsistent:
      an exit status other than 0 or 1 didn't provoke a crash-and-restart,
      but failure to release the postmaster child slot did.  This change
      makes those cases consistent.
      e2ce9aa2
    • Tom Lane's avatar
      1891b415
    • Jeff Davis's avatar
      Fix interval test, which was broken for floating-point timestamps. · 348aa75a
      Jeff Davis authored
      Commit 4318daec introduced a test that
      couldn't be made consistent between integer and floating-point
      timestamps.
      
      It was designed to test the longest possible interval output length,
      so removing four zeros from the number of hours, as this patch does,
      is not ideal. But the test still has some utility for its original
      purpose, and there aren't a lot of other good options.
      
      Noah Misch suggested a different approach where we test that the
      output either matches what we expect from integer timestamps or what
      we expect from floating-point timestamps. That seemed to obscure an
      otherwise simple test, however.
      
      Reviewed by Tom Lane and Noah Misch.
      348aa75a
    • Tom Lane's avatar
      hash_any returns Datum, not uint32 (and definitely not "int"). · 2c22afaa
      Tom Lane authored
      The coding in JsonbHashScalarValue might have accidentally failed to fail
      given current representational choices, but the key word there would be
      "accidental".  Insert the appropriate datatype conversion macro.  And
      use the right conversion macro for hash_numeric's result, too.
      
      In passing make the code a bit cleaner and less repetitive by factoring
      out the xor step from the switch.
      2c22afaa
    • Jeff Davis's avatar
      Improve comment for tricky aspect of index-only scans. · 35c0cd3b
      Jeff Davis authored
      Index-only scans avoid taking a lock on the VM buffer, which would
      cause a lot of contention. To be correct, that requires some intricate
      assumptions that weren't completely documented in the previous
      comment.
      
      Reviewed by Robert Haas.
      35c0cd3b
    • Peter Eisentraut's avatar
      doc: Fix DocBook XML validity · 3a9d430a
      Peter Eisentraut authored
      The main problem is that DocBook SGML allows indexterm elements just
      about everywhere, but DocBook XML is stricter.  For example, this common
      pattern
      
          <varlistentry>
           <indexterm>...</indexterm>
           <term>...</term>
           ...
          </varlistentry>
      
      needs to be changed to something like
      
          <varlistentry>
           <term>...<indexterm>...</indexterm></term>
           ...
          </varlistentry>
      
      See also bb4eefe7.
      
      There is currently nothing in the build system that enforces that things
      stay valid, because that requires additional tools and will receive
      separate consideration.
      3a9d430a
    • Bruce Momjian's avatar
      With ecpg exclusion removed, re-run pgindent for 9.4 · 84288a86
      Bruce Momjian authored
      Report by Tom Lane
      84288a86
    • Bruce Momjian's avatar
      Remove pgindent ecpg exclusion pattern · 9516668e
      Bruce Momjian authored
      Report by Tom Lane
      9516668e
  4. 06 May, 2014 11 commits