1. 03 Mar, 2014 10 commits
    • Robert Haas's avatar
      Introduce logical decoding. · b89e1510
      Robert Haas authored
      This feature, building on previous commits, allows the write-ahead log
      stream to be decoded into a series of logical changes; that is,
      inserts, updates, and deletes and the transactions which contain them.
      It is capable of handling decoding even across changes to the schema
      of the effected tables.  The output format is controlled by a
      so-called "output plugin"; an example is included.  To make use of
      this in a real replication system, the output plugin will need to be
      modified to produce output in the format appropriate to that system,
      and to perform filtering.
      
      Currently, information can be extracted from the logical decoding
      system only via SQL; future commits will add the ability to stream
      changes via walsender.
      
      Andres Freund, with review and other contributions from many other
      people, including Álvaro Herrera, Abhijit Menon-Sen, Peter Gheogegan,
      Kevin Grittner, Robert Haas, Heikki Linnakangas, Fujii Masao, Abhijit
      Menon-Sen, Michael Paquier, Simon Riggs, Craig Ringer, and Steve
      Singer.
      b89e1510
    • Peter Eisentraut's avatar
      Fix whitespace · de94b47c
      Peter Eisentraut authored
      de94b47c
    • Heikki Linnakangas's avatar
      Rename huge_tlb_pages to huge_pages, and improve docs. · f8ce16d0
      Heikki Linnakangas authored
      Christian Kruse
      f8ce16d0
    • Alvaro Herrera's avatar
      pg_dump et al: Add --if-exists option · 9067310c
      Alvaro Herrera authored
      This option makes pg_dump, pg_dumpall and pg_restore inject an IF EXISTS
      clause to each DROP command they emit.  (In pg_dumpall, the clause is
      not added to individual objects drops, but rather to the CREATE DATABASE
      commands, as well as CREATE ROLE and CREATE TABLESPACE.)
      
      This allows for a better user dump experience when using --clean in case
      some objects do not already exist.  Per bug #7873 by Dave Rolsky.
      
      Author: Pavel Stěhule
      Reviewed-by: Jeevan Chalke, Álvaro Herrera, Josh Kupershmidt
      9067310c
    • Robert Haas's avatar
      Use a longer buffer in libpqrcv_startstreaming. · 34c6d961
      Robert Haas authored
      Because of the new SLOT clause in the START_REPLICATION command, it's
      possible for the command to end up too long for the old maximum buffer
      length.
      
      Andres Freund
      34c6d961
    • Robert Haas's avatar
      Bump catversion. · a8e9b86b
      Robert Haas authored
      The previous patch should have entailed a catversion bump, but I
      forgot.
      a8e9b86b
    • Robert Haas's avatar
      Corrections to replication slots code and documentation. · d83ee622
      Robert Haas authored
      Andres Freund, per a report from Vik Faering
      d83ee622
    • Robert Haas's avatar
      pageinspect: Use new pg_lsn datatype. · 45ffeb7e
      Robert Haas authored
      Michael Paquier, with slight comment changes by me
      45ffeb7e
    • Robert Haas's avatar
      Define LSNOID in pg_type.h. · ae95f5f7
      Robert Haas authored
      Most other built-in types have a similarly-named constant, so this
      type should probably have one, too.
      
      Michael Paquier
      ae95f5f7
    • Stephen Frost's avatar
      Another round of Coverity fixes · 5592ebac
      Stephen Frost authored
      Additional non-security issues/improvements spotted by Coverity.
      
      In backend/libpq, no sense trying to protect against port->hba being
      NULL after we've already dereferenced it in the switch() statement.
      
      Prevent against possible overflow due to 32bit arithmitic in
      basebackup throttling (not yet released, so no security concern).
      
      Remove nonsensical check of array pointer against NULL in procarray.c,
      looks to be a holdover from 9.1 and earlier when there were pointers
      being used but now it's just an array.
      
      Remove pointer check-against-NULL in tsearch/spell.c as we had already
      dereferenced it above (in the strcmp()).
      
      Remove dead code from adt/orderedsetaggs.c, isnull is checked
      immediately after each tuplesort_getdatum() call and if true we return,
      so no point checking it again down at the bottom.
      
      Remove recently added minor error-condition memory leak in pg_regress.
      5592ebac
  2. 02 Mar, 2014 1 commit
    • Stephen Frost's avatar
      Various Coverity-spotted fixes · b1aebbb6
      Stephen Frost authored
      A number of issues were identified by the Coverity scanner and are
      addressed in this patch.  None of these appear to be security issues
      and many are mostly cosmetic changes.
      
      Short comments for each of the changes follows.
      
      Correct the semi-colon placement in be-secure.c regarding SSL retries.
      Remove a useless comparison-to-NULL in proc.c (value is dereferenced
        prior to this check and therefore can't be NULL).
      Add checking of chmod() return values to initdb.
      Fix a couple minor memory leaks in initdb.
      Fix memory leak in pg_ctl- involves free'ing the config file contents.
      Use an int to capture fgetc() return instead of an enum in pg_dump.
      Fix minor memory leaks in pg_dump.
        (note minor change to convertOperatorReference()'s API)
      Check fclose()/remove() return codes in psql.
      Check fstat(), find_my_exec() return codes in psql.
      Various ECPG memory leak fixes.
      Check find_my_exec() return in ECPG.
      Explicitly ignore pqFlush return in libpq error-path.
      Change PQfnumber() to avoid doing an strdup() when no changes required.
      Remove a few useless check-against-NULL's (value deref'd beforehand).
      Check rmtree(), malloc() results in pg_regress.
      Also check get_alternative_expectfile() return in pg_regress.
      b1aebbb6
  3. 01 Mar, 2014 1 commit
    • Tom Lane's avatar
      Allow regex operations to be terminated early by query cancel requests. · 9662143f
      Tom Lane authored
      The regex code didn't have any provision for query cancel; which is
      unsurprising given its non-Postgres origin, but still problematic since
      some operations can take a long time.  Introduce a callback function to
      check for a pending query cancel or session termination request, and
      call it in a couple of strategic spots where we can make the regex code
      exit with an error indicator.
      
      If we ever actually split out the regex code as a standalone library,
      some additional work will be needed to let the cancel callback function
      be specified externally to the library.  But that's straightforward
      (certainly so by comparison to putting the locale-dependent character
      classification logic on a similar arms-length basis), and there seems
      no need to do it right now.
      
      A bigger issue is that there may be more places than these two where
      we need to check for cancels.  We can always add more checks later,
      now that the infrastructure is in place.
      
      Since there are known examples of not-terribly-long regexes that can
      lock up a backend for a long time, back-patch to all supported branches.
      I have hopes of fixing the known performance problems later, but adding
      query cancel ability seems like a good idea even if they were all fixed.
      9662143f
  4. 28 Feb, 2014 2 commits
    • Heikki Linnakangas's avatar
      Remove bogus while-loop. · d8a42b15
      Heikki Linnakangas authored
      Commit abf5c5c9 added a bogus while-
      statement after the for(;;)-loop. It went unnoticed in testing, because
      it was dead code.
      
      Report by KONDO Mitsumasa. Backpatch to 9.3. The commit that introduced
      this was also applied to 9.2, but not the bogus while-loop part, because
      the code in 9.2 looks quite different.
      d8a42b15
    • Peter Eisentraut's avatar
      pgbench: Fix help message · 192d8e04
      Peter Eisentraut authored
      Add NUM placeholder to -t option in help message.  It got lost in
      79cddb18.
      
      Author: Fabien COELHO <coelho@cri.ensmp.fr>
      192d8e04
  5. 27 Feb, 2014 4 commits
    • Alvaro Herrera's avatar
      Allow BASE_BACKUP to be throttled · ef5856fd
      Alvaro Herrera authored
      A new MAX_RATE option allows imposing a limit to the network transfer
      rate from the server side.  This is useful to limit the stress that
      taking a base backup has on the server.
      
      pg_basebackup is now able to specify a value to the server, too.
      
      Author: Antonin Houska
      
      Patch reviewed by Stefan Radomski, Andres Freund, Zoltán Böszörményi,
      Fujii Masao, and Álvaro Herrera.
      ef5856fd
    • Tom Lane's avatar
      Remove dependency on database encoding in citext regression test. · 1161d895
      Tom Lane authored
      Testing convert_to(..., 'ISO-8859-1') fails if there isn't a conversion
      function available from the database encoding to ISO-8859-1.  This has
      been broken since day one, but the breakage was hidden by
      pg_do_encoding_conversion's failure to complain, up till commit
      49c817ea.
      
      Since the data being converted in this test is plain ASCII, no actual
      conversion need happen (and if it did, it would prove little about citext
      anyway).  So that we still have some code coverage of the convert() family
      of functions, let's switch to using convert_from, with SQL_ASCII as the
      specified source encoding.  Per buildfarm.
      1161d895
    • Alvaro Herrera's avatar
      doc: bgw_main takes a Datum argument, not void *. · 4333eee8
      Alvaro Herrera authored
      Per report from James Harper.
      4333eee8
    • Alvaro Herrera's avatar
      Fix WAL replay of locking an updated tuple · 6bfa88ac
      Alvaro Herrera authored
      We were resetting the tuple's HEAP_HOT_UPDATED flag as well as t_ctid on
      WAL replay of a tuple-lock operation, which is incorrect when the tuple
      is already updated.
      
      Back-patch to 9.3.  The clearing of both header elements was there
      previously, but since no update could be present on a tuple that was
      being locked, it was harmless.
      
      Bug reported by Peter Geoghegan and Greg Stark in
      CAM3SWZTMQiCi5PV5OWHb+bYkUcnCk=O67w0cSswPvV7XfUcU5g@mail.gmail.com and
      CAM-w4HPTOeMT4KP0OJK+mGgzgcTOtLRTvFZyvD0O4aH-7dxo3Q@mail.gmail.com
      respectively; diagnosis by Andres Freund.
      6bfa88ac
  6. 26 Feb, 2014 2 commits
    • Heikki Linnakangas's avatar
      btbuild no longer calls _bt_doinsert(), update comment. · 00976f20
      Heikki Linnakangas authored
      Peter Geoghegan
      00976f20
    • Jeff Davis's avatar
      Fix crash in json_to_record(). · 486ea0b1
      Jeff Davis authored
      json_to_record() depends on get_call_result_type() for the tuple
      descriptor of the record that should be returned, but in some cases
      that cannot be determined. Add a guard to check if the tuple
      descriptor has been properly resolved, similar to other callers of
      get_call_result_type().
      
      Also add guard for two other callers of get_call_result_type() in
      jsonfuncs.c. Although json_to_record() is the only actual bug, it's a
      good idea to follow convention.
      486ea0b1
  7. 25 Feb, 2014 5 commits
    • Tom Lane's avatar
      Use SnapshotDirty rather than an active snapshot to probe index endpoints. · fccebe42
      Tom Lane authored
      If there are lots of uncommitted tuples at the end of the index range,
      get_actual_variable_range() ends up fetching each one and doing an MVCC
      visibility check on it, until it finally hits a visible tuple.  This is
      bad enough in isolation, considering that we don't need an exact answer
      only an approximate one.  But because the tuples are not yet committed,
      each visibility check does a TransactionIdIsInProgress() test, which
      involves scanning the ProcArray.  When multiple sessions do this
      concurrently, the ensuing contention results in horrid performance loss.
      20X overall throughput loss on not-too-complicated queries is easy to
      demonstrate in the back branches (though someone's made it noticeably
      less bad in HEAD).
      
      We can dodge the problem fairly effectively by using SnapshotDirty rather
      than a normal MVCC snapshot.  This will cause the index probe to take
      uncommitted tuples as good, so that we incur only one tuple fetch and test
      even if there are many such tuples.  The extent to which this degrades the
      estimate is debatable: it's possible the result is actually a more accurate
      prediction than before, if the endmost tuple has become committed by the
      time we actually execute the query being planned.  In any case, it's not
      very likely that it makes the estimate a lot worse.
      
      SnapshotDirty will still reject tuples that are known committed dead, so
      we won't give bogus answers if an invalid outlier has been deleted but not
      yet vacuumed from the index.  (Because btrees know how to mark such tuples
      dead in the index, we shouldn't have a big performance problem in the case
      that there are many of them at the end of the range.)  This consideration
      motivates not using SnapshotAny, which was also considered as a fix.
      
      Note: the back branches were using SnapshotNow instead of an MVCC snapshot,
      but the problem and solution are the same.
      
      Per performance complaints from Bartlomiej Romanski, Josh Berkus, and
      others.  Back-patch to 9.0, where the issue was introduced (by commit
      40608e7f).
      fccebe42
    • Robert Haas's avatar
      Update a few comments to mention materialized views. · cf6aa68b
      Robert Haas authored
      Etsuro Fujita
      cf6aa68b
    • Robert Haas's avatar
      Show xid and xmin in pg_stat_activity and pg_stat_replication. · dd1a3bcc
      Robert Haas authored
      Christian Kruse, reviewed by Andres Freund and myself, with further
      minor adjustments by me.
      dd1a3bcc
    • Robert Haas's avatar
      pg_basebackup: Skip only the *contents* of pg_replslot. · 278c9420
      Robert Haas authored
      Include the directory itself.
      
      Fujii Masao
      278c9420
    • Peter Eisentraut's avatar
      Update and clarify ssl_ciphers default · 32001ab0
      Peter Eisentraut authored
      - Write HIGH:MEDIUM instead of DEFAULT:!LOW:!EXP for clarity.
      - Order 3DES last to work around inappropriate OpenSSL default.
      - Remove !MD5 and @STRENGTH, because they are irrelevant.
      - Add clarifying documentation.
      
      Effectively, the new default is almost the same as the old one, but it
      is arguably easier to understand and modify.
      
      Author: Marko Kreen <markokr@gmail.com>
      32001ab0
  8. 24 Feb, 2014 10 commits
  9. 23 Feb, 2014 3 commits
    • Tom Lane's avatar
      Prefer pg_any_to_server/pg_server_to_any over pg_do_encoding_conversion. · 769065c1
      Tom Lane authored
      A large majority of the callers of pg_do_encoding_conversion were
      specifying the database encoding as either source or target of the
      conversion, meaning that we can use the less general functions
      pg_any_to_server/pg_server_to_any instead.
      
      The main advantage of using the latter functions is that they can make use
      of a cached conversion-function lookup in the common case that the other
      encoding is the current client_encoding.  It's notationally cleaner too in
      most cases, not least because of the historical artifact that the latter
      functions use "char *" rather than "unsigned char *" in their APIs.
      
      Note that pg_any_to_server will apply an encoding verification step in
      some cases where pg_do_encoding_conversion would have just done nothing.
      This seems to me to be a good idea at most of these call sites, though
      it partially negates the performance benefit.
      
      Per discussion of bug #9210.
      769065c1
    • Tom Lane's avatar
      Plug some more holes in encoding conversion. · 49c817ea
      Tom Lane authored
      Various places assume that pg_do_encoding_conversion() and
      pg_server_to_any() will ensure encoding validity of their results;
      but they failed to do so in the case that the source encoding is SQL_ASCII
      while the destination is not.  We cannot perform any actual "conversion"
      in that scenario, but we should still validate the string according to the
      destination encoding.  Per bug #9210 from Digoal Zhou.
      
      Arguably this is a back-patchable bug fix, but on the other hand adding
      more enforcing of encoding checks might break existing applications that
      were being sloppy.  On balance there doesn't seem to be much enthusiasm
      for a back-patch, so fix in HEAD only.
      
      While at it, remove some apparently-no-longer-needed provisions for
      letting pg_do_encoding_conversion() "work" outside a transaction ---
      if you consider it "working" to silently fail to do the requested
      conversion.
      
      Also, make a few cosmetic improvements in mbutils.c, notably removing
      some Asserts that are certainly dead code since the variables they
      assert aren't null are never null, even at process start.  (I think
      this wasn't true at one time, but it is now.)
      49c817ea
    • Peter Eisentraut's avatar
      configure.in: Use dnl in place of # where appropriate · 2c65856b
      Peter Eisentraut authored
      The comment added by ed011d97 used #,
      which means it gets copied into configure, but it doesn't make sense
      there.  So use dnl, which gets dropped when creating configure.
      2c65856b
  10. 22 Feb, 2014 1 commit
  11. 21 Feb, 2014 1 commit
    • Tom Lane's avatar
      Do ScalarArrayOp estimation correctly when array is a stable expression. · 77585bce
      Tom Lane authored
      Most estimation functions apply estimate_expression_value to see if they
      can reduce an expression to a constant; the key difference is that it
      allows evaluation of stable as well as immutable functions in hopes of
      ending up with a simple Const node.  scalararraysel didn't get the memo
      though, and neither did gincost_opexpr/gincost_scalararrayopexpr.  Fix
      that, and remove a now-unnecessary estimate_expression_value step in the
      subsidiary function scalararraysel_containment.
      
      Per complaint from Alexey Klyukin.  Back-patch to 9.3.  The problem
      goes back further, but I'm hesitant to change estimation behavior in
      long-stable release branches.
      77585bce