1. 06 Feb, 2012 8 commits
    • Tom Lane's avatar
      Fix postmaster to attempt restart after a hot-standby crash. · 442231d7
      Tom Lane authored
      The postmaster was coded to treat any unexpected exit of the startup
      process (i.e., the WAL replay process) as a catastrophic crash, and not try
      to restart it. This was OK so long as the startup process could not have
      any sibling postmaster children.  However, if a hot-standby backend
      crashes, we SIGQUIT the startup process along with everything else, and the
      resulting exit is hardly "unexpected".  Treating it as such meant we failed
      to restart a standby server after any child crash at all, not only a crash
      of the WAL replay process as intended.  Adjust that.  Back-patch to 9.0
      where hot standby was introduced.
      442231d7
    • Michael Meskes's avatar
    • Tom Lane's avatar
      Avoid throwing ERROR during WAL replay of DROP TABLESPACE. · 5fc78efc
      Tom Lane authored
      Although we will not even issue an XLOG_TBLSPC_DROP WAL record unless
      removal of the tablespace's directories succeeds, that does not guarantee
      that the same operation will succeed during WAL replay.  Foreseeable
      reasons for it to fail include temp files created in the tablespace by Hot
      Standby backends, wrong directory permissions on a standby server, etc etc.
      The original coding threw ERROR if replay failed to remove the directories,
      but that is a serious overreaction.  Throwing an error aborts recovery,
      and worse means that manual intervention will be needed to get the database
      to start again, since otherwise the same error will recur on subsequent
      attempts to replay the same WAL record.  And the consequence of failing to
      remove the directories is only that some probably-small amount of disk
      space is wasted, so it hardly seems justified to throw an error.
      Accordingly, arrange to report such failures as LOG messages and keep going
      when a failure occurs during replay.
      
      Back-patch to 9.0 where Hot Standby was introduced.  In principle such
      problems can occur in earlier releases, but Hot Standby increases the odds
      of trouble significantly.  Given the lack of field reports of such issues,
      I'm satisfied with patching back as far as the patch applies easily.
      5fc78efc
    • Robert Haas's avatar
      pg_dump: Remove global Archive pointer. · 3b157cf2
      Robert Haas authored
      Instead, everything that needs the Archive object now gets it as a
      parameter.  This is necessary infrastructure for parallel pg_dump,
      but is also amply justified by the ugliness of the current code
      (though a lot more than this is needed to fix that problem).
      3b157cf2
    • Robert Haas's avatar
      pg_dump: Reduce dependencies on global variables. · 622f8628
      Robert Haas authored
      Change various places in the code that are referencing the global
      Archive object g_fout to instead reference the Archive object fout
      which is already being passed as a parameter.  For parallel pg_dump to
      work, we're going to need multiple Archive(Handle) objects, so the
      real solution here is to pass down the Archive object to everywhere
      that it needs to go, but we might as well pick the low-hanging fruit
      first.
      622f8628
    • Tom Lane's avatar
      Add locking around WAL-replay modification of shared-memory variables. · c6d76d7c
      Tom Lane authored
      Originally, most of this code assumed that no Postgres backends could be
      running concurrently with it, and so no locking could be needed.  That
      assumption fails in Hot Standby.  While it's still true that Hot Standby
      backends should never change values like nextXid, they can examine them,
      and consistency is important in some cases such as when computing a
      snapshot.  Therefore, prudence requires that WAL replay code obtain the
      relevant locks when modifying such variables, even though it can examine
      them without taking a lock.  We were following that coding rule in some
      places but not all.  This commit applies the coding rule uniformly to all
      updates of ShmemVariableCache and MultiXactState fields; a search of the
      replay routines did not find any other cases that seemed to be at risk.
      
      In addition, this commit fixes a longstanding thinko in replay of NEXTOID
      and checkpoint records: we tried to advance nextOid only if it was behind
      the value in the WAL record, but the comparison would draw the wrong
      conclusion if OID wraparound had occurred since the previous value.
      Better to just unconditionally assign the new value, since OID assignment
      shouldn't be happening during replay anyway.
      
      The additional locking seems to be more in the nature of future-proofing
      than fixing any live bug, so I am not going to back-patch it.  The NEXTOID
      fix will be back-patched separately.
      c6d76d7c
    • Robert Haas's avatar
      Remove dead declaration. · 96abd817
      Robert Haas authored
      96abd817
    • Alvaro Herrera's avatar
      fe-misc.c depends on pg_config_paths.h · 0c88086d
      Alvaro Herrera authored
      Declare this in Makefile to avoid failures in parallel compiles.
      
      Author: Lionel Elie Mamane
      0c88086d
  2. 05 Feb, 2012 4 commits
    • Tom Lane's avatar
      Fix transient clobbering of shared buffers during WAL replay. · 17118825
      Tom Lane authored
      RestoreBkpBlocks was in the habit of zeroing and refilling the target
      buffer; which was perfectly safe when the code was written, but is unsafe
      during Hot Standby operation.  The reason is that we have coding rules
      that allow backends to continue accessing a tuple in a heap relation while
      holding only a pin on its buffer.  Such a backend could see transiently
      zeroed data, if WAL replay had occasion to change other data on the page.
      This has been shown to be the cause of bug #6425 from Duncan Rance (who
      deserves kudos for developing a sufficiently-reproducible test case) as
      well as Bridget Frey's re-report of bug #6200.  It most likely explains the
      original report as well, though we don't yet have confirmation of that.
      
      To fix, change the code so that only bytes that are supposed to change will
      change, even transiently.  This actually saves cycles in RestoreBkpBlocks,
      since it's not writing the same bytes twice.
      
      Also fix seq_redo, which has the same disease, though it has to work a bit
      harder to meet the requirement.
      
      So far as I can tell, no other WAL replay routines have this type of bug.
      In particular, the index-related replay routines, which would certainly be
      broken if they had to meet the same standard, are not at risk because we
      do not have coding rules that allow access to an index page when not
      holding a buffer lock on it.
      
      Back-patch to 9.0 where Hot Standby was added.
      17118825
    • Tom Lane's avatar
      Improve comment. · ee68a441
      Tom Lane authored
      ee68a441
    • Tom Lane's avatar
      Add missing Assert and fix inaccurate elog message in standby_redo(). · 2af72cef
      Tom Lane authored
      All other WAL redo routines either call RestoreBkpBlocks() or Assert that
      they haven't been passed any backup blocks.  Make this one do likewise.
      Also, fix incorrect routine name in its failure message.
      2af72cef
    • Tom Lane's avatar
      Allow SQL-language functions to reference parameters by name. · 9bff0780
      Tom Lane authored
      Matthew Draper, reviewed by Hitoshi Harada
      9bff0780
  3. 04 Feb, 2012 3 commits
  4. 03 Feb, 2012 1 commit
  5. 02 Feb, 2012 3 commits
    • Peter Eisentraut's avatar
      ecpg: Improve test building · 69e9768e
      Peter Eisentraut authored
      Further improve on commit c75e1436.
      Instead of building both .o files and binaries in the same make rule,
      just rely on the normal .c -> .o rule.  This will ensure that
      dependency tracking is used when enabled.  To do this, disable the
      implicit direct .c -> binary rule globally, which will also prevent
      the original problem (*.dSYM junk) from reappearing elsewhere.
      69e9768e
    • Robert Haas's avatar
      Allow spgist's text_ops to handle pattern-matching operators. · 0ed7445d
      Robert Haas authored
      This was presumably intended to work this way all along, but a few key
      bits of indxpath.c didn't get the memo.
      
      Robert Haas and Tom Lane
      0ed7445d
    • Robert Haas's avatar
      Avoid re-checking for visibility map extension too frequently. · b4e07417
      Robert Haas authored
      When testing bits (but not when setting or clearing them), we now
      won't check whether the map has been extended.  This significantly
      improves performance in the case where the visibility map doesn't
      exist yet, by avoiding an extra system call per tuple.  To make
      sure backends notice eventually, send an smgr inval on VM extension.
      
      Dean Rasheed, with minor modifications by me.
      b4e07417
  6. 01 Feb, 2012 7 commits
  7. 31 Jan, 2012 6 commits
  8. 30 Jan, 2012 8 commits
    • Heikki Linnakangas's avatar
      Fix bug in the new wait-until-lwlock-is-free mechanism. · 82d4b262
      Heikki Linnakangas authored
      If there was a wait-until-free process in the head of the wait queue,
      followed by an exclusive locker, the exclusive locker was not be woken up
      as it should.
      82d4b262
    • Peter Eisentraut's avatar
      Add sequence USAGE privileges to information schema · 82e83f46
      Peter Eisentraut authored
      The sequence USAGE privilege is sufficiently similar to the SQL
      standard that it seems reasonable to show in the information schema.
      Also add some compatibility notes about it on the GRANT reference
      page.
      82e83f46
    • Peter Eisentraut's avatar
      PL/Python: Add result metadata functions · ee7fa66b
      Peter Eisentraut authored
      Add result object functions .colnames, .coltypes, .coltypmods to
      obtain information about the result column names and types, which was
      previously not possible in the PL/Python SPI interface.
      
      reviewed by Abhijit Menon-Sen
      ee7fa66b
    • Peter Eisentraut's avatar
      Use abort() instead of exit() to abort library functions · c6ea8cce
      Peter Eisentraut authored
      In some hopeless situations, certain library functions in libpq and
      libpgport quit the program.  Use abort() for that instead of exit(),
      so we don't interfere with the normal exit codes the program might
      use, we clearly signal the abnormal termination, and the caller has a
      chance of catching the termination.
      
      This was originally pointed out by Debian's Lintian program.
      c6ea8cce
    • Robert Haas's avatar
      Remove prototype for nonexistent function. · 423ee49b
      Robert Haas authored
      423ee49b
    • Heikki Linnakangas's avatar
      Make group commit more effective. · 9b38d46d
      Heikki Linnakangas authored
      When a backend needs to flush the WAL, and someone else is already flushing
      the WAL, wait until it releases the WALInsertLock and check if we still need
      to do the flush or if the other backend already did the work for us, before
      acquiring WALInsertLock. This helps group commit, because when the WAL flush
      finishes, all the backends that were waiting for it can be woken up in one
      go, and the can all concurrently observe that they're done, rather than
      waking them up one by one in a cascading fashion.
      
      This is based on a new LWLock function, LWLockWaitUntilFree(), which has
      peculiar semantics. If the lock is immediately free, it grabs the lock and
      returns true. If it's not free, it waits until it is released, but then
      returns false without grabbing the lock. This is used in XLogFlush(), so
      that when the lock is acquired, the backend flushes the WAL, but if it's
      not, the backend first checks the current flush location before retrying.
      
      Original patch and benchmarking by Peter Geoghegan and Simon Riggs, although
      this patch as committed ended up being very different from that.
      9b38d46d
    • Simon Riggs's avatar
    • Simon Riggs's avatar