1. 06 Feb, 2012 6 commits
    • Tom Lane's avatar
      Avoid throwing ERROR during WAL replay of DROP TABLESPACE. · 5fc78efc
      Tom Lane authored
      Although we will not even issue an XLOG_TBLSPC_DROP WAL record unless
      removal of the tablespace's directories succeeds, that does not guarantee
      that the same operation will succeed during WAL replay.  Foreseeable
      reasons for it to fail include temp files created in the tablespace by Hot
      Standby backends, wrong directory permissions on a standby server, etc etc.
      The original coding threw ERROR if replay failed to remove the directories,
      but that is a serious overreaction.  Throwing an error aborts recovery,
      and worse means that manual intervention will be needed to get the database
      to start again, since otherwise the same error will recur on subsequent
      attempts to replay the same WAL record.  And the consequence of failing to
      remove the directories is only that some probably-small amount of disk
      space is wasted, so it hardly seems justified to throw an error.
      Accordingly, arrange to report such failures as LOG messages and keep going
      when a failure occurs during replay.
      
      Back-patch to 9.0 where Hot Standby was introduced.  In principle such
      problems can occur in earlier releases, but Hot Standby increases the odds
      of trouble significantly.  Given the lack of field reports of such issues,
      I'm satisfied with patching back as far as the patch applies easily.
      5fc78efc
    • Robert Haas's avatar
      pg_dump: Remove global Archive pointer. · 3b157cf2
      Robert Haas authored
      Instead, everything that needs the Archive object now gets it as a
      parameter.  This is necessary infrastructure for parallel pg_dump,
      but is also amply justified by the ugliness of the current code
      (though a lot more than this is needed to fix that problem).
      3b157cf2
    • Robert Haas's avatar
      pg_dump: Reduce dependencies on global variables. · 622f8628
      Robert Haas authored
      Change various places in the code that are referencing the global
      Archive object g_fout to instead reference the Archive object fout
      which is already being passed as a parameter.  For parallel pg_dump to
      work, we're going to need multiple Archive(Handle) objects, so the
      real solution here is to pass down the Archive object to everywhere
      that it needs to go, but we might as well pick the low-hanging fruit
      first.
      622f8628
    • Tom Lane's avatar
      Add locking around WAL-replay modification of shared-memory variables. · c6d76d7c
      Tom Lane authored
      Originally, most of this code assumed that no Postgres backends could be
      running concurrently with it, and so no locking could be needed.  That
      assumption fails in Hot Standby.  While it's still true that Hot Standby
      backends should never change values like nextXid, they can examine them,
      and consistency is important in some cases such as when computing a
      snapshot.  Therefore, prudence requires that WAL replay code obtain the
      relevant locks when modifying such variables, even though it can examine
      them without taking a lock.  We were following that coding rule in some
      places but not all.  This commit applies the coding rule uniformly to all
      updates of ShmemVariableCache and MultiXactState fields; a search of the
      replay routines did not find any other cases that seemed to be at risk.
      
      In addition, this commit fixes a longstanding thinko in replay of NEXTOID
      and checkpoint records: we tried to advance nextOid only if it was behind
      the value in the WAL record, but the comparison would draw the wrong
      conclusion if OID wraparound had occurred since the previous value.
      Better to just unconditionally assign the new value, since OID assignment
      shouldn't be happening during replay anyway.
      
      The additional locking seems to be more in the nature of future-proofing
      than fixing any live bug, so I am not going to back-patch it.  The NEXTOID
      fix will be back-patched separately.
      c6d76d7c
    • Robert Haas's avatar
      Remove dead declaration. · 96abd817
      Robert Haas authored
      96abd817
    • Alvaro Herrera's avatar
      fe-misc.c depends on pg_config_paths.h · 0c88086d
      Alvaro Herrera authored
      Declare this in Makefile to avoid failures in parallel compiles.
      
      Author: Lionel Elie Mamane
      0c88086d
  2. 05 Feb, 2012 4 commits
    • Tom Lane's avatar
      Fix transient clobbering of shared buffers during WAL replay. · 17118825
      Tom Lane authored
      RestoreBkpBlocks was in the habit of zeroing and refilling the target
      buffer; which was perfectly safe when the code was written, but is unsafe
      during Hot Standby operation.  The reason is that we have coding rules
      that allow backends to continue accessing a tuple in a heap relation while
      holding only a pin on its buffer.  Such a backend could see transiently
      zeroed data, if WAL replay had occasion to change other data on the page.
      This has been shown to be the cause of bug #6425 from Duncan Rance (who
      deserves kudos for developing a sufficiently-reproducible test case) as
      well as Bridget Frey's re-report of bug #6200.  It most likely explains the
      original report as well, though we don't yet have confirmation of that.
      
      To fix, change the code so that only bytes that are supposed to change will
      change, even transiently.  This actually saves cycles in RestoreBkpBlocks,
      since it's not writing the same bytes twice.
      
      Also fix seq_redo, which has the same disease, though it has to work a bit
      harder to meet the requirement.
      
      So far as I can tell, no other WAL replay routines have this type of bug.
      In particular, the index-related replay routines, which would certainly be
      broken if they had to meet the same standard, are not at risk because we
      do not have coding rules that allow access to an index page when not
      holding a buffer lock on it.
      
      Back-patch to 9.0 where Hot Standby was added.
      17118825
    • Tom Lane's avatar
      Improve comment. · ee68a441
      Tom Lane authored
      ee68a441
    • Tom Lane's avatar
      Add missing Assert and fix inaccurate elog message in standby_redo(). · 2af72cef
      Tom Lane authored
      All other WAL redo routines either call RestoreBkpBlocks() or Assert that
      they haven't been passed any backup blocks.  Make this one do likewise.
      Also, fix incorrect routine name in its failure message.
      2af72cef
    • Tom Lane's avatar
      Allow SQL-language functions to reference parameters by name. · 9bff0780
      Tom Lane authored
      Matthew Draper, reviewed by Hitoshi Harada
      9bff0780
  3. 04 Feb, 2012 3 commits
  4. 03 Feb, 2012 1 commit
  5. 02 Feb, 2012 3 commits
    • Peter Eisentraut's avatar
      ecpg: Improve test building · 69e9768e
      Peter Eisentraut authored
      Further improve on commit c75e1436.
      Instead of building both .o files and binaries in the same make rule,
      just rely on the normal .c -> .o rule.  This will ensure that
      dependency tracking is used when enabled.  To do this, disable the
      implicit direct .c -> binary rule globally, which will also prevent
      the original problem (*.dSYM junk) from reappearing elsewhere.
      69e9768e
    • Robert Haas's avatar
      Allow spgist's text_ops to handle pattern-matching operators. · 0ed7445d
      Robert Haas authored
      This was presumably intended to work this way all along, but a few key
      bits of indxpath.c didn't get the memo.
      
      Robert Haas and Tom Lane
      0ed7445d
    • Robert Haas's avatar
      Avoid re-checking for visibility map extension too frequently. · b4e07417
      Robert Haas authored
      When testing bits (but not when setting or clearing them), we now
      won't check whether the map has been extended.  This significantly
      improves performance in the case where the visibility map doesn't
      exist yet, by avoiding an extra system call per tuple.  To make
      sure backends notice eventually, send an smgr inval on VM extension.
      
      Dean Rasheed, with minor modifications by me.
      b4e07417
  6. 01 Feb, 2012 7 commits
  7. 31 Jan, 2012 6 commits
  8. 30 Jan, 2012 10 commits
    • Heikki Linnakangas's avatar
      Fix bug in the new wait-until-lwlock-is-free mechanism. · 82d4b262
      Heikki Linnakangas authored
      If there was a wait-until-free process in the head of the wait queue,
      followed by an exclusive locker, the exclusive locker was not be woken up
      as it should.
      82d4b262
    • Peter Eisentraut's avatar
      Add sequence USAGE privileges to information schema · 82e83f46
      Peter Eisentraut authored
      The sequence USAGE privilege is sufficiently similar to the SQL
      standard that it seems reasonable to show in the information schema.
      Also add some compatibility notes about it on the GRANT reference
      page.
      82e83f46
    • Peter Eisentraut's avatar
      PL/Python: Add result metadata functions · ee7fa66b
      Peter Eisentraut authored
      Add result object functions .colnames, .coltypes, .coltypmods to
      obtain information about the result column names and types, which was
      previously not possible in the PL/Python SPI interface.
      
      reviewed by Abhijit Menon-Sen
      ee7fa66b
    • Peter Eisentraut's avatar
      Use abort() instead of exit() to abort library functions · c6ea8cce
      Peter Eisentraut authored
      In some hopeless situations, certain library functions in libpq and
      libpgport quit the program.  Use abort() for that instead of exit(),
      so we don't interfere with the normal exit codes the program might
      use, we clearly signal the abnormal termination, and the caller has a
      chance of catching the termination.
      
      This was originally pointed out by Debian's Lintian program.
      c6ea8cce
    • Robert Haas's avatar
      Remove prototype for nonexistent function. · 423ee49b
      Robert Haas authored
      423ee49b
    • Heikki Linnakangas's avatar
      Make group commit more effective. · 9b38d46d
      Heikki Linnakangas authored
      When a backend needs to flush the WAL, and someone else is already flushing
      the WAL, wait until it releases the WALInsertLock and check if we still need
      to do the flush or if the other backend already did the work for us, before
      acquiring WALInsertLock. This helps group commit, because when the WAL flush
      finishes, all the backends that were waiting for it can be woken up in one
      go, and the can all concurrently observe that they're done, rather than
      waking them up one by one in a cascading fashion.
      
      This is based on a new LWLock function, LWLockWaitUntilFree(), which has
      peculiar semantics. If the lock is immediately free, it grabs the lock and
      returns true. If it's not free, it waits until it is released, but then
      returns false without grabbing the lock. This is used in XLogFlush(), so
      that when the lock is acquired, the backend flushes the WAL, but if it's
      not, the backend first checks the current flush location before retrying.
      
      Original patch and benchmarking by Peter Geoghegan and Simon Riggs, although
      this patch as committed ended up being very different from that.
      9b38d46d
    • Simon Riggs's avatar
    • Simon Riggs's avatar
    • Heikki Linnakangas's avatar
      Accept a non-existent value in "ALTER USER/DATABASE SET ..." command. · a5782570
      Heikki Linnakangas authored
      When default_text_search_config, default_tablespace, or temp_tablespaces
      setting is set per-user or per-database, with an "ALTER USER/DATABASE SET
      ..." statement, don't throw an error if the text search configuration or
      tablespace does not exist. In case of text search configuration, even if
      it doesn't exist in the current database, it might exist in another
      database, where the setting is intended to have its effect. This behavior
      is now the same as search_path's.
      
      Tablespaces are cluster-wide, so the same argument doesn't hold for
      tablespaces, but there's a problem with pg_dumpall: it dumps "ALTER USER
      SET ..." statements before the "CREATE TABLESPACE" statements. Arguably
      that's pg_dumpall's fault - it should dump the statements in such an order
      that the tablespace is created first and then the "ALTER USER SET
      default_tablespace ..." statements after that - but it seems better to be
      consistent with search_path and default_text_search_config anyway. Besides,
      you could still create a dump that throws an error, by creating the
      tablespace, running "ALTER USER SET default_tablespace", then dropping the
      tablespace and running pg_dumpall on that.
      
      Backpatch to all supported versions.
      a5782570
    • Tom Lane's avatar
      Assorted comment fixes, mostly just typos, but some obsolete statements. · ad10853b
      Tom Lane authored
      YAMAMOTO Takashi
      ad10853b