Commit be1e8053 authored by Tom Lane's avatar Tom Lane

Use a non-locking test in TAS_SPIN() on all IA64 platforms.

Per my testing, this works just as well with gcc as it does with HP's
compiler; and there is no reason to think that the effect doesn't occur
with icc, either.

Also, rewrite the header comment about enforcing sequencing around spinlock
operations, per Robert's gripe that it was misleading.
parent c01c25fb
...@@ -41,7 +41,7 @@ ...@@ -41,7 +41,7 @@
* *
* int TAS_SPIN(slock_t *lock) * int TAS_SPIN(slock_t *lock)
* Like TAS(), but this version is used when waiting for a lock * Like TAS(), but this version is used when waiting for a lock
* previously found to be contended. Typically, this is the * previously found to be contended. By default, this is the
* same as TAS(), but on some architectures it's better to poll a * same as TAS(), but on some architectures it's better to poll a
* contended lock using an unlocked instruction and retry the * contended lock using an unlocked instruction and retry the
* atomic test-and-set only when it appears free. * atomic test-and-set only when it appears free.
...@@ -54,10 +54,22 @@ ...@@ -54,10 +54,22 @@
* on Alpha TAS() will "fail" if interrupted. Therefore a retry loop must * on Alpha TAS() will "fail" if interrupted. Therefore a retry loop must
* always be used, even if you are certain the lock is free. * always be used, even if you are certain the lock is free.
* *
* ANOTHER CAUTION: be sure that TAS(), TAS_SPIN(), and S_UNLOCK() represent * Another caution for users of these macros is that it is the caller's
* sequence points, ie, loads and stores of other values must not be moved * responsibility to ensure that the compiler doesn't re-order accesses
* across a lock or unlock. In most cases it suffices to make the operation * to shared memory to precede the actual lock acquisition, or follow the
* be done through a "volatile" pointer. * lock release. Typically we handle this by using volatile-qualified
* pointers to refer to both the spinlock itself and the shared data
* structure being accessed within the spinlocked critical section.
* That fixes it because compilers are not allowed to re-order accesses
* to volatile objects relative to other such accesses.
*
* On platforms with weak memory ordering, the TAS(), TAS_SPIN(), and
* S_UNLOCK() macros must further include hardware-level memory fence
* instructions to prevent similar re-ordering at the hardware level.
* TAS() and TAS_SPIN() must guarantee that loads and stores issued after
* the macro are not executed until the lock has been obtained. Conversely,
* S_UNLOCK() must guarantee that loads and stores issued before the macro
* have been executed before the lock is released.
* *
* On most supported platforms, TAS() uses a tas() function written * On most supported platforms, TAS() uses a tas() function written
* in assembly language to execute a hardware atomic-test-and-set * in assembly language to execute a hardware atomic-test-and-set
...@@ -229,6 +241,9 @@ typedef unsigned int slock_t; ...@@ -229,6 +241,9 @@ typedef unsigned int slock_t;
#define TAS(lock) tas(lock) #define TAS(lock) tas(lock)
/* On IA64, it's a win to use a non-locking test before the xchg proper */
#define TAS_SPIN(lock) (*(lock) ? 1 : TAS(lock))
#ifndef __INTEL_COMPILER #ifndef __INTEL_COMPILER
static __inline__ int static __inline__ int
...@@ -735,7 +750,8 @@ typedef unsigned int slock_t; ...@@ -735,7 +750,8 @@ typedef unsigned int slock_t;
#include <ia64/sys/inline.h> #include <ia64/sys/inline.h>
#define TAS(lock) _Asm_xchg(_SZ_W, lock, 1, _LDHINT_NONE) #define TAS(lock) _Asm_xchg(_SZ_W, lock, 1, _LDHINT_NONE)
#define TAS_SPIN(lock) (*(lock) ? 1 : TAS(lock)) /* On IA64, it's a win to use a non-locking test before the xchg proper */
#define TAS_SPIN(lock) (*(lock) ? 1 : TAS(lock))
#endif /* HPUX on IA64, non gcc */ #endif /* HPUX on IA64, non gcc */
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment