On 2 Mar 2011, at 17:43, Manik Surtani wrote:
As consistency models go, Infinispan is primarily strongly consistent
(with 2-phase commit between data owners), with the exception of during a rehash where
because of eventual consistency (inability to get a valid response to a remote GET) forces
us to wait for more responses, a quorum if you like. Not dissimilar to PAXOS [1] in some
ways.
We are strongly consistent even during the rehash - we make sure that the user
receive's the last piece of data or no data at all (TimeouException).
I'm wondering whether, for the sake of performance, we should also offer a fully
eventually consistent model? What I am thinking is that changes *always* occur only on
the primary data owner. Single phase, no additional round trips, etc. The primary owner
then asynchronously propagates changes to the other data owners. This would mean things
run much faster in a stable cluster, and durability is maintained.
Don't we
already do that with "eagerLockSingleNode"?
However, during rehashes when keys are moved, the notion of the
primary owner may change. So to deal with this, we could use vector clocks [2] to version
each entry. Vector clocks allow us to "merge" state nicely in most cases, and
in the case of reads, we'd flip back to a PAXOS style quorum during a rehash to get
the most "correct" version.
In terms of implementation, almost all of this would only affect the
DistributionInterceptor and the DistributionManager, so we could easily have eventually
consistent flavours of these two components.
Thoughts?
+1