[infinispan-dev] [Cloudtm-discussion] Primary-Backup replication scheme in Infinispan

Mark Little mlittle at redhat.com
Fri Feb 18 07:03:58 EST 2011


Yeah, I know all this ... recall my PhD was on transactions and replication :-)

You can tolerate partitions if you add weighted voting schemes to the group mechanism so that only a primary in the majority partition can do any work.

Mark.


On 18 Feb 2011, at 00:59, Paolo Romano wrote:

> well, trying to make a very long story (probably too) short one may say that:
> 
> - in an asynchronous system (even if augmented with eventually perfect failure detection), classic 2PC blocks upon coordinator crashes. This, pragmatically, forces to heuristic decisions that may lead to atomicity violations. Using more expensive commit protocols, such as Paxos Commit (http://research.microsoft.com/apps/pubs/default.aspx?id=64636), one can enforce atomicity also in eventually synchronous systems (tolerating f faults out of 2f+1 replicas). 
> 
> - With primary backup (PB), in an asynchronous system the issue is the split-brain syndrome (this wiki definition is not the best ever but it's the only I could rapidly find: http://en.wikipedia.org/wiki/Split-brain_(Computing) ). Unsurprisingly, also for PB, it is also possible to design (more expensive) variants working in partially synchronous systems. An example is Vertical Paxos (research.microsoft.com/pubs/80907/podc09v6.pdf). Or assuming virtual synchrony (http://en.wikipedia.org/wiki/Virtual_synchrony),  by propagating the primary's updates via uniform reliable broadcast (also called safe delivery in virtual synchrony's gergon,http://www.cs.huji.ac.il/labs/transis/lab-projects/guide/chap3.html#safe). Or, mapping the solution to classic Paxos, by having the role of the primary coincide with that of the leader in the Multi-paxos protocol (http://en.wikipedia.org/wiki/Paxos_algorithm#Multi-Paxos). 
> 
> Note that, in an asynchronous system, the PB implementation shown in the plots might violate consistency in case of false failure suspicions of the primary, just like the current Infinispan's 2PC based replication scheme might do in case of false failure suspicions of the coordinator. So, in terms of required synchrony assumptions, the 2 considered protocols are fairly comparable.

---
Mark Little
mlittle at redhat.com

JBoss, by Red Hat
Registered Address: Red Hat UK Ltd, Amberley Place, 107-111 Peascod Street, Windsor, Berkshire, SI4 1TE, United Kingdom.
Registered in UK and Wales under Company Registration No. 3798903 Directors: Michael Cunningham (USA), Charlie Peters (USA), Matt Parsons (USA) and Brendan Lane (Ireland).




-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.jboss.org/pipermail/infinispan-dev/attachments/20110218/7075744d/attachment.html 


More information about the infinispan-dev mailing list