Server location hints in Infinispan

Monday, 22 March 2010

This relates to https://jira.jboss.org/jira/browse/ISPN-180.

In JBoss Cache, we had a provision to allow for pluggable buddy selection algorithms.  By
default, the buddy selection process would first try and pick a buddy in the same buddy
group, failing which any buddy *not* on the same physical machine, failing which any buddy
not in the same JVM, and finally any buddy at all.  Further, being pluggable, people could
write their own buddy selection algorithms to pick buddies based on any additional
metrics, such as machine performance by hooking into monitoring tools, etc.

In Infinispan we do not have an equivalent as yet.  The consistent hash approach to
distribution takes a hash of each server's address and uses this to place the server
on a consistent hash wheel.  Owners for keys are picked based on consecutive places on the
wheel.  So there is every possibility that nodes on the same physical host or rack are
selected to back each other up, which is not optimal for data durability.  

One approach is for each node to provide additional hints as to where it is - hints
including "machine id", "rack id" and maybe even "site id". 
The hash function that calculates an addresses position on the hash wheel would take these
3 metrics into account, so this should be robust and pretty efficient.  The only drawback
with this approach is that for each address, this additional data needs to be globally
available since CH's need to work globally and deterministically.  This information
could be a part of a DIST JOIN request, which would work well.

What do people think?  Any interesting alternate approaches to this problem?

Cheers
Manik

--
Manik Surtani
manik(a)jboss.org
Lead, Infinispan
Lead, JBoss Cache
http://www.infinispan.org
http://www.jbosscache.org

2026

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009