Hey all,
I am trying to get Infinispan 4.2.0.FINAL up and running on a large cluster (think 1000 nodes)
for a project at work. I'm running into some issues and have been scouring the JIRA
issues and forums. I have gotten almost zero responses on the forums. The Infinispan
configuration is:
  locking: isolationLevel=READ_COMMITTED, lockAcquisitionTimeout=50000,
           writeSkewCheck=false, concurrencyLevel=512, useLockStriping=false
  transaction: syncRollbackPhase=false, syncCommitPhase=false, useEagerLocking=false
  lazyDeserialization: enabled=false
  invocationBatching: enabled=true
  eviction: wakeUpInterval=1000, maxEntries=-1, strategy=FIFO
  clustering: distribution
    sync
    hash: numOwners=2, rehashRpcTimeout=600000
    l1: enabled=true, lifespan=600000
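In XML form, the settings above would look roughly like the fragment below. This is only a sketch written from memory of the 4.x schema: I am assuming everything sits under the <default> element, and the element and attribute names should be checked against the schema that ships with 4.2.0.FINAL.

```xml
<!-- Sketch only: names assumed from the Infinispan 4.x schema, values taken from the list above -->
<infinispan>
  <default>
    <locking isolationLevel="READ_COMMITTED"
             lockAcquisitionTimeout="50000"
             writeSkewCheck="false"
             concurrencyLevel="512"
             useLockStriping="false"/>
    <transaction syncRollbackPhase="false"
                 syncCommitPhase="false"
                 useEagerLocking="false"/>
    <lazyDeserialization enabled="false"/>
    <invocationBatching enabled="true"/>
    <eviction wakeUpInterval="1000" maxEntries="-1" strategy="FIFO"/>
    <!-- Distributed mode with synchronous replication, two owners per key, L1 cache on -->
    <clustering mode="distribution">
      <sync/>
      <hash numOwners="2" rehashRpcTimeout="600000"/>
      <l1 enabled="true" lifespan="600000"/>
    </clustering>
  </default>
</infinispan>
```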
I think this has more to do with JGroups than Infinispan. Initially we were seeing lots of
little clusters form. Increasing the number of initial members in the PING section of the
jgroups-udp.xml file and increasing the timeouts seems to have made things better, but not
great. Even at 20 nodes, we are seeing messages saying that the coordinator is failing to flush.
Does anyone have experience running Infinispan at a large scale (100+ nodes) and can shed
some light on which configuration items need to be changed to run at this scale? The
default configuration does not appear to scale to a cluster of this size. Any help would be
appreciated, as my colleagues are starting to question my choice of cache implementation.
Dave Marion