[JBoss JIRA] (ISPN-3281) Deadlock in non-transactional caches during rebalance

Friday, 28 June 2013

Dan Berindei created ISPN-3281:
----------------------------------

             Summary: Deadlock in non-transactional caches during rebalance
                 Key: ISPN-3281
                 URL: https://issues.jboss.org/browse/ISPN-3281
             Project: Infinispan
          Issue Type: Bug
          Components: Locking and Concurrency, State transfer
    Affects Versions: 5.3.0.Final
            Reporter: Dan Berindei
            Assignee: Mircea Markus
             Fix For: 6.0.0.Final

Say we have a cache with node A and node B joins. The cache topology id is 1,
primary_owner(k) = A in the current CH and primary_owner(k) = B in the pending CH.

1. Node A starts a put(k, v) command during the rebalance. It thinks it's the primary
owner, so it acquires the lock locally and it forwards the command to B.
2. B installs topology 2, primary_owner(k) = B in the current CH, and there is no pending
CH.
3. B receives the put(k, v) command from A. It thinks it's the primary owner, so it
acquires the lock locally and it forwards the command to A.
4. A receives the put(k, v) command from B. Again it thinks it's the primary owner and
tries to acquire the lock locally, but it times out because the lock is held by another
thread (from step 1).

I think it may be enough to update the topology id in the put(k, v) command on node B,
before forwarding it back to A. That way, the command will block on node A until topology
2 is installed, and it won't try to lock the key again.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

2026

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009