Radim Vansa created ISPN-4310:
---------------------------------
Summary: StateResponse chunk with lastChunk=true from cancelled ST stops
receiving data in next ST
Key: ISPN-4310
URL: https://issues.jboss.org/browse/ISPN-4310
Project: Infinispan
Issue Type: Bug
Components: State Transfer
Affects Versions: 7.0.0.Alpha4, 6.0.2.Final
Reporter: Radim Vansa
Assignee: Dan Berindei
Priority: Critical
1. Node A requests a segment from node B; the response spans multiple chunks.
2. B sends all chunks, but before A receives them a new topology arrives and A cancels the
state transfer (ST).
3. Another topology arrives and A requests the same segment again.
4. A receives the old StateResponseCommand with lastChunk=true from the cancelled ST,
concludes that it has received everything, and therefore discards the chunks that follow.
The result is an inconsistent cluster and, after further rebalances, data is lost completely.
This ought to be rare, but it was observed repeatedly when gracefully stopping the
coordinator on a 32-node cluster full of data.
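
The sequence in steps 1-4 can be replayed with a small, self-contained Java sketch. This is
not Infinispan code and not a proposed patch: Chunk, SegmentReceiver and StaleChunkDemo are
hypothetical names, and the topologyId field on the chunk is an assumption made only to show
one possible guard (ignoring responses that belong to an earlier request).

```java
import java.util.ArrayList;
import java.util.List;

final class StaleChunkDemo {

    /** Hypothetical stand-in for one state response chunk (not StateResponseCommand). */
    static final class Chunk {
        final int topologyId;     // assumed: topology in which the segment was requested
        final String payload;
        final boolean lastChunk;

        Chunk(int topologyId, String payload, boolean lastChunk) {
            this.topologyId = topologyId;
            this.payload = payload;
            this.lastChunk = lastChunk;
        }
    }

    /** Hypothetical receiver for a single segment on node A. */
    static final class SegmentReceiver {
        private final boolean checkTopology;
        private int requestTopologyId = -1;
        private boolean complete;
        private final List<String> received = new ArrayList<>();

        SegmentReceiver(boolean checkTopology) {
            this.checkTopology = checkTopology;
        }

        /** Starts (or restarts) the transfer of the segment in the given topology. */
        void request(int topologyId) {
            requestTopologyId = topologyId;
            complete = false;
            received.clear();
        }

        void onChunk(Chunk chunk) {
            // Guard (assumption): drop chunks that answer an earlier, cancelled request.
            if (checkTopology && chunk.topologyId != requestTopologyId) {
                return;
            }
            // Once a last chunk has been seen, everything after it is discarded.
            if (complete) {
                return;
            }
            received.add(chunk.payload);
            if (chunk.lastChunk) {
                complete = true;
            }
        }

        List<String> received() {
            return new ArrayList<>(received);
        }
    }

    /** Replays steps 1-4 and returns the payloads node A ends up applying. */
    static List<String> replay(boolean checkTopology) {
        SegmentReceiver a = new SegmentReceiver(checkTopology);
        a.request(1);                               // step 1: request in topology 1
        a.request(3);                               // steps 2-3: cancelled, requested again
        a.onChunk(new Chunk(1, "old-2", true));     // step 4: stale last chunk arrives late
        a.onChunk(new Chunk(3, "new-1", false));    // chunks of the new transfer
        a.onChunk(new Chunk(3, "new-2", true));
        return a.received();
    }
}
```

Without the guard, replay(false) keeps only the stale chunk and drops both chunks of the new
transfer, which matches the behaviour described in step 4. With the guard, replay(true) keeps
only the chunks of the new transfer.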