[infinispan-issues] [JBoss JIRA] (ISPN-2950) In distributed mode cache store data should be read through the main data owner (vs directly from the store)

Friday, 5 July 2013

    [
https://issues.jboss.org/browse/ISPN-2950?page=com.atlassian.jira.plugin....
] 

William Burns commented on ISPN-2950:
-------------------------------------

Also I don't think it is feasible to limit this to only the primary data owner.  By
doing that we would also have to limit all gets when a cache loader is enabled to only
ever hit the data owner which would cause a read scalability issue.  Also owners that
aren't primary don't ask the primary owner for data.  A non shared cache store
would persist to both locations as well.  As long as the cluster doesn't go down then
it should be consistent.  Once the cluster goes down and if purgeOnStartup isn't
enabled then there is no way to guarantee if data is stale or not in the cache store.

...
 In distributed mode cache store data should be read through the main
data owner (vs directly from the store)

------------------------------------------------------------------------------------------------------------

                 Key: ISPN-2950
                 URL: https://issues.jboss.org/browse/ISPN-2950
             Project: Infinispan
          Issue Type: Bug
          Components: Loaders and Stores
            Reporter: Sanne Grinovero
            Assignee: William Burns
            Priority: Blocker
              Labels: onboard
             Fix For: 6.0.0.Final

 Dist cache with a cache store (shared or not), k owned by \{N1, N2\}. k is read on N3.
What currently happens at this stage, if k is not present in N3's memory (likely
unless L1 is configured), the N3's cache store is queried and data is loaded from
there. This has several drawbacks:
 - the data might already be in the memory of the owner node (N1,N2) so reading it from
the disk is highly inefficient. Especially for hot data: data requested from various nodes
at the same time (see also mailing list discussion around lucene query performance
depending on this)
 - if this is a local cache store, it might contain stale data which would be returned to
the user
 - for async configured cache store this would result in dirty reads, given that a change
might be in the async store's memory but not in the store at the moment when it is in
read by N3. (Note that using async stores still leaves place to inconsistencies when a
node leaves, e.g. because of node crashing before managing to flush the async store.)
 This JIRA is about changing the distribution mode: when asked for a specific key, a node
would only touch a cache store if it is an owner of that key, otherwise would first go to
the main owner of the key to read the value from there. The ClusterCacheLoader should be
deprecated as well. 
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

2026

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

[infinispan-issues] [JBoss JIRA] (ISPN-2950) In distributed mode cache store data should be read through the main data owner (vs directly from the store)