[infinispan-issues] [JBoss JIRA] (ISPN-11512) The mass indexer should iterate over the cache only once
Gustavo Fernandes (Jira)
issues at jboss.org
Wed Mar 25 05:13:24 EDT 2020
[ https://issues.redhat.com/browse/ISPN-11512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gustavo Fernandes updated ISPN-11512:
-------------------------------------
Status: Open (was: New)
> The mass indexer should iterate over the cache only once
> --------------------------------------------------------
>
> Key: ISPN-11512
> URL: https://issues.redhat.com/browse/ISPN-11512
> Project: Infinispan
> Issue Type: Enhancement
> Components: Indexing
> Affects Versions: 10.1.5.Final, 11.0.0.Dev03
> Reporter: Dan Berindei
> Assignee: Gustavo Fernandes
> Priority: Major
> Fix For: 11.0.0.Final
>
>
> {{DistributedExecutorMassIndexer}} starts an {{IndexWorker}} for each indexed type and submits them to all the nodes in parallel. Each {{IndexWorker}} runs a blocking iteration over the cache, and when the cache has a store, that iteration is very expensive.
> Each iteration will load and deserialize all the entries in the store. Most stores don't implement {{AbstractSegmentedStoreConfiguration}}, so Infinispan wraps them in a {{ComposedSegmentedLoadWriteStore}}, which iterates over each segment in parallel on the persistence/blocking executor. Since the default number of segments is 256 and the persistence/blocking executor has 4*cpu_count/150 max threads, it doesn't take a lot of parallel iterations to fill the blocking executor's threads and prevent the cache from doing other, more urgent, work.
--
This message was sent by Atlassian Jira
(v7.13.8#713008)
More information about the infinispan-issues
mailing list