[infinispan-issues] [JBoss JIRA] (ISPN-4372) Map/Reduce performance is dependent on cache value size

Dan Berindei (JIRA) issues at jboss.org
Tue Jun 10 02:59:15 EDT 2014


    [ https://issues.jboss.org/browse/ISPN-4372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12974755#comment-12974755 ] 

Dan Berindei commented on ISPN-4372:
------------------------------------

Yes, please run the test with {{WordCountMapper}}. I would expect the memory usage of Infinispan 6 to be only slightly worse than {{WordCountMapperEmitPerValue}} with 1k values, especially if there are > 100 unique words.

It's true, we don't know whether the performance with {{WordCountMapper}} also depends on the cache value size or not. But that's exactly why I wrote my comment, we've been testing a single M/R job, so we can't really say how a generic M/R job will perform (or even the same job but with a slightly different mapper).

> Map/Reduce performance is dependent on cache value size
> -------------------------------------------------------
>
>                 Key: ISPN-4372
>                 URL: https://issues.jboss.org/browse/ISPN-4372
>             Project: Infinispan
>          Issue Type: Feature Request
>          Components: Distributed Execution and Map/Reduce
>    Affects Versions: 7.0.0.Alpha4
>            Reporter: Alan Field
>            Assignee: Vladimir Blagojevic
>              Labels: performance
>
> Performance testing the Map/Reduce changes has shown that the performance improvements vary based on the size of the values in the cache. [1] Using values from 8kB to 128kB shows a large performance increase over Infinispan 6, but smaller and larger values are the same or slower than Infinispan 6.
> http://blog.infinispan.org/2014/06/mapreduce-performance-improvements.html



--
This message was sent by Atlassian JIRA
(v6.2.3#6260)


More information about the infinispan-issues mailing list