Yes, that is what I had in mind Gustavo. I'll see how I can adapt these Mapper/Reducer classes for the sorting benchmark. We can run our native, Hadoop/HDFS and Hadoop/Infinispan variants. That way we could get some really interesting results.

Vladimir
On 2014-07-31, 6:03 AM, Gustavo Fernandes wrote:
Yes, that would be interesting!
 
Also, what do you think of running this benchmark on top of Infinispan native map-reduce implementation, to compare with Hadoop/HDFS and Hadoop/Infinispan?


Gustavo


On Thu, Jul 31, 2014 at 9:20 AM, Vladimir Blagojevic <vblagoje@redhat.com> wrote:
Pedro/Gustavo,

How do you plan to benchmark our Hadoop implementation? It seems TeraSort benchmark suite is an interesting option. Maybe not using 1 TB data set right away, but eventually, why not? Especially now that we can easily run 500 nodes cluster on GCE. I would love to see if we can, when you guys start benchmarking our Hadoop impl, give TeraSort a run on a regular Map/Reduce implementation as well.

What do you think?

Vladimir

[1] http://www.michael-noll.com/blog/2011/04/09/benchmarking-and-stress-testing-an-hadoop-cluster-with-terasort-testdfsio-nnbench-mrbench/#terasort-benchmark-suite

_______________________________________________
infinispan-dev mailing list
infinispan-dev@lists.jboss.org
https://lists.jboss.org/mailman/listinfo/infinispan-dev



_______________________________________________
infinispan-dev mailing list
infinispan-dev@lists.jboss.org
https://lists.jboss.org/mailman/listinfo/infinispan-dev