Hi,
ATM the default value for virtualNodes is 1. This means that the wheel-share each node has
can be very uneven[1] for smalls(up to 15 nodes) clusters.
Increasing this value even to a small number(10-30) would significantly improve each
node's share of wheel and the chance for a well balanced data distribution over the
cluster.
So I think that increasing the default value would make sense. What are the drawbacks
though? I'm thinking performance and HR wise...
[1] a random example of uneven distribution obtained with radargun
Cluster size: 4 -> ( 15505 13698 5918 4482)
Cluster size: 6 -> ( 8761 7820 17145 8188 12827 4183)
Cluster size: 8 -> ( 8391 6302 10773 22068 3589 200 3050 25211)