Hello.
I had the same problem, but setting lockmode="force" on the join activities fixed it.
My process has asynchronous nested forks and asynchronous Java activities, and it loops
over these forks up to 100 times.
PS: The user guide does not list "force" as one of the allowed values for lockmode.