It looks like you are missing a Human (User) Task in the middle of the process. You point out exactly where the human interaction happens, but there is no node that represents it.
If you add a Human Task there, you will get the desired behavior, because the process will stay Pending until the Human Task is completed.
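To illustrate why a Human Task acts as a wait state, here is a minimal toy sketch in Python. It is not the API of any real process engine: `ProcessInstance`, `run`, `complete_human_task`, and the node names are all hypothetical, made up only to show the process stopping at the human node and resuming once someone completes it.

```python
from dataclasses import dataclass, field
from enum import Enum


class State(Enum):
    PENDING = "pending"
    COMPLETED = "completed"


@dataclass
class ProcessInstance:
    """Toy model of a process (hypothetical, not a real engine API)."""
    nodes: list          # ordered node names
    human_tasks: set     # nodes that need a person to act
    position: int = 0
    log: list = field(default_factory=list)

    def run(self):
        """Advance until a human task is reached or the process ends."""
        while self.position < len(self.nodes):
            node = self.nodes[self.position]
            if node in self.human_tasks:
                return State.PENDING  # wait here until a person completes it
            self.log.append(node)     # automatic node: execute and move on
            self.position += 1
        return State.COMPLETED

    def complete_human_task(self):
        """Called when the person finishes the task; resumes the flow."""
        waiting = (
            self.position < len(self.nodes)
            and self.nodes[self.position] in self.human_tasks
        )
        if not waiting:
            raise RuntimeError("no human task is waiting")
        self.log.append(self.nodes[self.position])
        self.position += 1
        return self.run()


# Usage: the process stops at "Review" and only finishes after completion.
process = ProcessInstance(
    nodes=["Prepare", "Review", "Archive"],
    human_tasks={"Review"},
)
state_before = process.run()
state_after = process.complete_human_task()
```

Without the "Review" node in `human_tasks`, `run()` would go straight through to the end, which is the behavior you are seeing now.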
Cheers