We have tried invoking the single node directly and we have had even worst performance. The response time shooted upto 3 times than expected.
If we try to check the trace, the most of the time is spent during the xml functionality at different parts of the code. Its parsing, retrieving the attribute, or transformation, and its quite random during different invocations.