There appears to be some performance improvements that can be made to the interceptors stack.
In particular every invocation calls Class.getDeclaredMethod() and InterceptionUtils.isIntercptionCandidate(). It should be possible to cache these which should give a performance improvement.