I sometimes use Semaphores to limit the number of requests to a subsystem (See
https://github.com/Netflix/Hystrix for more elaborate solutions). However, I'm not sure what you can do with the request apart from block, queue or reject once your limit is reached.