Foundry APIs enforce both rate limits and concurrency limits to ensure fair resource allocation for all users:
Rate limits | Concurrency limits | |
---|---|---|
Individual users | 5,000 requests per minute | 30 simultaneous requests |
Service users | No request limit | 800 simultaneous requests |
Requests that exceed these limits will be throttled and receive 429
or 503
error responses. Implement retries using exponential backoff in your applications to handle these errors should they occur.
The effective limits experienced by users may vary from the values given above due to several factors:
We recommend performance testing when working with Palantir SDKs, especially for cases where usage is expected to be high-scale or "spiky". If you need help ensuring that your application performs at scale, contact Palantir Support.