Limits & Quotas
Rate Limits
Rate limits are applied per API key. Contact Shunyalabs to request limit increases for high-volume production workloads.
| Limit Type | Default | Notes |
|---|---|---|
| Concurrent requests (batch) | 16 | Per API key. `RateLimitError` (429) is raised when the limit is exceeded. |
| Concurrent WebSocket sessions | 16 | Per API key. |
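
One way to stay under the batch concurrency limit is to cap in-flight requests on the client and back off when a 429 is returned. The following is a minimal sketch, not the SDK's own mechanism: `RateLimitError` is defined locally as a stand-in for the error named in the table (its real import path is not shown here), and `request_fn`, `call_with_limit`, and `run_batch` are hypothetical names introduced for illustration.

```python
import asyncio
import random

MAX_CONCURRENT = 16  # default batch concurrency limit from the table above


class RateLimitError(Exception):
    """Local stand-in for the SDK's RateLimitError (429); assumed, not the real class."""


async def call_with_limit(semaphore, request_fn, *args, max_retries=5, base_delay=0.5):
    """Run request_fn while holding the semaphore; retry with backoff on RateLimitError."""
    for attempt in range(max_retries + 1):
        try:
            async with semaphore:
                return await request_fn(*args)
        except RateLimitError:
            if attempt == max_retries:
                raise
            # The semaphore is already released here, so waiting does not hold a slot.
            # Exponential backoff with a little jitter to avoid synchronized retries.
            await asyncio.sleep(base_delay * (2 ** attempt) + random.uniform(0, base_delay))


async def run_batch(request_fn, items, limit=MAX_CONCURRENT, **kwargs):
    """Submit one request per item, never exceeding `limit` requests in flight."""
    semaphore = asyncio.Semaphore(limit)
    tasks = [call_with_limit(semaphore, request_fn, item, **kwargs) for item in items]
    return await asyncio.gather(*tasks)
```

Here `request_fn` would be whatever coroutine issues a single batch request with your API key. Because limits are applied per API key, a cap like this only helps if every process sharing that key is counted against the same budget.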