Concurrency & Rate Limiting
Concurrency & Rate Limiting
Concurrency & Rate Limiting
Rate limits are enforced per IP address.
When rate limited, the API returns a 429 response:
Concurrency refers to the number of generations that can be submitted/processed concurrently. Requests to create new generations will fail with a 429 error if the concurrency limit is exceeded.
To check your generations currently in PENDING/PROCESSING state, you can use the List Generations endpoint.
Concurrency limits are defined in the subscription plan. Current limits are:
When you exceed a rate limit or concurrency limit, the API returns a 429 Too Many Requests response. This applies to both per-minute rate limits and concurrent generation limits.
What to do when you hit a 429:
Do not retry 429 responses immediately in a tight loop. This wastes requests and delays recovery. Use the retry strategy below instead.
Exponential backoff is the recommended approach for handling both rate limit and transient errors from the Sync Labs lip sync API. Each retry waits progressively longer, reducing pressure on the API and improving your success rate.
To get the most out of your plan’s concurrent generation slots:
For API rate limiting best practices and general error handling, see the Error Handling guide.