When you exceed the limit
Requests over the limit receive an HTTP429 Too Many Requests response. Treat a 429 as a signal to slow down, not as a failure to retry immediately.
- Back off exponentially, with jitter. On a 429, wait before retrying and double the wait on each subsequent 429 (for example 0.5s, 1s, 2s, 4s), adding a small random jitter so retries from parallel workers don’t resynchronize.
- Honor
Retry-Afterwhen present. If a 429 response includes aRetry-Afterheader, wait at least that long before retrying instead of using your own delay. - Retry idempotently. Send your own
externalIdon order and job requests so a retried call references the same record instead of creating a duplicate (see API Key and Org ID). - Smooth your own traffic. For bulk work, cap your client’s outbound rate below the limit (a token-bucket or a small concurrency limit) rather than bursting and relying on retries.