LLM Systems and Python Integration Assessment
When calling an LLM API from Python, you encounter intermittent 429 and 503 errors. What is the most appropriate production approach?
A. Retry immediately in a tight loop until the request succeeds
B. Switch to a different model automatically on every failure
C. Add exponential backoff with jitter, honor Retry-After headers, cap retries, and degrade gracefully if limits persist
D. Increase max tokens to reduce error frequency
E. Remove timeouts so the request can eventually complete
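For reference, the strategy described in option C could be sketched roughly as follows. This is a minimal illustration, not a definitive implementation: `send`, `call_with_retries`, `backoff_delay`, `parse_retry_after`, the `(status, headers, body)` response shape, and the default values are all hypothetical names and assumptions made for the example, and a real client would plug in its own HTTP library and timeout settings.

```python
import random
import time
from typing import Callable, Mapping, Optional, Tuple

# Hypothetical response shape for this sketch: (status_code, headers, body).
Response = Tuple[int, Mapping[str, str], str]

RETRYABLE_STATUSES = {429, 503}


def backoff_delay(attempt: int, base: float = 0.5, cap: float = 30.0) -> float:
    """Full-jitter backoff: a random delay in [0, min(cap, base * 2**attempt)]."""
    return random.uniform(0.0, min(cap, base * (2 ** attempt)))


def parse_retry_after(headers: Mapping[str, str]) -> Optional[float]:
    """Return Retry-After as seconds when it is numeric; skip the HTTP-date form.

    Assumes the header key is spelled exactly "Retry-After"; real HTTP
    libraries usually expose case-insensitive header mappings.
    """
    value = headers.get("Retry-After")
    if value is None:
        return None
    try:
        return max(0.0, float(value))
    except ValueError:
        return None


def call_with_retries(
    send: Callable[[], Response],
    max_retries: int = 5,
    fallback: str = "Service is busy, please try again later.",
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call send(), retrying 429/503 with backoff, then degrade to a fallback."""
    for attempt in range(max_retries + 1):
        status, headers, body = send()
        if status == 200:
            return body
        if status not in RETRYABLE_STATUSES:
            # Other errors are not retried in this sketch.
            raise RuntimeError(f"non-retryable status {status}")
        if attempt == max_retries:
            break  # retry budget exhausted
        # Prefer the server's hint; otherwise use jittered exponential backoff.
        delay = parse_retry_after(headers)
        if delay is None:
            delay = backoff_delay(attempt)
        sleep(delay)
    # Graceful degradation: return a fallback instead of failing hard.
    return fallback
```

The `sleep` parameter is injectable only so the retry schedule can be exercised without actually waiting; in production the default `time.sleep` (or an async equivalent) would be used, and `send` would carry its own request timeout.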