All Open AI APIs potentially impose rate limiting. Ideally, any library designed to abstract the APIs should support exponential backoff. https://beta.openai.com/docs/guides/production-best-practices/managing-rate-limits-and-latency