Was this page helpful?

Rate limits

Reviewed on 09 December 2024 • Published on 27 August 2024

What are the limits?Link to this anchor

Any model served through Scaleway Generative APIs gets rate limited based on:

Base limits apply if you registered a valid payment method, and are increased automatically if you also verify your identity.

Exact Limit values are detailed in Organization quotas for Generative APIs.

Tip

If you created a Scaleway Account but did not register a valid payment method, stricter limits apply to ensure usage stays within Free Tier only.

We actively monitor usage and will improve rates based on feedback. If you need to increase your rate limits:

Verify your identity to automatically increase your rate limit as described below
Contact our support team, providing details on the model used and specific use case, for additional increase. Note that for increases of up to x5 or x10 volumes, we highly recommend using dedicated deployments with Managed Inference, which provides exactly the same features and API compatibility.

These limits safeguard against abuse or misuse of Scaleway Generative APIs, helping to ensure fair access to the API with consistent performance.

Was this page helpful?