Quickstart
Learn how to access, configure and use a Generative APIs endpoint in a few steps.
View Quickstart

Generative APIs provide access to pre-configured serverless endpoints for the most popular AI models, hosted in European data centers and priced per 1M tokens used.
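Because pricing is expressed per 1M tokens, the cost of a workload can be estimated with simple arithmetic. The sketch below is only an illustration: the function name `estimate_cost` and the price used in the example are hypothetical placeholders, not actual Scaleway rates, so check the pricing page for real figures.

```python
def estimate_cost(tokens_used: int, price_per_million: float) -> float:
    """Estimate the cost of `tokens_used` tokens at a per-1M-token price."""
    if tokens_used < 0 or price_per_million < 0:
        raise ValueError("tokens_used and price_per_million must be non-negative")
    # Convert the token count to millions of tokens, then apply the unit price.
    return tokens_used / 1_000_000 * price_per_million


if __name__ == "__main__":
    # Hypothetical price per 1M tokens, for illustration only.
    example_price = 0.80
    print(f"Estimated cost: {estimate_cost(2_500_000, example_price):.2f}")
```

If input and output tokens are billed at different rates for a given model, the same function can be called once per rate and the results summed.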
Core concepts that give you a better understanding of Scaleway Generative APIs.
View Concepts

Check our guides on using Generative APIs endpoints.
View How-tos

Guides to help you choose a Generative APIs endpoint and understand pricing and advanced configuration.
View additional content

Llama 3.1 70B is now deprecated. The new Llama 3.3 70B is available, with similar or better performance in most use cases. Llama 3.1 70B will remain available through the API until May 25th, 2025. After that date, requests will be redirected automatically to the Llama 3.3 70B API. Llama 3.1 8B is not affected by this change and remains supported.
DeepSeek R1 and DeepSeek R1 Distilled Llama 70B are now available in Preview on Generative APIs.
DeepSeek R1 is an open-weight reasoning model that matches the performance of proprietary models. The distilled version improves Llama model performance on reasoning tasks such as mathematics or code.
Llama 3.3 70B is now available on Generative APIs.
Llama 3.3 is a fine-tuned version of the Llama 3.1 70B model, designed to approach the performance of Llama 3.1 405B in some applications.
Visit our Help Center and find the answers to your most frequent questions.
Visit Help Center