Quickstart
Learn how to create, connect to, and delete a Managed Inference endpoint in a few steps.
View Quickstart

Effortlessly deploy AI models on sovereign infrastructure, and manage and scale inference with full data privacy. Start now with a simple interface for creating dedicated endpoints.

Concepts
Core concepts that give you a better understanding of Scaleway Managed Inference.
View Concepts

How-tos
Check our guides about creating and managing Managed Inference endpoints.
View How-tos

Additional content
Guides to help you choose a Managed Inference endpoint, and understand pricing and advanced configuration.
View additional content

API
Learn how to create and manage your Scaleway Managed Inference endpoints through the API.
Go to Managed Inference API

Llama 3.1 Nemotron 70B and Molmo 72B are available for deployment on Managed Inference.
Nemotron improves human-like responses in complex tasks, while Molmo provides increased accuracy on multimodal inputs (text and images).
Managed Inference deployments can be created and managed with Infrastructure as Code using Scaleway's Terraform provider. Find example usage and a reference for the new scaleway_inference_deployment resource in the official documentation.
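As a sketch, a minimal deployment using the scaleway_inference_deployment resource might look like the following. The attribute names and values shown here (deployment name, node type, model name, region) are assumptions for illustration; check the provider's official reference for the exact schema.

```hcl
# Hypothetical minimal Managed Inference deployment; attribute
# names and values are assumptions, not the verified schema.
resource "scaleway_inference_deployment" "example" {
  name       = "my-inference-deployment"
  node_type  = "L4"                       # assumed GPU node type
  model_name = "meta/llama-3.1-8b-instruct" # assumed model identifier
  region     = "fr-par"                   # assumed region
}
```

Running `terraform plan` against such a configuration would show the deployment to be created before any infrastructure is touched.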
Function calling allows a large language model (LLM) to interact with external tools or APIs. The tools and tool_choice parameters of our OpenAI-compatible chat API are now accepted for models with this capability. Read our dedicated documentation and tutorial to get started!
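The flow can be sketched as follows: describe a tool in the JSON-schema format the chat API expects, pass it via the tools parameter, then execute the tool call the model returns. The get_weather tool, the model name, and the commented request are illustrative assumptions rather than Scaleway specifics; the tool-call handling is simulated locally so the sketch runs without an endpoint.

```python
import json

# 1. Describe the tool in the JSON-schema format used by
#    OpenAI-compatible chat APIs.
tools = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]

def get_weather(city: str) -> str:
    # Stand-in implementation; a real tool would call a weather API.
    return f"Sunny in {city}"

# 2. In a real request you would pass tools (and optionally
#    tool_choice) to the chat completions endpoint, e.g. with an
#    OpenAI-compatible client pointed at your endpoint URL:
#
#    client.chat.completions.create(
#        model="llama-3.1-nemotron-70b",  # assumed model id
#        messages=[{"role": "user", "content": "Weather in Paris?"}],
#        tools=tools,
#        tool_choice="auto",
#    )
#
# 3. When the model answers with a tool call, execute it and feed the
#    result back as a "tool" message. Here we simulate such a call:
simulated_tool_call = {
    "name": "get_weather",
    "arguments": json.dumps({"city": "Paris"}),
}

dispatch = {"get_weather": get_weather}
args = json.loads(simulated_tool_call["arguments"])
result = dispatch[simulated_tool_call["name"]](**args)
print(result)  # Sunny in Paris
```

The dispatch table keeps tool execution explicit: the model only chooses the tool name and arguments, while your code decides which functions are actually callable.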
Visit our Help Center and find the answers to your most frequent questions.
Visit Help Center