Model-as-a-service Solutions Pricing

Model	Type	Input tokens (per million)	Output tokens (per million)
gemma-3-27b-it	Text generation & Image analysis	€0.25	€0.50
mistral-small-3.1-24b-instruct-2503	Text generation & Image analysis	€0.15	€0.35
llama-3.1-8b-instruct	Text generation	€0.20	€0.20
llama-3.1-70b-instruct	Text generation	€0.90	€0.90
llama-3.3-70b-instruct	Text generation	€0.90	€0.90
mistral-nemo-instruct-2407	Text generation	€0.20	€0.20
qwen2.5-coder-32b-instruct	Code generation	€0.90	€0.90
pixtral-12b-2409	Image analysis	€0.20	€0.20
bge-multilingual-gemma2	Embedding	€0.10	N/A
deepseek-r1-distill-llama-70b	Text generation	€0.90	€0.90
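
As a rough illustration of how the per-token prices in the table above translate into a bill, here is a minimal Python sketch. The function name `estimate_cost` and the `PRICES` dictionary are hypothetical helpers, not part of any official SDK; only a few rows of the table are copied in, and the sketch assumes that input and output tokens are billed independently at the listed pre-tax rates.

```python
# Prices in EUR per million tokens (before tax), copied from the table above.
# Each entry is (input price, output price); None means output is not applicable.
PRICES = {
    "gemma-3-27b-it": (0.25, 0.50),
    "mistral-small-3.1-24b-instruct-2503": (0.15, 0.35),
    "llama-3.1-8b-instruct": (0.20, 0.20),
    "bge-multilingual-gemma2": (0.10, None),  # embedding model, output listed as N/A
}

TOKENS_PER_UNIT = 1_000_000  # prices are quoted per million tokens


def estimate_cost(model: str, input_tokens: int, output_tokens: int = 0) -> float:
    """Return an estimated pre-tax cost in EUR (illustrative helper, an assumption)."""
    input_price, output_price = PRICES[model]
    cost = input_tokens / TOKENS_PER_UNIT * input_price
    if output_price is None:
        # Assumption: a model with no output price cannot be billed for output tokens.
        if output_tokens:
            raise ValueError(f"{model} has no output token price")
        return cost
    return cost + output_tokens / TOKENS_PER_UNIT * output_price


if __name__ == "__main__":
    # 2 million input tokens and 0.5 million output tokens on gemma-3-27b-it
    print(round(estimate_cost("gemma-3-27b-it", 2_000_000, 500_000), 2))
```

With the figures above, 2 million input tokens and 500,000 output tokens on gemma-3-27b-it come to 2 × €0.25 + 0.5 × €0.50 = €0.75 before tax.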

Managed Inference

Deploy your managed AI infrastructure with dedicated GPUs and optimized models. You are charged for usage of the GPU type you choose. Billing starts only once the model is deployed.

Model	GPU	Price (per hour)	Approx. per month
llama-3.1-8b-instruct	L4-1-24G	€0.93	~€679
llama-3.1-8b-instruct	L40S-1-48G	€1.72	~€1256
llama-3.1-8b-instruct	H100-1-80G	€3.40	~€2482
llama-3.1-8b-instruct	H100-2-80G	€6.68	~€4876
llama-3.3-70b-instruct	H100-2-80G	€6.68	~€4876
llama-3.1-70b-instruct	H100-1-80G	€3.40	~€2482
llama-3.1-70b-instruct	H100-2-80G	€6.68	~€4876
llama-3.1-nemotron-70b-instruct	H100-1-80G	€3.40	~€2482
llama-3.1-nemotron-70b-instruct	H100-2-80G	€6.68	~€4876
mistral-7b-instruct-v0.3	L4-1-24G	€0.93	~€679
mistral-7b-instruct-v0.3	L40S-1-48G	€1.72	~€1256
mistral-7b-instruct-v0.3	H100-1-80G	€3.40	~€2482
mistral-7b-instruct-v0.3	H100-2-80G	€6.68	~€4876
mixtral-8x7b-instruct-v0.1	H100-1-80G	€3.40	~€2482
mixtral-8x7b-instruct-v0.1	H100-2-80G	€6.68	~€4876
mistral-nemo-instruct-2407	L40S-1-48G	€1.72	~€1256
mistral-nemo-instruct-2407	H100-1-80G	€3.40	~€2482
mistral-nemo-instruct-2407	H100-2-80G	€6.68	~€4876
pixtral-12b-2409	L40S-1-48G	€1.72	~€1256
pixtral-12b-2409	H100-1-80G	€3.40	~€2482
pixtral-12b-2409	H100-2-80G	€6.68	~€4876
molmo-72b-0924	H100-2-80G	€6.68	~€4876
qwen2.5-coder-32b-instruct	H100-1-80G	€3.40	~€2482
qwen2.5-coder-32b-instruct	H100-2-80G	€6.68	~€4876
bge-multilingual-gemma2	L4-1-24G	€0.93	~€679
bge-multilingual-gemma2	L40S-1-48G	€1.72	~€1256
sentence-t5-xxl	L4-1-24G	€0.93	~€679
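
The hourly and monthly columns above can be related with a short Python sketch. The page does not state how the monthly approximation is derived; the listed figures happen to match 730 hours per month (365 × 24 / 12), so that value is used here as an assumption. The names `HOURLY_PRICES`, `deployment_cost`, and `approx_monthly` are hypothetical helpers for illustration only.

```python
# Hourly GPU prices in EUR (before tax), copied from the table above.
HOURLY_PRICES = {
    "L4-1-24G": 0.93,
    "L40S-1-48G": 1.72,
    "H100-1-80G": 3.40,
    "H100-2-80G": 6.68,
}

# Assumption: the "approx. per month" column uses 730 hours (365 * 24 / 12).
# This is inferred from the listed figures, not stated on the page.
HOURS_PER_MONTH = 730


def deployment_cost(gpu: str, hours: float) -> float:
    """Estimated pre-tax cost in EUR for keeping a model deployed for `hours`."""
    return HOURLY_PRICES[gpu] * hours


def approx_monthly(gpu: str) -> int:
    """Monthly estimate rounded to the nearest euro, under the 730-hour assumption."""
    return round(deployment_cost(gpu, HOURS_PER_MONTH))


if __name__ == "__main__":
    for gpu in HOURLY_PRICES:
        print(f"{gpu}: ~EUR {approx_monthly(gpu)} per month")
```

Because billing is tied to deployment time rather than request volume, the same helper can estimate shorter deployments: for example, `deployment_cost("L4-1-24G", 100)` estimates 100 hours on an L4 at about €93 before tax.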

Legal notice

Prices before tax
