How to query embedding models

Reviewed on 30 October 2024 • Published on 28 August 2024

Scaleway’s Generative APIs service allows users to interact with embedding models hosted on the platform. The embedding API provides a simple interface for generating vector representations (embeddings) of your input data. The embedding service is OpenAI-compatible; refer to OpenAI’s embedding documentation for more detailed information.

Before you start

To complete the actions presented below, you must have:

A Scaleway account logged into the console
Owner status or IAM permissions allowing you to perform actions in the intended Organization
A valid API key for API authentication
Python 3.7+ installed on your system

Querying embedding models via API

The embedding model takes text as input and returns a vector (list) of floating-point numbers that you can use for tasks such as similarity comparison and search. The instructions below show you how to query the model programmatically using the OpenAI SDK.

Installing the OpenAI SDK

First, ensure you have the OpenAI SDK installed in your development environment. You can install it using pip:

pip install openai

Initializing the client

Initialize the OpenAI client with your base URL and API key:

from openai import OpenAI
# Initialize the client with your base URL and API key
client = OpenAI(
    base_url="https://api.scaleway.ai/v1",  # Scaleway's Generative APIs service URL
    api_key="<SCW_API_KEY>"  # Your unique API key from Scaleway
)
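
Hardcoding the key in source code makes it easy to leak, so a common alternative is to read it from the environment. The sketch below assumes the key was exported under the variable name `SCW_SECRET_KEY`; that name and the `load_api_key` helper are illustrative choices for this example, not something the service requires.

```python
import os


def load_api_key(env_var: str = "SCW_SECRET_KEY") -> str:
    """Return the API key stored in the given environment variable.

    The variable name is an assumption made for this sketch; use whatever
    name you exported the key under.
    """
    api_key = os.environ.get(env_var, "").strip()
    if not api_key:
        raise RuntimeError(f"Environment variable {env_var} is not set or is empty")
    return api_key


# Usage with the client shown above (requires the openai package):
# client = OpenAI(base_url="https://api.scaleway.ai/v1", api_key=load_api_key())
```

Failing early with a clear message is usually easier to debug than an authentication error returned later by the API.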

Generating embeddings with bge-multilingual-gemma2

You can now generate embeddings using the bge-multilingual-gemma2 model, as shown in the following example:

# Generate embeddings using the 'bge-multilingual-gemma2' model
embedding_response = client.embeddings.create(
    input="Artificial Intelligence is transforming the world.",
    model="bge-multilingual-gemma2"
)
# Output the embedding vector
print(embedding_response.data[0].embedding)

This code sends input text to the bge-multilingual-gemma2 embedding model and returns a vector representation of the text. The bge-multilingual-gemma2 model is specifically designed for generating high-quality sentence embeddings.
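
Once you have two such vectors, a common way to compare them is cosine similarity. The following is a minimal sketch in plain Python; the `cosine_similarity` helper is a hypothetical name introduced here, and the short toy vectors stand in for real embeddings, which are much longer.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("Vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("Cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Toy 3-dimensional vectors stand in for real embeddings here.
same_direction = cosine_similarity([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])  # close to 1.0
orthogonal = cosine_similarity([1.0, 0.0, 0.0], [0.0, 1.0, 0.0])  # exactly 0.0
```

Values near 1 indicate vectors pointing in nearly the same direction, which for embeddings is typically read as semantically similar text. For large collections, a vectorized library or a vector database would usually replace this loop-based version.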

Model parameters and their effects

The following parameters can be adjusted to influence the output of the embedding model:

input (string or array of strings): The text you want to convert into vectors. Pass an array to embed several texts in a single request.
model (string): The specific embedding model to use. See the list of supported models for the available options.
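
To illustrate the array form of `input`, here is a hedged sketch of a helper that embeds several texts in one call. The `embed_texts` name is hypothetical, and the code assumes the response follows the OpenAI embeddings format, in which each returned item exposes an `index` and an `embedding` attribute.

```python
from typing import List, Sequence


def embed_texts(
    client,
    texts: Sequence[str],
    model: str = "bge-multilingual-gemma2",
) -> List[List[float]]:
    """Embed several texts in one request and return vectors in input order.

    `client` is expected to be an initialized OpenAI client (assumption:
    the response follows the OpenAI embeddings format).
    """
    response = client.embeddings.create(input=list(texts), model=model)
    # Each item carries the index of the input it belongs to; sort on it
    # rather than relying on the order of the returned list.
    ordered = sorted(response.data, key=lambda item: item.index)
    return [item.embedding for item in ordered]


# Usage (requires an initialized OpenAI client):
# vectors = embed_texts(client, ["First sentence.", "Second sentence."])
```

Sending texts in batches reduces the number of round trips compared with one request per string.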

If you encounter an error such as “Forbidden 403”, refer to the API documentation for troubleshooting tips.