NavigationContentFooter
Suggest an edit

How to connect to a Distributed Data Lab

Reviewed on 31 July 2024Published on 31 July 2024

Before you start

To complete the actions presented below, you must have:

  • A Scaleway account logged into the console
  • Owner status or IAM permissions allowing you to perform actions in the intended Organization
  • Signed up to the private beta and received a confirmation email.
  • Created a Distributed Data Lab cluster
  • A valid API key
  1. Click Data Lab under Managed Services on the side menu. The Distributed Data Lab page displays.

  2. Click the name of the Data Lab cluster you want to connect to. The cluster Overview page displays.

  3. Click Open Notebook in the Notebook section. You are directed to the notebook login page.

  4. Enter your API secret key when prompted for a password, then click Log in. You are directed to the lab’s home screen.

  5. In the files list on the left, double-click the quickstart.ipynb file to open it.

  6. Update the first cell of the file with your API access key and secret key, as shown below:

    "spark.hadoop.fs.s3a.access.key": "[your-api-access-key]",
    "spark.hadoop.fs.s3a.secret.key": "[your-api-secret-key]",

    Your notebook environment is now ready to be used.

  7. Optionally, follow the instructions contained in the quickstart.ipynb file to process a test batch of data.

See also
How to create a Data LabHow to manage and delete a Data Lab
API DocsScaleway consoleDedibox consoleScaleway LearningScaleway.comPricingBlogCareers
© 2023-2024 – Scaleway