How to connect to a Distributed Data Lab
Before you start
To complete the actions presented below, you must have:
- A Scaleway account logged into the console
- Owner status or IAM permissions allowing you to perform actions in the intended Organization
- Signed up for the private beta and received a confirmation email
- Created a Distributed Data Lab cluster
- A valid API key
- Click Data Lab under Managed Services on the side menu. The Distributed Data Lab page displays.
- Click the name of the Data Lab cluster you want to connect to. The cluster Overview page displays.
- Click Open Notebook in the Notebook section. You are directed to the notebook login page.
- Enter your API secret key when prompted for a password, then click Log in. You are directed to the lab’s home screen.
- In the files list on the left, double-click the quickstart.ipynb file to open it.
- Update the first cell of the file with your API access key and secret key, as shown below:
  "spark.hadoop.fs.s3a.access.key": "[your-api-access-key]",
  "spark.hadoop.fs.s3a.secret.key": "[your-api-secret-key]",
  Your notebook environment is now ready to be used.
- Optionally, follow the instructions contained in the quickstart.ipynb file to process a test batch of data.
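The credential step above can also be sketched in Python, for readers who prefer not to paste keys directly into a notebook cell. This is a minimal sketch under stated assumptions, not the content of quickstart.ipynb: the helper name build_s3a_settings and the environment variable names SCW_ACCESS_KEY and SCW_SECRET_KEY are illustrative choices, and only the two spark.hadoop.fs.s3a.* setting names come from the step itself.

```python
import os


def build_s3a_settings(access_key: str, secret_key: str) -> dict[str, str]:
    """Return the two S3A credential settings named in the quickstart step.

    Hypothetical helper for illustration; it is not part of the notebook.
    """
    if not access_key or not secret_key:
        raise ValueError("both the API access key and the secret key are required")
    return {
        "spark.hadoop.fs.s3a.access.key": access_key,
        "spark.hadoop.fs.s3a.secret.key": secret_key,
    }


# Assumption: the keys are exported as environment variables under these names.
# The placeholders from the documentation are used when they are not set.
settings = build_s3a_settings(
    os.environ.get("SCW_ACCESS_KEY", "[your-api-access-key]"),
    os.environ.get("SCW_SECRET_KEY", "[your-api-secret-key]"),
)

# Print only the setting names so the secret value never appears in cell output.
for name in settings:
    print(name)

# If the notebook builds its session with PySpark, the settings could be
# applied one by one (assumption: the quickstart may do this differently):
#
#   from pyspark.sql import SparkSession
#   builder = SparkSession.builder
#   for name, value in settings.items():
#       builder = builder.config(name, value)
```

Reading the keys from the environment keeps them out of the saved notebook file, which matters if the notebook is later shared or committed to version control.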