How to connect to a Data Lab for Apache Spark™
Before you startLink to this anchor
To complete the actions presented below, you must have:
- A Scaleway account logged into the console
- Owner status or IAM permissions allowing you to perform actions in the intended Organization
- Created a Data Lab for Apache Spark™ cluster
- A valid API key
-
Click Data Lab under Managed Services on the side menu. The Data Lab for Apache Spark™ page displays.
-
Click the name of the Data Lab cluster you want to connect to. The cluster Overview page displays.
-
Click Open Notebook in the Notebook section. You are directed to the notebook login page.
-
Enter your API secret key when prompted for a password, then click Log in. You are directed to the lab’s home screen.
-
In the files list on the left, double-click the
DatalabDemo.ipynb
file to open it. -
Update the first cell of the file with your API access key and secret key, as shown below:
"spark.hadoop.fs.s3a.access.key": "[your-api-access-key]","spark.hadoop.fs.s3a.secret.key": "[your-api-secret-key]",Your notebook environment is now ready to be used.
-
Optionally, follow the instructions contained in the
DatalabDemo.ipynb
file to process a test batch of data.