Tuesday, December 25, 2018

How to create a new Cluster in Databricks?

A cluster is where the code actually runs. Code is generally written in notebooks and can be executed in two ways: (a) interactively in a notebook, or (b) through scheduled jobs. Databricks makes it easy to manage clusters (create, edit, start, terminate, delete, clone, and so on).



Steps to create a new cluster


Step 1: Log in to your Databricks account






Step 2: You will see the “Welcome to databricks” screen




Step 3: Click on Clusters










Step 4: Click on “Create Cluster”; you will get the screen below





Step 5: Enter a Cluster Name (in the example below it is “ClusterOne”), then select the Databricks Runtime Version, Python Version, and Availability Zone. Click the “Create Cluster” button.
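The same settings can also be submitted programmatically instead of through the form. The following is a minimal sketch, assuming the Databricks Clusters REST API endpoint `POST /api/2.0/clusters/create` and a personal access token. The runtime version string, node type, zone, worker count, and auto-termination value are placeholder assumptions, not values taken from this walkthrough, and the Python Version setting from the form is not covered here.

```python
import json
import urllib.request


def build_cluster_spec(name, spark_version, node_type_id, zone_id,
                       num_workers=1, autotermination_minutes=120):
    """Return the JSON-serializable body for a cluster-create request."""
    return {
        "cluster_name": name,
        "spark_version": spark_version,
        "node_type_id": node_type_id,
        "num_workers": num_workers,
        "autotermination_minutes": autotermination_minutes,
        "aws_attributes": {"zone_id": zone_id},
    }


def create_cluster(host, token, spec):
    """POST the spec to the workspace and return the parsed JSON response."""
    request = urllib.request.Request(
        url=f"{host}/api/2.0/clusters/create",
        data=json.dumps(spec).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


# Placeholder values (assumptions); substitute ones valid in your workspace.
spec = build_cluster_spec(
    name="ClusterOne",
    spark_version="5.1.x-scala2.11",
    node_type_id="i3.xlarge",
    zone_id="us-west-2a",
)
print(json.dumps(spec, indent=2))

# Uncomment to send the request to a real workspace:
# create_cluster("https://<your-workspace-url>", "<personal-access-token>", spec)
```

Building the request body in its own function keeps the settings easy to inspect before anything is sent; the network call only happens when `create_cluster` is invoked with a real workspace URL and token.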



Step 6: The new cluster will be created after a few minutes, as shown below.
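Because creation takes a few minutes, a script that depends on the cluster usually has to wait for it. The following is a rough sketch of a polling loop, not an official client. It assumes the cluster reports state strings such as `PENDING`, `RUNNING`, `TERMINATED`, and `ERROR` (as returned in the `state` field of `GET /api/2.0/clusters/get`), and the helper name `wait_until_running` is hypothetical. The demo feeds it a canned sequence of states rather than calling a live workspace.

```python
import time


def wait_until_running(get_state, poll_seconds=30, timeout_seconds=900,
                       sleep=time.sleep):
    """Poll get_state() until it returns "RUNNING".

    get_state is any zero-argument callable returning the cluster's state
    string. Raises RuntimeError if the cluster stops or fails, and
    TimeoutError if it is not running within timeout_seconds.
    """
    waited = 0
    while True:
        state = get_state()
        if state == "RUNNING":
            return state
        if state in ("TERMINATING", "TERMINATED", "ERROR"):
            raise RuntimeError(f"cluster entered state {state}")
        if waited >= timeout_seconds:
            raise TimeoutError(f"cluster still {state} after {waited}s")
        sleep(poll_seconds)
        waited += poll_seconds


# Demo with canned states (an assumption standing in for real API calls).
states = iter(["PENDING", "PENDING", "RUNNING"])
result = wait_until_running(lambda: next(states), sleep=lambda seconds: None)
print(result)
```

Passing the state lookup in as a callable keeps the loop independent of how the state is fetched, so it can be exercised without network access.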




Step 7: You can click on Spark UI to find more details about the cluster





ClusterOne is created on the AWS cloud service



Additional Info:
Please note: In the trial version, if the cluster is inactive for 2 hours or more continuously, it is terminated automatically, and you will need to create the cluster again.



Next Step: Create a Notebook


