Technical: How to create table and load data in Databricks from a file (in CSV format or any structured)? (Approach 2)

Step 1: Login to Databricks account

URL : https://community.cloud.databricks.com/login.html

Step 2: You will get “Welcome to databricks” screen

Step 3: Go and create “Cluster”, if it is not running.

(Please see “How to create a new Cluster in Databricks?” blog for further details)

Step 4: Screenshot of sample Employee CSV file.

Step 5: Click on “Data” and “AddData” button.

Step 6: Click on “Browse” link to choose the CSV file.

Step 7: You will get below screen with selected CSV file.

Step 8 : Click on “Create Table with UI” and then select “Cluster”.

Step 9: Click on “Preview Table”. You will get below screen.

Step 10: Give a new table name (if it already exists), then select “First row is header” and “Infer Schema”. These 2 options will be immediately reflected in “Table Preview”.

Step 11: Click on “Create Table”. This will create a new table and will show the schema and sample data in a new screen as shown below.

PS: You can use this table in other notebooks, because table is created permanently.

Technical

Thursday, December 27, 2018

How to create table and load data in Databricks from a file (in CSV format or any structured)? (Approach 2)

No comments:

Post a Comment