Azure Databricks for Data Engineers - Project on Formula 1 Racing
A collection of sample Spark notebooks for Azure Databricks.

Notebook | Description | Lang |
---|---|---|
Mount Setup | Configuration for storage and mount | Python |
Data Ingestion: CSV - Databricks - Circuits | Ingest the circuits data from CSV into a Databricks cluster, transform it on the cluster, and write the transformed data to Parquet as processed data | Python |
Data Ingestion: CSV - Databricks - Races | Ingest the races data from CSV into a Databricks cluster, transform it on the cluster, and write the transformed data to Parquet as processed data | Python |
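
The ingestion notebooks listed above follow a read, transform, write pattern. The snippet below is a minimal sketch of that pattern, not the code from the notebooks themselves. The function names, the `raw` and `processed` folder layout, the `ingestion_date` audit column, and the `/mnt/formula1` mount point are all assumptions made for illustration.

```python
def build_paths(mount_point, dataset):
    """Return (raw CSV path, processed Parquet path) for one dataset.

    The raw/processed folder layout is an assumption, not taken from the notebooks.
    """
    base = mount_point.rstrip("/")
    return f"{base}/raw/{dataset}.csv", f"{base}/processed/{dataset}"


def ingest_csv_to_parquet(spark, source_path, target_path, schema_ddl, renames):
    """Read a CSV with an explicit schema, rename columns, and write Parquet."""
    # Imported lazily so build_paths can be used without Spark installed.
    from pyspark.sql.functions import current_timestamp

    # An explicit schema avoids a second pass over the file for type inference.
    df = spark.read.option("header", True).schema(schema_ddl).csv(source_path)

    # Apply the caller-supplied column renames one by one.
    for old_name, new_name in renames.items():
        df = df.withColumnRenamed(old_name, new_name)

    # Hypothetical audit column recording when the row was ingested.
    df = df.withColumn("ingestion_date", current_timestamp())

    # Overwrite so that re-running the notebook replaces the previous output.
    df.write.mode("overwrite").parquet(target_path)
    return df


# Usage inside a Databricks notebook, where `spark` is already defined.
# The mount point and column names here are hypothetical:
#
#   src, dst = build_paths("/mnt/formula1", "circuits")
#   ingest_csv_to_parquet(
#       spark, src, dst,
#       schema_ddl="circuitId INT, name STRING, country STRING",
#       renames={"circuitId": "circuit_id"},
#   )
```

Passing the schema as a DDL string keeps the sketch short; a `StructType` would work equally well if the notebooks define one.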

Bug reports and pull requests are welcome on GitHub at https://github.com/UcheIgbokwe/FormulaOneDataETL