This 10-day course offers a comprehensive, hands-on introduction to Azure's data platform and key services, with a focus on Azure Databricks, Delta Lake, and data pipeline creation. Participants will gain a deep understanding of Azure's core services, including storage, data management, and redundancy, followed by a dive into Azure SQL, NoSQL databases like Cosmos DB, and advanced analytics services such as Azure Synapse Analytics and Azure Data Factory. The course covers the essentials of Spark and Databricks, from architecture to execution plans, alongside advanced topics like Lakehouse and Medallion architecture, and the benefits of Delta Lake for data management and transformation. Labs throughout the course offer practical experience in accessing data from Azure Data Lake, building data pipelines, and working with Databricks workflows and Unity Catalog. By the end of the course, participants will be proficient in leveraging Azure's powerful data services for real-time analytics, ETL processes, and building scalable, reliable data solutions.
Module 1: Azure Platform Fundamentals
Introduction to Azure
Azure Regions and AZ’s
Redundancy in Azure
Azure Portal
Azure Resources and Compute Options
Configuration and Management
Azure Storage – BLOB and ADLS Gen 2
Lab: Tour of Azure Portal, Resources, Creating a Resource Group
Lab: Working with Azure Storage, Redundancy Options, Accessing data files, Storing data Files
Module 2: Overview of Azure Data Services
Azure SQL
World of NoSQL – Cosmos DB
Azure Synapse Analytics
Azure Data Factory
Azure Stream Analytics and Event Hubs
ADLS
Lab: ADF and Synapse Analytics – Quick Demo
Module 3: Spark, Databricks Essentials & Architecture Deep Dive