PG Program in Azure Data Engineering

Overview

SQL, Python, PySpark, Databricks, Azure Cloud Services (Azure Key Vault, Blob Storage, ADLS Gen2), and Azure Data Factory (ADF)

Curriculum

Module 1: SQL for Data Analysis & Engineering

  • Introduction to Relational Databases
  • Writing Basic to Complex SQL Queries
  • Data Cleansing & Filtering Techniques
  • Query Optimization and Best Practices

Module 2: Python for Data Engineering

  • Python Fundamentals & Data Structures
  • Working with Files and APIs
  • Error Handling, Functions & Modules
  • Introduction to Pandas & Data Manipulation
  • Writing Production-Ready Scripts

Module 3: Big Data Processing using PySpark

  • Introduction to Spark Architecture
  • DataFrames, RDDs, and Lazy Evaluation
  • Transformations, Actions, and UDFs
  • Managing Large-Scale Data Workloads
  • Performance Tuning in PySpark

Module 4: Azure Cloud Services for Data Engineering

  • Overview of Azure Architecture
  • Azure Storage Services (Blob, ADLS Gen2)
  • Azure Key Vault & Secrets Management
  • Introduction to Azure Synapse & Data Lakes
  • Cost Management & Resource Scaling

Module 5: Azure Databricks for Scalable Data Processing

  • Databricks Environment Setup and Configuration
  • Notebooks, Workflows, and Delta Lake
  • Real-Time and Batch Data Processing
  • Cluster Management & Job Scheduling
  • CI/CD Integration with Git & DevOps

Module 6: Orchestrating Workflows using Azure Data Factory

  • Building & Managing ETL Pipelines
  • Linked Services, Datasets, and Parameters
  • Triggers, Pipeline Schedules & Dependency Handling
  • Data Flow Design Patterns
  • Monitoring, Logging, and Debugging Pipelines

Ready to start your journey?

Enroll now and master Azure Data Engineering with hands-on, expert-led training.