Overview
This course prepares professionals for the official Databricks Certified Data Engineer Associate certification exam, covering the concepts, practices, and labs needed to work in data engineering on the Databricks platform. The training ranges from the fundamentals of the Lakehouse architecture to implementing data pipelines with Delta Lake, Spark SQL, DataFrames, and automated workflows.
Throughout the course, participants build practical skills aligned with the official exam objectives, working through exercises and practice exams to maximize their chances of passing the certification.
Course Content
Module 1: Databricks Lakehouse Fundamentals
- Introduction to Databricks Platform
- Lakehouse Architecture
- Data Engineering Concepts
- Databricks Workspace Overview
- Clusters and Compute Resources
- Databricks Runtime
- Workspace Administration Basics
Module 2: Working with Databricks Notebooks
- Notebook Fundamentals
- Multi-Language Support
- Python in Databricks
- SQL in Databricks
- Notebook Workflows
- Magic Commands
- Parameterization Techniques
Module 3: Spark DataFrame Fundamentals
- DataFrame Architecture
- Reading Data Sources
- Transforming Data
- Filtering and Aggregation
- Joining DataFrames
- Working with Nested Data
- DataFrame Best Practices
Module 4: Spark SQL Fundamentals
- SQL Warehouses
- Query Execution
- SQL Functions
- Views and Temporary Views
- Advanced Queries
- Data Manipulation Operations
- Query Optimization Basics
Module 5: Delta Lake Essentials
- Introduction to Delta Lake
- Delta Table Creation
- ACID Transactions
- Time Travel
- Schema Enforcement
- Schema Evolution
- Delta Lake Best Practices
Module 6: Data Ingestion and ETL
- Batch Data Processing
- Incremental Data Loads
- Auto Loader Fundamentals
- ETL Pipeline Development
- Data Quality Validation
- Error Handling
- Pipeline Monitoring
Module 7: Data Transformation Techniques
- Data Cleansing
- Data Enrichment
- Window Functions
- Complex Transformations
- Incremental Processing
- Performance Considerations
- Reusable Transformation Patterns
Module 8: Streaming Data Processing
- Structured Streaming Fundamentals
- Streaming Sources
- Streaming Sinks
- Watermarking
- Checkpointing
- Trigger Configuration
- Streaming Monitoring
Module 9: Data Management and Governance
- Unity Catalog Fundamentals
- Catalog Structure
- Schemas and Tables
- Data Permissions
- Governance Best Practices
- Lineage Overview
- Security Fundamentals
Module 10: Workflow Orchestration
- Databricks Jobs
- Task Dependencies
- Scheduling Workflows
- Notifications and Alerts
- Monitoring Executions
- Troubleshooting Jobs
- Operational Best Practices
Module 11: Performance Optimization
- Partitioning Strategies
- File Management
- Caching Techniques
- Query Optimization
- Delta Optimization Commands
- Performance Monitoring
- Cost Optimization
Module 12: Certification Exam Preparation
- Certification Exam Overview
- Exam Domains Review
- Sample Questions Analysis
- Scenario-Based Exercises
- Practice Tests
- Exam Strategies
- Final Review Session
Hands-On Labs
Lab 1: Creating and Managing Databricks Clusters
- Cluster Deployment
- Runtime Configuration
- Resource Management
Lab 2: Building Data Pipelines
- Data Ingestion
- Data Transformation
- Data Validation
Lab 3: Delta Lake Implementation
- Delta Table Creation
- Time Travel Operations
- Schema Evolution
Lab 4: Streaming Data Pipeline
- Structured Streaming Setup
- Checkpoint Configuration
- Stream Monitoring
Lab 5: Workflow Automation
- Job Creation
- Scheduling
- Dependency Management
Lab 6: Performance Tuning
- Query Optimization
- Data Layout Optimization
- Cluster Performance Analysis
Lab 7: End-to-End Data Engineering Project
- Ingest Raw Data
- Build Bronze Layer
- Build Silver Layer
- Build Gold Layer
- Create Automated Workflows
- Implement Data Governance
- Monitor and Optimize Performance
Lab 8: Certification Mock Exam
- Full Practice Exam
- Annotated Answer Review
- Review of Critical Topics
- Strategies for Passing the Official Databricks Certified Data Engineer Associate Exam