Databricks Certified Data Engineer Associate
video

by Alfredo Deza, Noah Gift
December 2023
Intermediate
2h 24m
English
Pragmatic AI Labs
Closed Captioning available in German, English, Spanish, French, Italian, Japanese

Overview

Course 1: Databricks Lakehouse Platform

Description

Learn foundational Databricks capabilities including compute, storage, notebooks, and jobs to build scalable data solutions.

Learning Objectives

  • Create clusters and configure runtime environments
  • Perform exploratory analysis with notebooks
  • Schedule and monitor multi-task workflows

Course 2: Databricks SQL

Description

Master Spark SQL for reading, transforming, and loading data at scale. Learn techniques like data validation, custom business logic, and slowly changing dimensions.

Learning Objectives

  • Query data in notebooks with Spark SQL
  • Handle complex data types
  • Apply data quality rules
  • Implement slowly changing dimensions

Course 3: Databricks ML

Description

Build ML models with Python and Scala APIs in Databricks. Learn best practices for feature engineering, hyperparameter tuning, and model evaluation.

Learning Objectives

  • Engineer features from raw data
  • Tune models with cross-validation
  • Evaluate model performance
  • Operationalize models with MLflow

Course 4: Databricks Data Engineering

Description

Architect reliable and performant data infrastructure with Delta Lake, streaming, and autoscaling.

Learning Objectives

  • Implement ACID transactions
  • Build streaming ETL solutions
  • Autoscale infrastructure to meet SLAs
  • Migrate data warehouses to a lakehouse

Course 5: Workloads with Jobs

Description

Orchestrate workloads using multi-task Jobs with configurable scheduling, dependencies, and error handling.

Learning Objectives

  • Schedule notebooks, jobs, and pipelines
  • Set dependencies across tasks
  • Handle and retry failures
  • Monitor runs using the Jobs UI

Course 6: Data Access with Unity Catalog

Description

Provide governed data access across cloud storage services such as ADLS, S3, and GCS using Unity Catalog.

Learning Objectives

  • Deploy Unity Catalog
  • Manage credentials securely
  • Apply object-level security
  • Query data from storage tiers
Publisher Resources

ISBN: 12212024VIDEOPAIML