Microsoft Fabric modules and learning paths (DP-600, DP-700, others)

Learn.microsoft.com links

Microsoft Fabric Analytics Engineer (DP-600)
Implement data engineering solutions using Microsoft Fabric (DP-700)

My Samples

https://tinyurl.com/cbmctsamples

Terminology

https://learn.microsoft.com/en-us/fabric/get-started/fabric-terminology

Experience, Workload

"Experience: A collection of capabilities targeted to a specific functionality. The Fabric experiences include Fabric Data Warehouse, Fabric Data Engineering, Fabric Data Science, Real-Time Intelligence, Data Factory, and Power BI."

"Workloads expand functionality in Fabric. Users with the relevant permissions can add workloads and make them available either to the entire tenant or to a specific capacity."

Workloads are collections of frontend and backend functions. The workloads installed in your Fabric account determine what items are available to create. In the Fabric admin center, use the Workloads item in the Navbar to see the Workloads that are currently installed. The Create item in the Navbar groups items by Workload (the +New Item button on the menubar does not do this). In item lists (for example in a Workspace) the Filter option at the top-right allows filtering by Workload. However, some items are in different Workloads in the create screen and the filter screen (for example, Notebooks).

Experiences are marketing terms for functionality in Microsoft Fabric. As far as I can see, the term "Experience" is not used by the current Fabric admin center, just by the learn.microsoft.com articles.

Admin Centers

The Fabric admin center (app.fabric.microsoft.com) and the Power BI admin center (app.powerbi.com) are the same tool. The bottom button on the Navbar allows you to change between the two UIs.

Data Stores

https://learn.microsoft.com/en-nz/fabric/get-started/decision-guide-data-store

https://learn.microsoft.com/en-nz/fabric/get-started/decision-guide-lakehouse-warehouse

Note that Microsoft have a number of other data services, including Azure SQL Database (as well as PaaS services for HorizonDB, PostgreSQL, MySQL, Cassandra, MongoDB, and others), Azure Cosmos DB, Databricks, Azure Synapse Analytics (which is no longer receiving feature updates), Microsoft Dataverse (which hardly anyone knows about), Azure Managed Redis, Azure Analysis Services, and probably others I've missed.

 

General Modules

 

Introduction to end-to-end analytics using Microsoft Fabric

DP-600 LP 1 & 2, DP-700 LP 2 & 5

Key point: Jack of all trades SaaS big data services. Discussion is ongoing about "master of none".

Note that there is only one OneLake per tenant. This is to avoid data silos. Also note that the Fabric tenant is just the Azure AD tenant.

Workload/Experience discussion goes here. Then Workspace, Item, Workload Filter.

 

Administer a Microsoft Fabric environment

DP-600 LP 1 & 4, DP-700 LP 5

Delivery hint: Present this module early!

Capacities reside in a tenant. It is a pool of resources. It's size (and of course price) determines how much processing power the pool has. Additionally, some features and services are gated behind capacity levels.

Workspaces reside in a capacity. Workspaces have a license, which affects what types of capacity they can be created in. Your user account has a license, which affects what types of workspaces you can view and what items you can access. Yes, I'm confused too.

Every tenant gets a shared capacity. Everyone's My Workspace workspaces are created in here by default, as are Pro and Premium Per User license mode workspaces.

Items reside in a workspace.

Endorsement, promotion and certification are a key feature of Fabric governance. Exam hint.

 

Orchestrate processes and data movement with Microsoft Fabric

DP-600 LP 1, DP-700 LP 1 & 2

Fabric Pipelines (the topic of this module) are built on Azure Data Factory (which is still available as a standalone PaaS service).

 

Ingest Data with Dataflows Gen2 in Microsoft Fabric

DP-600 LP 1, DP-700 LP 1 & 2

Dataflow Gen 2 is Power Query in Fabric; Dataflow Gen 1 is Power Query in Power BI. There is no upgrade from gen 1 to gen 2.

Core point: A dataflow includes all of the transformations needed to reduce data prep time. A pipeline can run, loading data into a new table. It can also be a data source for other tools (though limited, not supporting row-level security for example).

They are a little bit like a DFT in SSIS.

Key terms: easy, reusable ETL.

 

Get started with data science in Microsoft Fabric

DP-600 LP 1

Data Analysts use visualisations to find patterns. Data Scientists train AIs to find them. :-)

The four models (classification, regression, clustering, forecasting) are usually mutually exclusive. That is, one AI will follow one model.

Note the first step of the process - "Define the problem"!

Some of the phrasing in this module is strange. I'm not sure exactly what the authors are trying to say in a few places.

 

Secure data access in Microsoft Fabric

DP-600 LP 4, DP-700 LP 5

 

Govern data in Microsoft Fabric with Purview

DP-600 LP 4

 

Implement continuous integration and continuous delivery (CI/CD) in Microsoft Fabric

DP-700 LP 5

 

Monitor activities in Microsoft Fabric

DP-700 LP 5

 

Warehouse modules

 

Get started with data warehouses in Microsoft Fabric

DP-600 LP 1 & 2, DP-700 LP 4

Note the explanation of having two (or more) keys.

https://en.wikipedia.org/wiki/Slowly_changing_dimension

Data warehouses often don't enforce primary key and foreign key constraints, usually for performance reasons. Additionally, we often trust that the operational sources are enforcing these.
https://www.reddit.com/r/dataengineering/comments/122eo0k/why_do_data_warehouses_dont_have_a_primary_key/

CREATE TABLE AS CLONE OF supports AS OF for 30 days. Note that lots of things are cloned (e.g. constraints) - not just data. Compare with SELECT TOP(0) * INTO.

When you create a Warehouse, Fabric used to create a default semantic model - this feature was sunsetted in October 2025.

DP-600 slide deck 2: Review question "What is a dataset in the data warehouse experience?" is using old terminology. The term used now is just "semantic model".

 

Load data into a Microsoft Fabric data warehouse

DP-600 LP 2, DP-700 LP 4

Ingestion = often used when moving raw data into a central repository.
Load = often used when moving and transforming data into final storage.

Staging = a temporary storage location, a buffer, sometimes an abstraction layer.

Typically we load dimension tables then fact tables, though it often doesn't matter because data warehouses don't enforce relational integrity.

Note the distinction between a business key (from an operational source) and a surrogate key (from this data warehouse).

There are 7 types of slowly changing dimension. Exam hint.

COPY INTO has a couple of party tricks - errorfile, wildcards.

DP-600 slide deck 2: Review question "What are the four data ingestion options available in Microsoft Fabric for loading data into a data warehouse?" is just dodgy. What do "cross-warehouse" and "cross-platform" mean in this context?

 

Query a data warehouse in Microsoft Fabric

DP-600 LP 2, DP-700 LP 4

Queries typically JOIN fact and dimension tables (with snowflake models requiring more JOINs that star models), aggregate values in fact tables, and GROUP BY / ORDER BY on columns in dimension tables.

ROW_NUMBER, RANK, DENSE_RANK, NTILE. Exam hint.

DP-600 slide deck 2: None of the answers are correct for review question "What is the purpose of a fact table in a data warehouse?".

 

Monitor a Microsoft Fabric data warehouse

DP-600 LP 2, DP-700 LP 4

The Microsoft Fabric Capacity Metrics App is a Power BI app and is installed into your Power BI environment (from app.powerbi.com).

The query insights DMVs (queryinsights.exec_requests_history, queryinsights.long_running_queries, queryinsights.frequently_run_queries) require a premium capacity.

The query insights DMVs are also available in a lakehouse, via the SQL analytics endpoint.

 

Secure a Microsoft Fabric data warehouse

DP-600 LP 2 & 4, DP-700 LP 4

Dynamic data masking is great, but note that is is a hiding tool, not a security tool. It's primary purpose is hiding data from people looking over your shoulder at your monitor.

Tow level security using a function is a lot more complex to set up and troubleshoot than using views, but is more powerful and flexible.

Delivery hint: I should be able to show these things in my local copy of SQL Server.

 

Lakehouse modules

 

Get started with lakehouses in Microsoft Fabric

DP-600 LP 1, DP-700 LP 2

 

Use Apache Spark in Microsoft Fabric

DP-600 LP 1, DP-700 LP 1 & 2.

This is just an overview. There is a world of complexity in Apache Spark.

Discuss Databricks here.

 

Work with Delta Lake tables in Microsoft Fabric

DP-600 LP 1, DP-700 LP 2

Not in the DP-600 slide deck.

 

Organize a Fabric lakehouse using medallion architecture design

DP-700 LP 2

 

Power BI modules

 

Create DAX calculations in semantic models

DP-600 LP 3, PL-300 LP3

Calendar-based time intelligence functions (preview) don't need a date table, though Microsoft still recommend a dedicated date table..
https://learn.microsoft.com/en-us/power-bi/transform-model/desktop-time-intelligence#calendar-based-time-intelligence-preview

 

Design scalable semantic models

DP-600 LP 3

 

Optimize a model for performance in Power BI

DP-600 LP 3, PL-300 LP3

 

Create and manage Power BI assets

DP-600 LP 3

 

Enforce Power BI model security

DP-600 LP 3

 

Real-time Intelligence (RTI) modules

Note that RTI was removed from the DP-600 exam domain in early 2025.

When doing real-time and/or distributed processing, there is a fundamental tradeoff we must always consider - consistency vs availability vs partition tolerance (the CAP theorem).

 

Get started with Real-Time Intelligence in Microsoft Fabric

DP-600 LP 1, DP-700 LP 3

Eventstream = a processing engine that does ingestion and transformation.
Eventhouse = a place to store real-time data.

 

Use Eventstream in Microsoft Fabric

DP-700 LP 1 & 3

 

Work with real-time data in an Eventhouse in Microsoft Fabric

DP-700 LP 1 & 3

Fun with temporal windows! Tumbling, sliding, hopping, oh my!

 

Create Real-Time Dashboards with Microsoft Fabric

DP-700 LP 3

Real-time dashboards need to be enabled at the tenant level.

A question I can't find the answer to: WHat is the purpose of Real-time Dashboards? Who is the intended audience? Do Microsoft intend people to use the app.fabric.microsoft.com tool to do actual BI stuff?

 

Use Activator in Microsoft Fabric

DP-700 LP 3

Activator = automated processing of real-time events.