Transform data with DataFrames in Apache Spark Pools in Azure Synapse Analytics
Learn how to transform data using DataFrames in Apache Spark Pools in Azure Synapse Analytics.
Data Engineer
Synapse Analytics
Module Objectives
After completing this module, you will be able to:
- Understand DataFrames in Spark Pools in Azure Synapse Analytics
- Load data into a Spark DataFrame
- Create a Spark table
- Read data from and write data to a storage account
- Load a streaming DataFrame into Apache Spark
- Flatten nested structures and explode arrays with Apache Spark
Units
Prerequisites
Before taking this module, we recommend that you complete the following modules:
- Data Fundamentals
- Introduction to Azure Data Factory
- Introduction to Azure Synapse Analytics