Transform data with DataFrames in Apache Spark Pools in Azure Synapse Analytics

Transform data with DataFrames in Apache Spark Pools in Azure Synapse Analytics

Learn how to transform data using DataFrames in Apache Spark Pools in Azure Synapse Analytics.

Data Engineer
Synapse Analytics

Module Objectives

After completing this module, you will be able to:

  • Understand DataFrames in Spark Pools in Azure Synapse Analytics
  • Load data into a Spark DataFrame
  • Create a Spark table
  • Write Data to and from a storage account
  • Load a streaming DataFrame into Apache Spark
  • Flatten nested structures and explode arrays with Apache Spark

Prerequisites

Before taking this module, it is recommended that you complete the following modules:

  • Data Fundamentals
  • Introduction to Azure Data Factory
  • Introduction to Azure Synapse Analytics