Apache Spark Fundamentals
placeVeenendaal 13 jul. 2026 tot 14 jul. 2026Toon rooster event 13 juli 2026, 09:00-16:00, Veenendaal event 14 juli 2026, 09:00-16:00, Veenendaal |
placeVeenendaal 13 aug. 2026 tot 14 aug. 2026Toon rooster event 13 augustus 2026, 09:00-16:00, Veenendaal event 14 augustus 2026, 09:00-16:00, Veenendaal |
Meer weten over de onderwerpen die aan bod komen en de vereiste voorkennis? Neem vrijblijvend contact met ons op.
Get started processing data with Apache Spark and PySpark
With the rise of cloud computing, distributed storage and (big) data processing, many organisations are starting to use Apache Spark for their data processes. Whether it is for data science, data analysis or data engineering, Apache Spark can be the right tool for the job. It is a foundation under Azure Synapse Analytics, Microsoft Fabric and Databricks.
This training aims to walk you through the fundamentals of working with Apache Spark, starting with what it is and how it works. You will then continue to read, transform and write data using PySpark.
Finally, to make sure your code can be safely used in production, there will be an a…
Er zijn nog geen veelgestelde vragen over dit product. Als je een vraag hebt, neem dan contact op met onze klantenservice.
Meer weten over de onderwerpen die aan bod komen en de vereiste voorkennis? Neem vrijblijvend contact met ons op.
Get started processing data with Apache Spark and PySpark
With the rise of cloud computing, distributed storage and (big)
data processing, many organisations are starting to use Apache
Spark for their data processes. Whether it is for data science,
data analysis or data engineering, Apache Spark can be the right
tool for the job. It is a foundation under Azure Synapse Analytics,
Microsoft Fabric and Databricks.
This training aims to walk you through the fundamentals of working
with Apache Spark, starting with what it is and how it works. You
will then continue to read, transform and write data using
PySpark.
Finally, to make sure your code can be safely used in production,
there will be an added focus on using development best
practices.
What is Spark, where did it come from, why was it created? And how
does it work?
Lessons
- History of Apache Spark
- Technical Architecture (Driver, Cluster Manager, Executors)
- RDD and Dataframe
- Pyspark
- Benefits of using Spark
- Running Spark locally
After completing this module, students will be able to:
- Explain how Spark works
To work with data, we first need to retrieve it from wherever it is
located. This is done through spark.read.
Lessons
- spark.read
- read options…
Er zijn nog geen veelgestelde vragen over dit product. Als je een vraag hebt, neem dan contact op met onze klantenservice.

