Course Kingdom

- Course -

Introduction to Python for Big Data Engineering with PySpark



IT & Software

10 November, 2020

Learn how to use Python and Spark 3.0 (PySpark) for Data Engineering and Data Analytics on Big Data Cloud Platforms

$89.00 FREE

The key objectives of this course are as follows;

  • Learn Spark Architecture

  • Learn Spark Execution Concepts

  • Learn Spark Transformations and Actions using the Structured API

  • Learn Spark Transformations and Actions using the RDD (Resilient Distributed Datasets) API

  • Learn how to set up your own local PySpark Environment

  • Learn how to interpret the Spark Web UI

  • Learn how to interpret DAG (Directed Acyclic Graph) for Spark Execution


The Python Spark project that we are going to do together;

Sales Data

  • Create a Spark Session

  • Read a CSV file into a Spark Dataframe

  • Learn to Infer a Schema

  • Select data from the Spark Dataframe

  • Produce analytics that shows the topmost sales orders per Region and Country



Join us on Telegram



Join our Udemy Courses Telegram Channel



Enroll Now

Subscribe us on Youtube