This course runs for 5 days.
The class will run daily from 10 AM ET to 6 PM ET.
Class Location: Virtual Live Classroom (live, instructor-led).
This hands-on Data Engineering Bootcamp teaches attendees the foundations of data engineering using Python and Spark SQL. Students learn how to build production-ready data-driven solutions and gain a comprehensive understanding of data engineering.
Big Data Concepts and Systems Overview for Data Engineers
Defining Data Engineering
Data Processing Phases
Python 3 Introduction
Python Variables and Types
Control Statements and Data Collections
Functions and Modules
File I/O and Useful Modules
Practical Introduction to NumPy
Practical Introduction to pandas
Data Grouping and Aggregation with pandas
Repairing and Normalizing Data
Data Visualization in Python
Python as a Cloud Scripting Language
Introduction to Apache Spark
The Spark Shell
Spark RDDs
Parallel Data Processing with Spark
Introduction to Spark SQL
Lab Exercises