About the course
This course focuses on the process used to create usable data for downstream analysis. You'll analyze and compare available technologies in order to make informed decisions as data engineers. You'll also learn how to run a data processing workflow through several data stack platforms and design a data pipeline for a business-use case.
What You’ll Learn
- How to use Spark for batch and streaming processing
- How to use Kafka for low-latency and real-time processing
- Data acquisition and modeling techniques
- Workflow orchestration and automations
- The pros and cons of available technologies
- Pipeline design and integration
This course is part of a certificate program. You can take this course without enrolling in the certificate program, but it won't automatically count toward earning the certificate. To apply to the full certificate program instead, visit the Certificate in Big Data Technologies page.
Jason Kolter is an experienced software engineer and architect dedicated to helping people get the most out of their data. He's currently working with Foster America, a nonprofit group focused on transforming the foster care system, where he's responsible for leading innovative reform projects us...
Because of COVID-19, many providers are cancelling or postponing in-person programs or providing online participation options.
We are happy to help you find a suitable online alternative.