About the course
Using real data sets from open source data repositories such as data.gov, the DC Open Data Catalog, and Kaggle.com, students create a data product to address a real-world problem. Students work in a data science team to apply the data science pipeline (data ingestion, data munging and wrangling, computation and analysis, modeling and application, and reporting and visualization) to a real-world problem or issue. The course involves the practical application and presentation of concepts and tools learned during the core courses. All completed pieces of the project will be hosted online to help students build a data science project portfolio.
Upon successful completion of the course, students will:
- Apply the knowledge, skills, and abilities applicable to the data science pipeline to a real world problem and data set
- Work in a data science team to create a data product
- Present a completed project and product to faculty and peers
- Build a data science project portfolio
Kyle is a budding analyst in the field of data science. Having graduated from the Georgetown SCS Data Science certificate in 2015, his capstone data product was a flight recommender application. Currently he is a researcher and faculty at District Data Labs, focusing on the spring 2016 topic of e...
Benjamin Bengfort is an experienced data scientist and software engineer who focuses on implementing data products that can learn from real-time streaming data. Benjamin is the program director of the Georgetown Data Science Certificate program where he also teaches Machine Learning. He is also ...
Because of COVID-19, many providers are cancelling or postponing in-person programs or providing online participation options.
We are happy to help you find a suitable online alternative.