In this course students will focus on the data science pipeline including problem formulation, data cleaning and preprocessing, exploration of data with visualization, model prediction and inference for decision making. Students will use different software tools and programming for each step of the data science pipeline, include data exploration and transformation, algorithms for machine learning concepts such as classification, regression, and clustering. In addition, students will learn how to effectively present any findings to an audience.