Data Science along with artificial intelligence (AI) and its various components such as statistical learning (SL), machine learning (ML) and deep learning algorithms (DL) are recognized as main drivers of organizational value creation. According to Dr Jim Gray, Data Science is the fourth paradigm which drives innovative solutions to organizational problems.
In this course we will start with basic concepts in probability such as joint and conditional probabilities. We will discuss the implementation of these concepts in ML algorithms for Market Basket Analysis and Recommender Systems. After covering basic probability concepts, we move on to random variables, discrete and continuous probability distributions, sampling, estimation and central limit theorem.
An important step in ML model building is feature selection to avoid overfitting and underfitting. ML models such as regression and logistic regression use hypothesis testing to select features. We will discuss various hypothesis tests and how they are used in feature selection.
Every ML model has an optimization stage, either to fine-tune the feature weights, or to find an optimal set of features. We will discuss important optimization techniques, and algorithms such as Gradient Descent, that play an important role in AI and ML model development.
Data must be represented in a matrix for AI and ML model development. Matrix operations such as matrix inverse and multiplication are elementary steps in model development. These fundamental concepts in linear algebra will be discussed.
This course is suitable for students/practitioners interested in improving their knowledge in the fundamental concepts of Data Science. The course will also prepare the learner for a career in the field of Data Analytics.