ABOUT THE COURSE:The proposed course aims to provide an understanding of the data analysis techniques and visualization tools for analysis of biological data sets. The course will use the statistical programming language R and introduce requisite packages for such analyses and visualization. The course will cover the theoretical understanding of the methods of analyses and will also provide hands-on demonstration on real biological datasets. The students will thus be able to gain exposure to data analysis and visualization tools which will be directly applicable to real-world datasets.INTENDED AUDIENCE: UG/PG/PhD studentsPREREQUISITES: Should be at least at the UG level withknowledge of probability and statisticsINDUSTRY SUPPORT: Companies working in the domain of biological data science and bioinformatics
Overview
Syllabus
Week 1:Introduction and set up for biological data analysis with RWeek 2:Basic statistical analysis and data visualization techniquesWeek 3:Bioconductor packagesWeek 4:Gene expression analysis and co-expression networkWeek 5:Analysis of ChIP-seq data in RWeek 6:Regression models on biological dataWeek 7:Dimensionality reduction techniquesWeek 8:Decision trees and Random Forest
Taught by
Prof. Riddhiman Dhar