Machine Learning for Biobank-Scale Genomic Data - CGSI 2022
Computational Genomics Summer Institute CGSI via YouTube
Overview
Syllabus
Intro
Machine learning for genomic data
Growth of Biobanks
Key inference problems
Genetic architecture of complex traits
Variance components model
Estimating variance components
Alternate estimator Method of Moments (HE-regression)
Randomized HE-regression (RHE) Work with a "sketch" of the genotype
RHE is accurate and scalable
Insights from applying RHE to UK Biobank
Dominance deviation effects
Dominance deviance effects
Gene-environment interactions (GxE)
Gene-gene interactions (GxG)
Beyond pair-wise effects
Random Fourier Features (RFF)
Missing data in Biobanks
Taught by
Computational Genomics Summer Institute CGSI