Overview
Syllabus
Multi-objective Bayesian Optimization with Heuristic Objectives for Biomedical and Molecular Data Analysis Workflows
Overview
Bayesian optimization in bioinformatics has been applied to protein design
Most bioinformatic analyses are unsupervised
A typical workflow involves many steps and parameters
Difficulty with defining objectives make AutoML challenging to apply in bioinformatics
AutoML approaches construct objectives for a given problem
Motivation for AutoGeneS constructed objectives
Our method automatically infers which objectives are useful to guide optimization
MOBO basics
We build on the random scalarizations approach that returns a subset of the Pareto front
We determine the region of the Pareto front using objective behaviours
Three examples of desirable behaviours
Toy data simulating useful and not useful objectives
Optimizing cofactor in the analysis of Imaging Mass Cytometry (IMC) data
We construct objectives for clustering workflow using pairs of co-expressed proteins
We construct two meta-objectives using expert annotations
Parameters selected by our method led to clusterings that agree with expert annotations
Quantitative evaluation of performance
Summary
Taught by
AutoML Seminars