Overview
Syllabus
Intro
A story we all know: Regular expressions
When's the last time you heard...?
Problem Statement: HTTP Proxy Logs
Machine Assisted Analysis
Two different types of machine learning
Supervised: Binary Classification
Classification With Random Forests
Generating synthetic abnormal data
Decision Trees
Unsupervised: Outlier Detection
Isolation Forests Liu, Ting, Zhao
A quick note about parameters
Classification With Isolation Forests
The beauty of scikit leam & python
Identifying Training & Test Data
Training, Testing & Evaluating a Model
Bonus: Most influential Features with
Analyzing Log Files
Bonus: Classifier Explanations with
Ideas for improvement
Taught by
Security Onion