Discover the latest advancements in scikit-learn 0.21 through this 45-minute EuroPython Conference talk. Explore the new implementation of Gradient Boosted Trees, focusing on the histogram-based approach for evaluating tree node split candidates. Learn how this update improves computational performance, making it competitive with specialized libraries like XGBoost and LightGBM. Gain insights into the efficient use of multi-core CPUs and compare the numba-based prototype (pygbm) with the final cython implementation. Understand the developer experience differences between numba and cython in the context of machine learning model development.
Overview
Syllabus
Olivier Grisel - Histogram-based Gradient Boosting in scikit-learn 0.21
Taught by
EuroPython Conference