Building and Operating an Open Source Data Science Platform

Toronto Machine Learning Series (TMLS) via YouTube

Overview

This workshop, led by Jörg Schad, Head of Machine Learning at ArangoDB, covers how to build and operate an open-source data science platform across the full deep learning pipeline, from exploratory analysis to model deployment and monitoring. Topics include enabling data scientists to develop models exploratively, automating distributed training and serving with CI/CD, deploying frameworks on different infrastructures, running multiple deep learning frameworks on a single cluster, storing and serving models at scale, tracking essential metadata, and monitoring pipeline performance. Participants build an end-to-end data analytics pipeline with tools such as TFX, Kubeflow, Airflow, Apache Spark, Jupyter Notebooks, TensorFlow, Jenkins, and Argo, working through pipeline orchestration, data preparation, distributed training, automation, model storage, serving, and monitoring. The session runs 2 hours and 57 minutes.

Syllabus

Jörg Schad - Workshop: Building and Operating an Open Source Data Science Platform

Taught by

Toronto Machine Learning Series (TMLS)
