Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore repository data mining techniques on GitHub in this conference talk from WeAreDevelopers Conference 2017. Dive into machine learning applications at GitHub, including text classification and convolutional networks. Learn about data preprocessing, distributional hypothesis, and the Stanza flow. Discover how GitHub leverages these technologies for improved collaboration and project management. Gain insights into the competition overview and architecture used for mining repository data. Understand the importance of GitHub in modern software development and how machine learning enhances its capabilities.
Syllabus
Introduction
GitHub
Why use GitHub
Machine Learning at GitHub
Airbnb
Topics
Competition
Overview
Data Preprocessing
Similar Words
Distributional Hypothesis
Machine Learning
StanzaFlow
Convolutional Networks
Classification
Text
Text Classification
Categories
Architecture
Conclusion
Collaboration with GitHub
Taught by
WeAreDevelopers