Overview
This 42-minute conference talk presents findings from an analysis of 1.1 billion GitHub events and 42 TB of code. It shows how to use Google BigQuery to query five years of GitHub metadata and open source code, and how to study community dynamics and code patterns for any programming language or project. The material is aimed at open source creators, users, and decision-makers who want data to inform their choices. Topics include top contributing companies, star metrics, project health indicators, contributions by country, and the effect of Hacker News on projects. The talk also covers code import patterns, the influence of Stack Overflow, and user-defined functions, and closes with questions about which projects are more successful and which programming languages developers favor.
Syllabus
Intro
What do you see
Who wants to analyze GitHub
Top companies contributing
The basics
BigQuery
Stars
Not all stars are equal
Can we trust the stars
How does Hacker News affect projects
What projects are more healthy
Super healthy community
Top countries
Color flow
Europe
Code
Import
Stack Overflow
How Many People Copy Stack Overflow
Requesting New Features
User-Defined Functions
Favorite Programming Language
Important Questions
What Projects Are More Successful
Stay Curious
Questions
Taught by
WeAreDevelopers