Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

What Can We Learn from 750 Billion GitHub Events and 42 TB of Code

Devoxx via YouTube

Overview

Explore the vast world of GitHub data in this 42-minute Devoxx conference talk. Dive into an analysis of 750 billion GitHub events and 42 TB of code to gain valuable insights into software development trends, open-source community dynamics, and effective project management strategies. Learn how to leverage this massive dataset to guide project design decisions, measure community health, and understand coding patterns over time. Discover techniques for running static code analysis at scale, evaluating the impact of social media on project popularity, and identifying the most effective ways to request changes. Gain a deeper understanding of your project's audience by examining who starred it and their other interests. Through live on-stage analysis, uncover fascinating insights about coding preferences, engagement patterns, and geographical distribution of contributions. Whether you're a developer, project manager, or data enthusiast, this talk offers a unique perspective on the collaborative nature of software development and the power of big data analysis in the open-source ecosystem.

Syllabus

Intro
Who wants to analyze GitHub
GitHub Stars
Not all projects are equal
What else are they interested in
Thank you and stars
Engagement
Text analysis
Size and countries
Top projects by country
Looking at code
Prototool
Java
Requesting features
Code analysis numerically
Conclusion

Taught by

Devoxx

Reviews

Start your review of What Can We Learn from 750 Billion GitHub Events and 42 TB of Code

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.