Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the vast world of GitHub data through a comprehensive analysis of 750 billion events and 42 TB of code. Dive into insights on software development trends, open source community dynamics, and coding patterns over time. Learn how to leverage this rich dataset to guide project design decisions, request features based on data, and measure community health. Discover the most effective ways to phrase change requests and understand the impact of social media on project popularity. Investigate who starred your project and their other interests. Gain practical knowledge on running static code analysis at scale and settle the age-old debate of tabs vs. spaces. Presented by Felipe Hoffa, a Google Developer Advocate, this talk offers a deep dive into the world of big data analysis using Google Cloud Platform tools, demonstrating how to extract valuable insights from one of the largest datasets of collaborative software development.
Syllabus
Intro
What do we see
Who wants to analyze GitHub
How GitHub events started
Google BigQuery
Comparing projects
Looking for stars
Looking at other projects
Text analysis
Country analysis
New Zealand
Weather
Code analysis
Stack Overflow
Go Query
Static Code Analysis
Questions
Query Analysis
Taught by
Devoxx