Overview
Learn web scraping with Python and Beautiful Soup in this 34-minute tutorial. Parse web pages into data files using BeautifulSoup, demonstrated through scraping graphics cards from NewEgg.com. Set up Anaconda, install necessary packages, and follow step-by-step instructions to build a scraper, test it, and create a CSV file. Gain practical skills in retrieving web pages, evaluating HTML, converting listings to line items, and using functions like "findAll". Perfect for data professionals looking to harness the power of web scraping and expand their data collection capabilities.
Syllabus
- Introduction
- Setting up Anaconda
- Installing Beautiful Soup
- Setting up urllib
- Retrieving the Web Page
- Evaluating Web Page
- Converting Listings into Line Items
- Using jsbeautiful
- Reading Raw HTML for Items to Scrape
- Building the Scraper
- Using the "findAll" Function
- Testing the Scraper
- Creating the .csv File
- End Result
Taught by
Data Science Dojo