Overview
Syllabus
- Intro & Overview
- Identifying HTML Structure for Scraping from Walmart
- Writing Python BeautifulSoup Code to Extract Info from Walmart.com
- Handling Dynamic Content
- Implementing Modified Request Headers to Avoid Detection (look more human when scraping)
- Parsing Complicated JSON Data (using LLMs to help)
- Extending our Code to Collect Info on Many Products (automating search)
- Improving our Code (avoiding duplicates, multiple search terms, using a queue, etc.)
- Setting Up Proxies with Bright Data (get around IP address blocks)
- Error Handling and Retries
- Automating actions on pages with Selenium
- Conclusion & Next Steps
Taught by
Keith Galli