Overview
Watch a live-coding talk from Conf42 JS 2024 that demonstrates advanced web scraping techniques and anti-ban strategies using Scrapoxy. The talk follows Isabella's data collection journey as a practical example: she starts by exploring Trekkie reviews, then builds a scraper with the Scrapy framework. Learn how to handle proxies and get past anti-bot systems, then move on to advanced techniques with Playwright and to deobfuscating source code. The 33-minute presentation offers hands-on demonstrations and practical insights for developers who want to build resilient web scraping systems.
Syllabus
Introduction to Scrapoxy
Isabella's Story: The Need for Data
Exploring Trekkie Review
Introduction to Scrapy Framework
Handling Proxy and Anti-Bot Systems
Advanced Techniques with Playwright
Deobfuscating Source Code
Conclusion and Final Thoughts
Taught by
Conf42