Project information
- Category: Web Scraper
- Target: Scrap GetYourGuide products
- Project URL: Github repository
Objective
To get the data! Having the data is the first part of everything behind including analysis/dashboard/automated alerts/actions etc.
Methodology
Scrapy, Selenium, Puppet, Requests, XPath, Pandas, Distributed programming.
Insight
Well. That's just a crawler I replicate the one I built for my previous company, and of course the most critical optimization part cannot be posted on Github :)