Speaker: Jon Reiter
There are myriad web scraping tools available in Python, spanning a broad range of use cases. At the same time, there are many surprising gaps in coverage. Further complicating matters, differences that look innocuous in a browser can have an outsized impact on the design of an automated browsing system. In this talk we survey a collection of common web scraping frameworks and work out a mapping from real-world use cases to packages. Along the way we address common questions like:
How do I choose among content parsers? What if a page is dominated by JavaScript or HTML5? If I'm going to control a browser, which one should I choose? Can I run this in the cloud with no access to a display? Can I download files?
About Jon:
Jon worked in finance as a derivatives trader for a long time before moving back to technology. He now heads a start-up that aims to recast and modernize the way the financial services industry processes market data.
Event Page: https://www.meetup.com/Singapore-Python-User-Group/events/249344900/
Produced by Engineers.SG