Build a Trial Court Records Scraper Using Ruby

joelc.io

4 points by Heavywater 3 years ago · 2 comments

HeavywaterOP 3 years ago

How it works. We will use the Ruby programming language and a few open source software tools (i.e., Nokogiri, Watir, Selenium, and ChromeDriver) to deploy a hidden ("headless") browser to the OECI case index.

mdaniel 3 years ago

I will never in my life understand why people go through all the trouble of booting up a headless browser, only then to slurp the HTML back across the WebDriver interface so they can _re-parse_ it using some rando library. Not only is that inefficient, it almost guarantees questions on r/webscraping or SO about "but I see some element in the browser, why is $random_library not parsing it the same as the browser?!11"
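
To make the alternative concrete, here is a minimal sketch of reading text straight from the live DOM through the driver, rather than passing `browser.html` to a second parser. The method name `live_row_texts` and the `"table tr"` selector are hypothetical (the thread does not show the OECI markup); the only thing assumed of `browser` is that it responds to Watir's `elements(css:)`, with each element responding to `text`.

```ruby
# Hypothetical selector; the real OECI page structure is not shown in the thread.
ROW_SELECTOR = "table tr"

# Ask the browser itself for the matching elements and read their text,
# so the values come from the same DOM the browser rendered
# (including anything JavaScript added after page load).
# `browser` is assumed to behave like a Watir::Browser.
def live_row_texts(browser, selector = ROW_SELECTOR)
  browser.elements(css: selector)
         .map { |element| element.text.strip }
         .reject(&:empty?)
end

# Possible usage with Watir (requires the watir gem and ChromeDriver;
# the URL is a placeholder, and headless options vary by Watir version):
#
#   require "watir"
#   browser = Watir::Browser.new(:chrome, headless: true)
#   browser.goto("https://example.com/cases")
#   puts live_row_texts(browser)
#   browser.close
```

The trade-off is that each `text` call is a round trip over WebDriver, so this can be slower on very large tables, but what you read is what the browser actually rendered.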
