WebScraper application icon

v1.4.6

watch video

WebScraper

Scrape data or archive content from a website.

Screenshots

  • WebScraper Screenshot 1
  • WebScraper Screenshot 2
  • WebScraper Screenshot 4
  • WebScraper Screenshot 3

WebScraper uses the Integrity v6 Engine to quickly scan a website, and can output the data (currently) as csv or json. The output can include various meta data, the entire content of each page (as text, html or markdown), extract data using a Regex pattern, and/or divs, spans, paras or dd's extracted by class or id.

Webscraper is new. Please use it for free and please get in touch with any requests, bug reports or observations.

System Requirements

Current version requires Mac OSX 10.8 or higher

 


What should I do with the downloaded file?

Open the .dmg file and find the application inside. If you want to keep using WebScraper, drag and drop it into your Applications folder. To keep it in your dock, right-click or click-and-hold on its dock icon and choose 'Keep in dock'.


Contributors

Developer: Shiela Dixon


Version History

Version 1.4.6 released Mar 2017

Version 1.4.4 / 1.4.5 released Mar 2017

Version 1.4.3 released Mar 2017

Version 1.4.2 released Feb 2017

Version 1.4.1 released Jan 2017

Version 1.4 released Jan 2017

  • Improvement to class helper window, handles clicks in the web preview pane, user can click to deeper pages (and back up) and displayed url and class list is updated accordingly
  • Improvement to handle properly the situation where user's classes or ids having the same name as one of the core fields like 'headings', 'description', 'title' etc.
  • Version 1.3 released Jan 2017

    Version 1.2 (no longer beta) released Jan 2017

    Version 1.1 (still beta) released Dec 2016

    Version 1.0.3 released Nov 2016

    Version 1.0.2 released Oct 2016

    Version 1.0.1 released June 2016

    Version 1.0. released May 2016