Library for extracting streaming site data without official APIs
The complete web scraping toolkit for PHP
A scalable web crawler framework for Java
Open source enterprise search server for websites, files, and data
Java library for working with real-world HTML
Collection of 100+ Python web scraping projects and crawler examples
Distributed web crawler admin platform for spiders management
Distributed Crawler Management Framework Based on Scrapy
The next generation web scraping framework
ACHE is a web crawler for domain-specific search
A service daemon to run Scrapy spiders
Simple Python framework for building multithreaded web crawlers
Educational Python web scraping case collection for many sites
Async Python framework for fast and flexible web scraping spiders
An Android rich text class library that supports graphic & text mixing
Collection of Python ecommerce and website crawler examples projects
Ever wanted to download only a part of a Git repository.
Open source web crawler for Java
Lightweight Java web crawler framework with jQuery-style extraction
DSTK - DataScience ToolKit for All of Us
Android app for saving webpages for offline reading