Cheerio 27.9k The fast, flexible, and elegant library for parsing and manipulating HTML a... HTML & text parsing Scraping	Pushed 2 days ago 135 contributors Created 13 years ago
website-scraper 1.52k Download website to local directory (including all css, images, js, etc.) Scraping	Pushed 20 days ago 16 contributors Created 10 years ago
Readability 8.18k Extract the Readable Content from an HTML Document Scraping Accessibility	Pushed 2 days ago 74 contributors Created 9 years ago
X-ray 5.84k The next web scraper. See through the <html> noise. Scraping	Pushed 4 years ago 41 contributors Created 9 years ago
Article Extractor 1.41k Extract main article, main image and meta data from URL Scraping	Pushed 12 days ago 16 contributors Created 8 years ago
scrape-it 3.98k A Node.js scraper for humans. Scraping	Pushed a month ago 19 contributors Created 8 years ago
Metascraper 2.24k Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitt... Scraping	Pushed 11 days ago 36 contributors Created 8 years ago
Postlight Parser 5.3k Extract meaningful content from the chaos of a web page HTML & text parsing Scraping	Pushed a year ago 57 contributors Created 8 years ago
Crawlee 12.4k The scalable web scraping and crawling library for JavaScript/Node.js. Enab... Scraping	Pushed a day ago 89 contributors Created 8 years ago
Unfurl 468 Metadata scraper with support for oEmbed, Twitter Cards and Open Graph Prot... Scraping	Pushed 3 months ago 22 contributors Created 7 years ago
Percollate 4.14k A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Mar... Scraping PDF	Pushed 13 days ago 19 contributors Created 6 years ago
linkinator 986 A super simple site crawler and broken link checker Scraping	Pushed 3 days ago 24 contributors Created 5 years ago

Projects

Scraping • 12 projects