Search results: 12 projects

Repomix
17.7k
Pack an entire repository into a single, AI-friendly file. Perfect for when...
Extract the main content from web pages.
AI-powered query language for web scraping and automation. It uses natural ...
Download website to local directory (including all css, images, js, etc.)
Extract main article, main image and meta data from URL
Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitt...
A super simple site crawler and broken link checker
Extract the Readable Content from an HTML Document
A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Mar...
Crawlee
18.5k
The scalable web scraping and crawling library for JavaScript/Node.js. Enab...
A Node.js scraper for humans.
Cheerio
29.6k
The fast, flexible, and elegant library for parsing and manipulating HTML a...