![Cheerio](/_next/image?url=https%3A%2F%2Favatars.githubusercontent.com%2Fu%2F7230330%3Fv%3D3%26s%3D96&w=150&q=75) | The fast, flexible, and elegant library for parsing and manipulating HTML a... | Pushed 2 days ago 142 contributors Created 13 years ago | 29.1k |
![Crawlee](/_next/image?url=https%3A%2F%2Favatars.githubusercontent.com%2Fu%2F24586296%3Fv%3D3%26s%3D96&w=150&q=75) | The scalable web scraping and crawling library for JavaScript/Node.js. Enab... | Pushed 2 days ago 101 contributors Created 8 years ago | 16.8k |
![Readibility](/_next/image?url=https%3A%2F%2Favatars.githubusercontent.com%2Fu%2F131524%3Fv%3D3%26s%3D96&w=150&q=75) | Extract the Readable Content from an HTML Document | Pushed a month ago 82 contributors Created 10 years ago | 9.42k |
![Postlight Parser](/_next/image?url=https%3A%2F%2Favatars.githubusercontent.com%2Fu%2F12798053%3Fv%3D3%26s%3D96&w=150&q=75) | Extract meaningful content from the chaos of a web page | Pushed 2 years ago 57 contributors Created 8 years ago | 5.54k |
![Percollate](/_next/image?url=https%3A%2F%2Favatars.githubusercontent.com%2Fu%2F205375%3Fv%3D3%26s%3D96&w=150&q=75) | A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Mar... | Pushed 2 months ago 21 contributors Created 6 years ago | 4.37k |
![scrape-it](/_next/image?url=https%3A%2F%2Favatars.githubusercontent.com%2Fu%2F2864371%3Fv%3D3%26s%3D96&w=150&q=75) | A Node.js scraper for humans. | Pushed 7 days ago 20 contributors Created 9 years ago | 4.04k |
![Metascraper](/_next/image?url=https%3A%2F%2Favatars.githubusercontent.com%2Fu%2F29799436%3Fv%3D3%26s%3D96&w=150&q=75) | Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitt... | Pushed 18 days ago 40 contributors Created 9 years ago | 2.4k |
![Article Extractor](/_next/image?url=https%3A%2F%2Favatars.githubusercontent.com%2Fu%2F119390214%3Fv%3D3%26s%3D96&w=150&q=75) | Extract main article, main image and meta data from URL | Pushed 12 days ago 16 contributors Created 9 years ago | 1.65k |
![website-scraper](/_next/image?url=https%3A%2F%2Favatars.githubusercontent.com%2Fu%2F31221461%3Fv%3D3%26s%3D96&w=150&q=75) | Download website to local directory (including all css, images, js, etc.) | Pushed 18 days ago 16 contributors Created 10 years ago | 1.58k |
![linkinator](/_next/image?url=https%3A%2F%2Favatars.githubusercontent.com%2Fu%2F534619%3Fv%3D3%26s%3D96&w=150&q=75) | A super simple site crawler and broken link checker | Pushed 3 months ago 26 contributors Created 6 years ago | 1.05k |
![AgentQL](/_next/image?url=https%3A%2F%2Favatars.githubusercontent.com%2Fu%2F119996419%3Fv%3D3%26s%3D96&w=150&q=75) | AI-powered query language for web scraping and automation. It uses natural ... | Pushed 2 days ago 20 contributors Created a year ago | 525 |