Best of JS is a project by Michael Rambeau, made in Osaka, Japan.

Scraping

11 active projects

HTML & text parsing AI Accessibility Browser automation

30.4k

Fast, flexible, and lean implementation of core jQuery designed specifically for the server.

HTML & text parsing

Pushed 4 days ago

170 contributors

Created 15 years ago

27.6k

Pack an entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.

Pushed a day ago

81 contributors

Created 2 years ago

25.1k

The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

Pushed 2 days ago

138 contributors

Created 10 years ago

11.4k

Extract the Readable Content from an HTML Document

Pushed 25 days ago

99 contributors

Created 11 years ago

8.71k

Extract the main content from web pages.

HTML & text parsing

Pushed 7 days ago

37 contributors

Created a year ago

4.66k

A command-line tool to grab web pages as beautifully formatted PDFs

Pushed a year ago

18 contributors

Created 8 years ago

2.72k

A library to easily scrape metadata from an article on the web using Open Graph, JSON-LD, regular HTML metadata, and a series of fallbacks.

Pushed 4 days ago

42 contributors

Created 10 years ago

Article Extractor

1.9k

Extract the main article, main image, and metadata from a URL

Pushed 3 months ago

16 contributors

Created 11 years ago

website-scraper

1.74k

Download a website to a local directory (including all CSS, images, JS, etc.)

Pushed 13 days ago

20 contributors

Created 12 years ago

AgentQL

1.43k

AI-powered query language for web scraping and automation. It uses natural language selectors to find data on any page, including authenticated content. AgentQL queries are self-healing as UI changes and work across similar sites. Users can define structured data output, making AgentQL versatile for developers and data scientists.

Browser automation

HTML & text parsing

Pushed 4 days ago

19 contributors

Created 2 years ago

1.25k

A super simple site crawler and broken link checker

Pushed 4 days ago

30 contributors

Created 7 years ago

Best of JS • Scraping projects