Claudius
Status

webclaw

MCP

Rust-based MCP server for local-first web content extraction with support for scraping, crawling, and structured data extraction for LLMs.

by 0xMassi·0xMassi/webclaw·Rust·v0.6.9
86· A
Install
git clone https://github.com/0xMassi/webclaw
Stars
1,353
7d change
Downloads / week
Last active
today
About

Turn websites into clean markdown, JSON, and LLM-ready context. CLI, MCP server, REST API, and SDKs for AI agents and RAG pipelines.

Most web scraping tools give your agent one of two bad outputs:

a blocked page, login wall, or empty app shell raw HTML full of nav, scripts, styling, ads, and duplicated boilerplate

webclaw.io is the hosted web extraction API for webclaw. This repo contains the open-source CLI, MCP server, extraction engine, and self-hostable server.

Read more on GitHub →
30-day stars
Trust factors
Source
community
Known advisories
0
Maintenance
active
License
AGPL-3.0
Age
3 months
web-scrapingdeveloper-tools#rust#scraping#crawling#data-extraction#html-to-markdown#self-hosted#ai#ai-agents#ai-scraping#cli#crawler#firecrawl-alternative#llm#markdown#mcp#rag#tls-fingerprinting#web-crawler#web-extraction#web-scraper