$ timeahead.in
/ servers/pypi/scrapling
pypi

scrapling

🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!

61k stars187k/wkupdated 1d agogithub ↗
94excellent
▣ Overview

What it does

Scrapling is an adaptive web scraping framework that scales from single HTTP requests to full-site crawls. It provides multiple fetchers—including StealthyFetcher and DynamicFetcher—that bypass anti-bot systems like Cloudflare Turnstile. The selection API includes adaptive learning: when a website's structure changes, adding adaptive=True to your selectors relocates elements automatically rather than breaking. For scaling up, the Spider framework enables concurrent crawls with built-in pause/resume, proxy rotation, and real-time statistics.

Who it's for

Backend engineers and data engineers building data pipelines, researchers gathering datasets from public web sources, and operators maintaining scrapers that need to adapt to frequent website redesigns. Teams comfortable with Python and needing sub-request-level control over fetch behavior and retry logic will find the most value.

Common use cases

  • Extract product listings, pricing, or reviews from e-commerce sites despite anti-bot protection
  • Monitor websites for content changes by re-parsing with adaptive selectors after design updates
  • Build multi-session crawlers for large sites with automatic proxy rotation and pause/resume
  • Gather training data or datasets from public sources at scale with concurrent workers
  • Fetch and parse dynamic (JavaScript-rendered) pages using the DynamicFetcher

Setup pitfalls

  • Requires network access to target sites and to proxy services if rotation is enabled; validate credentials upfront
  • Filesystem write access needed for internal caching and adaptive model state; ensure working directory is writable
  • Anti-bot systems may still block requests if issued too rapidly; respect robots.txt and implement delays between requests
  • Fetchers that render JavaScript (e.g., DynamicFetcher) may require additional browser dependencies; check documentation for your target fetcher
▣ Score BreakdownMCPScore = Σ(raw × weight)
DimensionRawWeighted
Security
35%
100
35.0
Freshness
25%
100
25.0
Adoption
20%
100
20.0
Quality
10%
90
9.0
Trust
10%
50
5.0
Total
94.0
⚿ Capabilities & Risk Explainer
fs readfs writenetworksecrets
◆ Risk level: medium
fs read + fs write + network + secrets — requires access to credentials or environment secrets.
⚙ Install config
Claude Desktop · Cursor · Windsurf · VS Code (Copilot) · Claude Code
add to your MCP client config:
{
  "mcpServers": {
    "scrapling-1": {
      "command": "uvx",
      "args": [
        "scrapling"
      ]
    }
  }
}
📈 Score historylast 27 snapshots
5/10/20266/6/2026 · 27 snapshots
⚙ Maintenance health
59/ 100 · is this project alive?
contributors (1y)17
top contributor share95%
releases (1y)24
last release26d ago
ci✓ passing
⛁ Raw data
weekly downloads187k
github stars61k
forks6k
open issues25
license✓ present
readme length31949 chars
last publish0d ago
last commit1d ago
last updated6h ago
install verified✓ pass · 20d ago
owner of this server? claim your listing to get a verified badgeclaim →
🔔 Score drop alerts
get notified by email when this server's score drops 5+ points