Scrapling Fetcher
Web scraping using Scrapling — a Python framework with anti-bot bypass (Cloudflare Turnstile, finger
- Rating
- 4.1 (175 reviews)
- Downloads
- 1,014 downloads
- Version
- 1.0.0
Overview
Web scraping using Scrapling — a Python framework with anti-bot bypass (Cloudflare Turnstile, fingerprint.
Complete Documentation
View Source →
Scrapling Skill
Source: https://github.com/D4Vinci/Scrapling (open source, MIT-like license)
PyPI: scrapling — install before first use (see below)
⚠️ Only scrape sites you have permission to access. Respect robots.txt and Terms of Service. Do not use stealth modes to bypass paywalls or access restricted content without authorization.Installation (one-time, confirm with user before running)
pip install scrapling[all]
patchright install chromium # required for stealth/dynamic modes
scrapling[all]installspatchright(a stealth fork of Playwright, bundled as a PyPI package — not a typo),curl_cffi, MCP server deps, and IPython shell.patchright install chromiumdownloads Chromium (~100 MB) via patchright's own installer (same mechanism asplaywright install chromium).- Confirm with user before running — installs ~200 MB of dependencies and browser binaries.
Script
scripts/scrape.py — CLI wrapper for all three fetcher modes.
# Basic fetch (text output)
python3 ~/skills/scrapling/scripts/scrape.py <url> -q
# CSS selector extraction
python3 ~/skills/scrapling/scripts/scrape.py <url> --selector ".class" -q
# Stealth mode (Cloudflare bypass) — only on sites you're authorized to access
python3 ~/skills/scrapling/scripts/scrape.py <url> --mode stealth -q
# JSON output
python3 ~/skills/scrapling/scripts/scrape.py <url> --selector "h2" --json -q
Fetcher Modes
- http (default) — Fast HTTP with browser TLS fingerprint spoofing. Most sites.
- stealth — Headless Chrome with anti-detect. For Cloudflare/anti-bot.
- dynamic — Full Playwright browser. For heavy JS SPAs.
When to Use Each Mode
web_fetchreturns 403/429/Cloudflare challenge → use--mode stealth- Page content requires JS execution → use
--mode dynamic - Regular site, just need text/data → use
--mode http(default)
Python Inline Usage
For custom logic beyond the CLI, write inline Python. See references/patterns.md for:
- Adaptive scraping (
auto_save/adaptive— saves element fingerprints locally) - Session/cookie handling
- Async usage
- XPath, find_similar, attribute extraction
Notes
- MCP server (
scrapling mcp): starts a local network service for AI-native scraping. Only start if explicitly needed and trusted — it exposes a local HTTP server. auto_save=True: persists element fingerprints to disk for adaptive re-scraping. Creates local state in working directory.- Stealth/dynamic modes use Chromium headless — no
xvfb-runneeded. - For large-scale crawls, use the Spider API (see Scrapling docs).
Installation
openclaw install scrapling-fetcher
💻Code Examples
patchright install chromium # required for stealth/dynamic modes
- `scrapling[all]` installs `patchright` (a stealth fork of Playwright, bundled as a PyPI package — not a typo), `curl_cffi`, MCP server deps, and IPython shell.
- `patchright install chromium` downloads Chromium (~100 MB) via patchright's own installer (same mechanism as `playwright install chromium`).
- Confirm with user before running — installs ~200 MB of dependencies and browser binaries.
## Script
`scripts/scrape.py` — CLI wrapper for all three fetcher modes.# Basic fetch (text output)
python3 ~/skills/scrapling/scripts/scrape.py <url> -q
# CSS selector extraction
python3 ~/skills/scrapling/scripts/scrape.py <url> --selector ".class" -q
# Stealth mode (Cloudflare bypass) — only on sites you're authorized to access
python3 ~/skills/scrapling/scripts/scrape.py <url> --mode stealth -q
# JSON output
python3 ~/skills/scrapling/scripts/scrape.py <url> --selector "h2" --json -qTags
Quick Info
Ready to Install?
Get started with this skill in seconds
Related Skills
4claw
4claw — a moderated imageboard for AI agents.
Aap Passport
Agent Attestation Protocol - The Reverse Turing Test.
Acestep Lyrics Transcription
Transcribe audio to timestamped lyrics using OpenAI Whisper or ElevenLabs Scribe API.
Adaptive Suite
A continuously adaptive skill suite that empowers Clawdbot.