Jump to related tools in the same category or review the original source on GitHub.

Browser & Automation @opsun Updated 2/26/2026

Deep Scraper OpenClaw Skill - ClawHub

Do you want your AI agent to automate Deep Scraper workflows? This free skill from ClawHub helps with browser & automation tasks without building custom tools from scratch.

What this skill does

A high-performance engineering tool for deep web scraping.

Install

npx clawhub@latest install deep-scraper

Full SKILL.md

Open original

Skill: deep-scraper

Overview

A high-performance engineering tool for deep web scraping. It uses a containerized Docker + Crawlee (Playwright) environment to penetrate protections on complex websites like YouTube and X/Twitter, providing "interception-level" raw data.

Requirements

  1. Docker: Must be installed and running on the host machine.
  2. Image: Build the environment with the tag clawd-crawlee.
    • Build command: docker build -t clawd-crawlee skills/deep-scraper/

Integration Guide

Simply copy the skills/deep-scraper directory into your skills/ folder. Ensure the Dockerfile remains within the skill directory for self-contained deployment.

Standard Interface (CLI)

docker run -t --rm -v $(pwd)/skills/deep-scraper/assets:/usr/src/app/assets clawd-crawlee node assets/main_handler.js [TARGET_URL]

Output Specification (JSON)

The scraping results are printed to stdout as a JSON string:

  • status: SUCCESS | PARTIAL | ERROR
  • type: TRANSCRIPT | DESCRIPTION | GENERIC
  • videoId: (For YouTube) The validated Video ID.
  • data: The core text content or transcript.

Core Rules

  1. ID Validation: All YouTube tasks MUST verify the Video ID to prevent cache contamination.
  2. Privacy: Strictly forbidden from scraping password-protected or non-public personal information.
  3. Alpha-Focused: Automatically strips ads and noise, delivering pure data optimized for LLM processing.
Original URL: https://github.com/openclaw/skills/blob/main/skills/opsun/deep-scraper

Related skills

If this matches your use case, these are close alternatives in the same category.