
What is Anysite CLI?
Anysite CLI is a powerful command-line toolkit that gives you full control over web data — from extraction to analysis to database loading. It supports 118+ endpoints across 60+ data sources including LinkedIn, Instagram, Twitter/X, Reddit, YouTube, GitHub, Amazon, SEC EDGAR, and more.GitHub Repository
Source code, issues, and contributions — MIT licensed
Single API Calls
Make instant API requests with flexible output formats and field filtering
Batch Processing
Process thousands of inputs in parallel with rate limiting and error handling
Dataset Pipelines
Build declarative YAML workflows with chained dependencies and scheduling
Database Integration
Load data into SQLite, PostgreSQL, or ClickHouse with auto-schema and diff-sync
Database Discovery
Auto-discover database structure and browse saved catalogs
LLM Analysis
Classify, summarize, enrich, and deduplicate data using AI models
SQL Querying
Query collected datasets with DuckDB SQL and generate analytics
Agent Protocol
Structured JSON output, exit codes, and discovery for AI agents
Agent Examples
AI agents for automated data collection and executive search
Quick Start
Get up and running in 60 seconds:Install the CLI
pip install anysite-cli — see Installation for all methods.Configure your API key
Get your API key from app.anysite.io and configure it:
Key Capabilities
| Feature | Description |
|---|---|
| 118+ Endpoints | Access data from LinkedIn, Instagram, Twitter/X, Reddit, YouTube, GitHub, Amazon, and more |
| Flexible Output | JSON, JSONL, CSV, and table formats with field selection and exclusion |
| Batch Processing | Parallel execution with rate limiting and configurable error handling |
| Dataset Pipelines | YAML-based multi-source workflows with dependency chains |
| Database Loading | Auto-schema creation, upsert, and diff-sync for SQLite, PostgreSQL, and ClickHouse |
| LLM Analysis | Classify, summarize, enrich, generate, match, and deduplicate records |
| Incremental Collection | Resume from where you left off with cursor tracking |
| Database Discovery | Auto-discover database structure, LLM-enriched catalogs |
| Agent Protocol | Auto-JSON in pipes, structured errors, exit codes, discovery payload |
| Scheduling | Cron-based automation with webhook notifications |
How It Fits in the Anysite Ecosystem
REST API
Direct HTTP API for programmatic integration into your applications
MCP Server
AI agent integration via Model Context Protocol for Claude, Cursor, and other tools
n8n Nodes
Visual workflow automation with drag-and-drop nodes