Anysite CLI

What is Anysite CLI?

Anysite CLI is a command-line toolkit for working with web data, from extraction through analysis to database loading. It supports 118+ endpoints across 60+ data sources, including LinkedIn, Instagram, Twitter/X, Reddit, YouTube, GitHub, Amazon, and SEC EDGAR.

GitHub Repository

Source code, issues, and contributions — MIT licensed

Single API Calls

Make instant API requests with flexible output formats and field filtering

Batch Processing

Process thousands of inputs in parallel with rate limiting and error handling

Dataset Pipelines

Build declarative YAML workflows with chained dependencies and scheduling

Database Integration

Load data into SQLite, PostgreSQL, or ClickHouse with auto-schema and diff-sync

Database Discovery

Auto-discover database structure and browse saved catalogs

LLM Analysis

Classify, summarize, enrich, and deduplicate data using AI models

SQL Querying

Query collected datasets with DuckDB SQL and generate analytics

Agent Protocol

Structured JSON output, exit codes, and discovery for AI agents

Agent Examples

AI agents for automated data collection and executive search

Quick Start

Get up and running in 60 seconds:
1. Install the CLI

curl -fsSL https://raw.githubusercontent.com/anysiteio/anysite-cli/main/install.sh | bash

Or install via pip:

pip install anysite-cli

See Installation for all methods.

2. Configure your API key

Get your API key from app.anysite.io and configure it:

anysite config set api_key YOUR_API_KEY

3. Update the schema cache

anysite schema update

4. Make your first request

anysite api /api/linkedin/user user=satyanadella
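
For scripted setups, the configuration and first-request steps can be wrapped in a small shell function. This is a minimal sketch, not an official installer: it uses only the three `anysite` commands shown in the quick start, and both the function name `quickstart` and the environment variable `ANYSITE_API_KEY` are hypothetical names chosen for this example.

```shell
#!/usr/bin/env bash
# Sketch: run quick-start steps 2-4 from a script.
# Assumption: ANYSITE_API_KEY is a made-up variable name for this example;
# the CLI itself is only known (from the quick start) to accept
# `anysite config set api_key <value>`.

quickstart() {
  # Stop early with a clear message if the CLI is not available.
  if ! command -v anysite >/dev/null 2>&1; then
    echo "anysite not found; install the CLI first" >&2
    return 127
  fi

  # Refuse to continue without a key to configure.
  if [ -z "${ANYSITE_API_KEY:-}" ]; then
    echo "ANYSITE_API_KEY is not set" >&2
    return 2
  fi

  # Steps 2-4, each aborting the function if the previous one failed.
  anysite config set api_key "$ANYSITE_API_KEY" || return
  anysite schema update || return
  anysite api /api/linkedin/user user=satyanadella
}
```

After sourcing the file, run it with `ANYSITE_API_KEY=YOUR_API_KEY quickstart`. Keeping the key in the environment rather than in the script avoids committing it to version control.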

Key Capabilities

| Feature | Description |
| --- | --- |
| 118+ Endpoints | Access data from LinkedIn, Instagram, Twitter/X, Reddit, YouTube, GitHub, Amazon, and more |
| Flexible Output | JSON, JSONL, CSV, and table formats with field selection and exclusion |
| Batch Processing | Parallel execution with rate limiting and configurable error handling |
| Dataset Pipelines | YAML-based multi-source workflows with dependency chains |
| Database Loading | Auto-schema creation, upsert, and diff-sync for SQLite, PostgreSQL, and ClickHouse |
| LLM Analysis | Classify, summarize, enrich, generate, match, and deduplicate records |
| Incremental Collection | Resume from where you left off with cursor tracking |
| Database Discovery | Auto-discover database structure, LLM-enriched catalogs |
| Agent Protocol | Auto-JSON in pipes, structured errors, exit codes, discovery payload |
| Scheduling | Cron-based automation with webhook notifications |
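
The Agent Protocol features (exit codes and JSON output in pipes) suggest a simple pattern for calling the CLI from scripts: capture the output, check the exit status, and pass the output on only when the command succeeded. The sketch below is generic shell, not part of Anysite CLI. The helper name `run_and_check` is hypothetical, and since the specific exit code values are not listed here, it only distinguishes zero from non-zero.

```shell
#!/usr/bin/env bash
# Sketch: run any command, forward its stdout only on success,
# and propagate its exit code on failure.
# run_and_check is a hypothetical helper name for this example.

run_and_check() {
  local out status
  # Capture stdout; the assignment is kept separate from `local`
  # so that $? reflects the command, not the declaration.
  out=$("$@")
  status=$?

  if [ "$status" -ne 0 ]; then
    echo "command failed with exit code $status" >&2
    return "$status"
  fi

  printf '%s\n' "$out"
}
```

A call such as `run_and_check anysite api /api/linkedin/user user=satyanadella | python3 -m json.tool` would then pretty-print the response, assuming the CLI emits JSON when its output is not a terminal, as the "Auto-JSON in pipes" entry indicates.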

How It Fits in the Anysite Ecosystem

REST API

Direct HTTP API for programmatic integration into your applications

MCP Server

AI agent integration via Model Context Protocol for Claude, Cursor, and other tools

n8n Nodes

Visual workflow automation with drag-and-drop nodes

Claude Code Skill

Want AI to build and run pipelines for you? Install the anysite-cli skill for Claude Code:
/plugin marketplace add https://github.com/anysiteio/agent-skills
/plugin install anysite-cli@anysite-skills
The skill gives Claude Code full knowledge of all anysite commands — describe what data you need in natural language and let the agent handle the rest. See Installation for details.
Anysite CLI is ideal when you need programmatic control or reproducible pipelines, or when you want to integrate data extraction into scripts and CI/CD workflows.