Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.anysite.io/llms.txt

Use this file to discover all available pages before exploring further.

Anysite turns the public web into structured data. Instead of writing scrapers and fighting anti-bot systems, you call a single API and get clean JSON back — whether you need a LinkedIn profile, a company’s official filings, a product’s reviews, or the comments under a YouTube video. This page is a plain-language tour of what you can actually do with each source. For exact parameters and response schemas, every section links to the full API Reference.

39 Data Sources

Social, professional, financial, commercial, and official-registry data

180+ Endpoints

Ready-to-use methods — no scraping, no proxies, no maintenance

One Integration

REST API, MCP Server, CLI, or n8n — same data everywhere

Professional & Social Networks

The core of Anysite. Reach people, companies, content, and engagement across every major platform.

LinkedIn

Our deepest source — 35 endpoints covering people, companies, and content. This is the backbone of sales prospecting, recruiting, and B2B research. People. Pull a full profile (experience, education, skills, certifications, honors, patents, languages) from a URL or name. Go further with a person’s posts, comments, reactions, and the people who endorsed their skills. Companies. Get a company profile, headcount and employee statistics, the actual list of employees, and everything the company has posted. Search. Find people, companies, jobs, and posts with rich filters — plus helper lookups for industries, locations, and schools to build precise queries. Sales Navigator-grade people search is included. Email finder. Discover and verify a person’s work email from their profile or LinkedIn URL — ideal for outbound enrichment.

LinkedIn endpoints

Profiles, companies, posts, search, Sales Navigator, and email finder

Instagram

8 endpoints for creator and brand intelligence. Get a user’s profile, their posts and reels, and their followers/following. Drill into any single post to read its comments and see who liked it. Search posts by keyword or hashtag to track campaigns and trends.

Twitter / X

4 endpoints for real-time social listening. Pull a user’s profile and full timeline, and search both posts and accounts by keyword — perfect for monitoring conversations, sentiment, and competitors.

Reddit

11 endpoints for community and sentiment research. Read posts and their comment trees, profile any user (their posts and comments), and explore subreddits — including popular ones and a subreddit’s full feed. Search across posts, subreddits, and users to find exactly the discussions you care about.

TikTok

5 endpoints for short-video intelligence. Get user profiles and their videos, pull any individual video with its comments, and search videos by keyword to spot trends and viral content.

YouTube

14 endpoints — the most complete video source. Explore a channel’s videos, shorts, live streams, playlists, and community posts. Open any video for its details, comments, and full subtitles/transcript (great for feeding into LLMs). Search channels, playlists, and videos directly.

Facebook

5 endpoints focused on the Ad Library — full ad transparency. Look up advertisers, browse the ads they’re running, and search ads or advertisers by keyword. The fastest way to monitor competitor advertising and creative strategy.

Startups, Companies & Official Registries

From early-stage startups to public-company filings — verified, structured company data.

Y Combinator

3 endpoints for startup intelligence. Get a YC company profile, and search both companies and founders across every batch — useful for VC sourcing, competitive mapping, and recruiting from top startups.

Crunchbase

3 endpoints for funding and investor data. Pull a company profile or search the Crunchbase database to understand rounds, investors, and growth signals.

Product Hunt

23 endpoints — the entire Product Hunt ecosystem. Track products (details, search, reviews, alternatives, customers, launch history), daily launches and their comments, leaderboards, collections, topics, categories, and discussion forums. Profile makers and users down to their launches, upvotes, collections, comments, and tech stacks. A goldmine for product research and trend-spotting.

Companies House (UK)

8 endpoints straight from the UK’s official company register. Look up a company’s profile, filing history, charges, officers, and persons with significant control (beneficial owners). Search companies and officers by name. Authoritative, court-grade data.

OpenCorporates

2 endpoints for global corporate lookup. Find a company profile or search across the world’s largest open database of company records.

SEC (US)

2 endpoints for US public-company research. Search companies and retrieve their SEC filing documents — 10-Ks, 8-Ks, and more — for financial and due-diligence work.

GitHub

3 endpoints for developer and technology intelligence. Look up repositories, search repos, and search users — handy for technical recruiting and OSS ecosystem analysis.

Search & Web

When you need the open web — search engines, maps, and a universal parser for anything else.

Google

10 endpoints across two areas. Maps: find places, search places, and read their photos, reviews, and reviewer profiles — local-business and reputation data at scale. Ads transparency: look up advertisers and the ads they run for competitive ad research.

Yahoo

7 endpoints combining a general search engine and finance data. Search the web, news, images, and videos — plus Yahoo Finance for stock quotes, historical price data, and ticker search.

DuckDuckGo

1 endpoint for privacy-friendly web search results — a clean, no-tracking alternative when you just need links and snippets.

Web Parser

2 endpoints that make any website a data source. Point the parser at a URL and get structured content back, or pull a site’s full sitemap to discover every page. This is the universal fallback when no dedicated source exists.

Universal scraping

Web Parser turns any page into structured JSON

Marketplaces & Product Reviews

E-commerce listings, sellers, and software-review platforms.

Amazon

5 endpoints for product and seller research. Get product details and offers, search products, and look up sellers and their catalogs — for pricing intelligence, MAP monitoring, and market analysis.

Mercari

2 endpoints for the Japanese marketplace — item details and item search.

Capterra

3 endpoints for software buyers. Pull a product’s details and its reviews, and search the Capterra catalog — competitive intelligence for SaaS.

G2

1 endpoint to search G2’s software-product directory — complementary review-site signal alongside Capterra.

Industry & Niche Sources (AI-Based Scrapers)

Beyond the major platforms, Anysite ships purpose-built parsers for specific sites — partner directories, niche job boards, registries, and publishers. Examples include Siemens and Honeywell partner networks, Rockwell Automation partners, Wasabi channel partners, the GLEIF LEI registry, BuiltWith technology data, ApplyBoard school programs, JustJoinIT job listings, and several specialist publishers. The important part isn’t the list — it’s the pattern: any website can become a clean, structured endpoint. If a source you need isn’t here yet, it can be built and it appears automatically across the API, MCP Server, and CLI.

Custom Endpoints

Request a parser for any site — it shows up automatically in every integration

How to Access These Sources

Every source above is available through all four Anysite integrations — same data, your choice of interface:

REST API

Direct HTTP calls with an access-token header

MCP Server

Let AI agents discover and call any endpoint

Anysite CLI

Batch processing, pipelines, and database loading

n8n Nodes

No-code workflow automation