POST /api/huggingface/datasets
curl --request POST \
  --url https://api.anysite.io/api/huggingface/datasets \
  --header 'Content-Type: application/json' \
  --header 'access-token: <api-key>' \
  --data '
{
  "dataset": "<string>",
  "timeout": 300
}
'
[
  {
    "id": "<string>",
    "url": "<string>",
    "@type": "@huggingface_dataset",
    "author": "<string>",
    "sha": "<string>",
    "download_count": 123,
    "like_count": 123,
    "trending_score": 123,
    "description": "<string>",
    "tags": [],
    "gated": true,
    "is_private": false,
    "is_disabled": false,
    "created_at": "<string>",
    "last_modified": "<string>",
    "used_storage": 123,
    "paperswithcode_id": "<string>",
    "card_data": {},
    "siblings": []
  }
]

Authorizations

access-token
string
header
required

API token from the dashboard

Headers

access-token
string
required

Body

application/json
dataset
string
required

Dataset repository id ('org/name') or full dataset URL

Minimum string length: 1
Examples:

"stanfordnlp/imdb"

"https://huggingface.co/datasets/HuggingFaceFW/fineweb"

timeout
integer
default:300

Maximum scraping execution timeout (in seconds)

Required range: 20 <= x <= 1500
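
The body constraints above can be checked on the client before a request is sent. The following is a minimal Python sketch, not an official client: the helper name `build_request` is hypothetical, and only the URL, the header names, and the two body fields come from this page. It builds the request without sending it.

```python
import json
import urllib.request

# Endpoint URL as shown in the curl example on this page.
API_URL = "https://api.anysite.io/api/huggingface/datasets"


def build_request(dataset: str, token: str, timeout: int = 300) -> urllib.request.Request:
    """Build (but do not send) the POST request. Hypothetical helper."""
    # "dataset" is required and has a minimum string length of 1.
    if len(dataset) < 1:
        raise ValueError("dataset must be a non-empty string")
    # "timeout" must satisfy 20 <= x <= 1500 (seconds).
    if not 20 <= timeout <= 1500:
        raise ValueError("timeout must be between 20 and 1500 seconds")

    body = json.dumps({"dataset": dataset, "timeout": timeout}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={"Content-Type": "application/json", "access-token": token},
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send the request. Because the server-side timeout can be as long as 1500 seconds, a client-side socket timeout at least that long is probably sensible.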

Response

Successful Response

id
string
required
url
string
required
@type
string
default:@huggingface_dataset
author
string | null
sha
string | null
download_count
integer | null
like_count
integer | null
description
string | null
tags
string[]
gated
is_private
boolean
default:false
is_disabled
boolean
default:false
created_at
string | null
last_modified
string | null
used_storage
integer | null
paperswithcode_id
string | null
card_data
Card Data · object
siblings
HuggingfaceModelFile · object[]
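
Since the response is a JSON array and most fields are nullable, a small typed wrapper can make the result easier to consume. The sketch below is illustrative only: the `DatasetInfo` class and `parse_datasets` function are hypothetical names, and it covers only a subset of the fields listed above. It assumes the response body has already been read into a string.

```python
import json
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class DatasetInfo:
    # Required fields in the response schema.
    id: str
    url: str
    # Nullable fields default to None when absent or null.
    author: Optional[str] = None
    download_count: Optional[int] = None
    like_count: Optional[int] = None
    tags: List[str] = field(default_factory=list)
    # Booleans documented with default:false.
    is_private: bool = False
    is_disabled: bool = False


def parse_datasets(payload: str) -> List[DatasetInfo]:
    """Parse the JSON array returned by the endpoint. Hypothetical helper."""
    return [
        DatasetInfo(
            id=item["id"],
            url=item["url"],
            author=item.get("author"),
            download_count=item.get("download_count"),
            like_count=item.get("like_count"),
            tags=item.get("tags") or [],
            is_private=item.get("is_private", False),
            is_disabled=item.get("is_disabled", False),
        )
        for item in json.loads(payload)
    ]
```

Fields not modeled here, such as `card_data` and `siblings`, are ignored by this sketch and can be added in the same way if needed.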