Hugging Face Dataset Viewer API — Dataset Info

A self-contained Naftiko capability covering the Dataset Info surface of the Hugging Face Dataset Viewer API. It provides two operations: Check Dataset Validity (the lead operation) and Get Dataset Splits. Both are exposed through a REST adapter and an MCP adapter.

What You Can Do

GET /v1/is-valid    Isvalid: Check Dataset Validity
GET /v1/splits      Getsplits: Get Dataset Splits
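
As a rough illustration of how these two REST operations might be called, the sketch below builds the request URLs with Python's standard library. The host (`localhost`) is an assumption; only port 8080 and the `/v1/...` paths come from the spec. Passing `dataset` as a query parameter is inferred from the `rest.dataset` mapping, and the helper names `build_rest_url` and `call_rest` are hypothetical. The dataset ID in the usage note is just an example value.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: the REST adapter runs locally. Only the port (8080) is taken from the spec.
REST_BASE_URL = "http://localhost:8080"


def build_rest_url(path: str, dataset: str, base_url: str = REST_BASE_URL) -> str:
    """Build an adapter URL with the dataset ID URL-encoded as a query parameter."""
    return f"{base_url}{path}?{urlencode({'dataset': dataset})}"


def call_rest(path: str, dataset: str, timeout: float = 10.0):
    """Send the GET request and decode the JSON body (needs a running adapter)."""
    with urlopen(build_rest_url(path, dataset), timeout=timeout) as response:
        return json.load(response)
```

With an adapter running, `call_rest("/v1/is-valid", "rajpurkar/squad")` would return whatever JSON the upstream service sends back, since the spec maps the whole response object through unchanged.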

MCP Tools

check-dataset-validity    Check Dataset Validity (read-only, idempotent)
get-dataset-splits        Get Dataset Splits (read-only, idempotent)
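
For readers unfamiliar with MCP, a tool invocation is a JSON-RPC 2.0 request whose method is `tools/call`. The sketch below only constructs such a payload for the two tools listed above; it does not open a session or send anything. The argument name `dataset` is inferred from the `tools.dataset` mapping in the spec, and the helper name `build_tool_call` is hypothetical.

```python
import json


def build_tool_call(tool_name: str, dataset: str, request_id: int = 1) -> dict:
    """Build a JSON-RPC 2.0 'tools/call' request for one of the MCP tools."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": tool_name,
            # Assumption: the tool takes a single 'dataset' argument.
            "arguments": {"dataset": dataset},
        },
    }


def to_wire(payload: dict) -> str:
    """Serialize the request as the JSON text an MCP client would transmit."""
    return json.dumps(payload)
```

In practice an MCP client library would handle initialization and transport; this only shows the shape of the call that ends up routed to `isvalid` or `getsplits`.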

Capability Spec

dataset-viewer-dataset-info.yaml
naftiko: 1.0.0-alpha2
info:
  label: Hugging Face Dataset Viewer API — Dataset Info
  description: 'Hugging Face Dataset Viewer API — Dataset Info. 2 operations. Lead operation: Check Dataset Validity. Self-contained
    Naftiko capability covering one Hugging Face business surface.'
  tags:
  - Hugging Face
  - Dataset Info
  created: '2026-05-19'
  modified: '2026-05-19'
binds:
- namespace: env
  keys:
    HUGGING_FACE_API_KEY: HUGGING_FACE_API_KEY
capability:
  consumes:
  - type: http
    namespace: dataset-viewer-dataset-info
    baseUri: https://datasets-server.huggingface.co
    description: Hugging Face Dataset Viewer API — Dataset Info business capability. Self-contained, no shared references.
    resources:
    - name: is-valid
      path: /is-valid
      operations:
      - name: isvalid
        method: GET
        description: Check Dataset Validity
        outputRawFormat: json
        outputParameters:
        - name: result
          type: object
          value: $.
        inputParameters:
        - name: dataset
          in: query
          type: string
          description: The dataset ID on the Hugging Face Hub
          required: true
    - name: splits
      path: /splits
      operations:
      - name: getsplits
        method: GET
        description: Get Dataset Splits
        outputRawFormat: json
        outputParameters:
        - name: result
          type: object
          value: $.
        inputParameters:
        - name: dataset
          in: query
          type: string
          description: The dataset ID on the Hugging Face Hub
          required: true
    authentication:
      type: bearer
      token: '{{env.HUGGING_FACE_API_KEY}}'
  exposes:
  - type: rest
    namespace: dataset-viewer-dataset-info-rest
    port: 8080
    description: REST adapter for Hugging Face Dataset Viewer API — Dataset Info. One Spectral-compliant resource per consumed
      operation, prefixed with /v1.
    resources:
    - path: /v1/is-valid
      name: is-valid
      description: REST surface for is-valid.
      operations:
      - method: GET
        name: isvalid
        description: Check Dataset Validity
        call: dataset-viewer-dataset-info.isvalid
        with:
          dataset: rest.dataset
        outputParameters:
        - type: object
          mapping: $.
    - path: /v1/splits
      name: splits
      description: REST surface for splits.
      operations:
      - method: GET
        name: getsplits
        description: Get Dataset Splits
        call: dataset-viewer-dataset-info.getsplits
        with:
          dataset: rest.dataset
        outputParameters:
        - type: object
          mapping: $.
  - type: mcp
    namespace: dataset-viewer-dataset-info-mcp
    port: 9090
    transport: http
    description: MCP adapter for Hugging Face Dataset Viewer API — Dataset Info. One tool per consumed operation, routed inline
      through this capability's consumes block.
    tools:
    - name: check-dataset-validity
      description: Check Dataset Validity
      hints:
        readOnly: true
        destructive: false
        idempotent: true
      call: dataset-viewer-dataset-info.isvalid
      with:
        dataset: tools.dataset
      outputParameters:
      - type: object
        mapping: $.
    - name: get-dataset-splits
      description: Get Dataset Splits
      hints:
        readOnly: true
        destructive: false
        idempotent: true
      call: dataset-viewer-dataset-info.getsplits
      with:
        dataset: tools.dataset
      outputParameters:
      - type: object
        mapping: $.
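
To make the `consumes` block more concrete, here is a minimal sketch of the upstream HTTP request it describes: a GET against the `baseUri` with `dataset` as a query parameter and a bearer token read from `HUGGING_FACE_API_KEY`. This is an illustration using Python's standard library, not how Naftiko itself issues the call. The helper names are hypothetical, and omitting the header when no token is set is a choice made here, not something the spec states.

```python
import json
import os
from typing import Optional
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# baseUri as declared in the capability spec.
UPSTREAM_BASE_URI = "https://datasets-server.huggingface.co"


def build_upstream_request(path: str, dataset: str, token: Optional[str] = None) -> Request:
    """Build the GET request for /is-valid or /splits with bearer authentication."""
    if token is None:
        # Same environment variable the spec binds under the 'env' namespace.
        token = os.environ.get("HUGGING_FACE_API_KEY", "")
    url = f"{UPSTREAM_BASE_URI}{path}?{urlencode({'dataset': dataset})}"
    # Assumption: skip the Authorization header entirely when no token is available.
    headers = {"Authorization": f"Bearer {token}"} if token else {}
    return Request(url, headers=headers, method="GET")


def fetch(request: Request, timeout: float = 10.0):
    """Send the request and decode the JSON response (performs a network call)."""
    with urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

For example, `fetch(build_upstream_request("/splits", "rajpurkar/squad"))` would query the splits of that dataset, assuming network access and a valid token where one is required.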