QuantCDN · Capability

QuantCDN API — AI Inference

QuantCDN API — AI Inference. 6 operations. Lead operation: Chat inference via API Gateway (buffered responses) with multimodal support. Self-contained Naftiko capability covering one Quantcdn business surface.

Run with Naftiko QuantcdnAI Inference

What You Can Do

POST
Chatinference — Chat inference via API Gateway (buffered responses) with multimodal support
/v1/api/v3/organizations/{organisation}/ai/chat
POST
Submittoolcallback — Submit Client Tool Results (Callback)
/v1/api/v3/organizations/{organisation}/ai/chat/callback
GET
Getdurableexecutionstatus — Get Durable Execution Status
/v1/api/v3/organizations/{organisation}/ai/chat/executions/{identifier}
POST
Chatinferencestream — Chat inference via streaming endpoint (true HTTP streaming) with multimodal support
/v1/api/v3/organizations/{organisation}/ai/chat/stream
POST
Embeddings — Generate text embeddings for semantic search and RAG applications
/v1/api/v3/organizations/{organisation}/ai/embeddings
POST
Imagegeneration — Generate images with Amazon Nova Canvas
/v1/api/v3/organizations/{organisation}/ai/image-generation

MCP Tools

chat-inference-api-gateway-buffered

Chat inference via API Gateway (buffered responses) with multimodal support

submit-client-tool-results-callback

Submit Client Tool Results (Callback)

get-durable-execution-status

Get Durable Execution Status

read-only idempotent
chat-inference-streaming-endpoint-true

Chat inference via streaming endpoint (true HTTP streaming) with multimodal support

generate-text-embeddings-semantic-search

Generate text embeddings for semantic search and RAG applications

read-only
generate-images-amazon-nova-canvas

Generate images with Amazon Nova Canvas

Capability Spec

quantcdn-ai-inference.yaml Raw ↑
naftiko: 1.0.0-alpha2
info:
  label: QuantCDN API — AI Inference
  description: 'QuantCDN API — AI Inference. 6 operations. Lead operation: Chat inference via API Gateway (buffered responses)
    with multimodal support. Self-contained Naftiko capability covering one Quantcdn business surface.'
  tags:
  - Quantcdn
  - AI Inference
  created: '2026-05-19'
  modified: '2026-05-19'
binds:
- namespace: env
  keys:
    QUANTCDN_API_KEY: QUANTCDN_API_KEY
capability:
  consumes:
  - type: http
    namespace: quantcdn-ai-inference
    baseUri: https://dashboard.quantcdn.io
    description: QuantCDN API — AI Inference business capability. Self-contained, no shared references.
    resources:
    - name: api-v3-organizations-organisation-ai-chat
      path: /api/v3/organizations/{organisation}/ai/chat
      operations:
      - name: chatinference
        method: POST
        description: Chat inference via API Gateway (buffered responses) with multimodal support
        outputRawFormat: json
        outputParameters:
        - name: result
          type: object
          value: $.
        inputParameters:
        - name: organisation
          in: path
          type: string
          description: The organisation ID
          required: true
        - name: body
          in: body
          type: object
          description: Request body (JSON).
          required: true
    - name: api-v3-organizations-organisation-ai-chat-callback
      path: /api/v3/organizations/{organisation}/ai/chat/callback
      operations:
      - name: submittoolcallback
        method: POST
        description: Submit Client Tool Results (Callback)
        outputRawFormat: json
        outputParameters:
        - name: result
          type: object
          value: $.
        inputParameters:
        - name: organisation
          in: path
          type: string
          description: The organisation ID
          required: true
        - name: body
          in: body
          type: object
          description: Request body (JSON).
          required: true
    - name: api-v3-organizations-organisation-ai-chat-executions-identifier
      path: /api/v3/organizations/{organisation}/ai/chat/executions/{identifier}
      operations:
      - name: getdurableexecutionstatus
        method: GET
        description: Get Durable Execution Status
        outputRawFormat: json
        outputParameters:
        - name: result
          type: object
          value: $.
        inputParameters:
        - name: organisation
          in: path
          type: string
          description: The organisation ID
          required: true
        - name: identifier
          in: path
          type: string
          description: Either the requestId from async response, or full executionArn (URL-encoded)
          required: true
    - name: api-v3-organizations-organisation-ai-chat-stream
      path: /api/v3/organizations/{organisation}/ai/chat/stream
      operations:
      - name: chatinferencestream
        method: POST
        description: Chat inference via streaming endpoint (true HTTP streaming) with multimodal support
        outputRawFormat: json
        outputParameters:
        - name: result
          type: object
          value: $.
        inputParameters:
        - name: organisation
          in: path
          type: string
          description: The organisation ID
          required: true
        - name: body
          in: body
          type: object
          description: Request body (JSON).
          required: true
    - name: api-v3-organizations-organisation-ai-embeddings
      path: /api/v3/organizations/{organisation}/ai/embeddings
      operations:
      - name: embeddings
        method: POST
        description: Generate text embeddings for semantic search and RAG applications
        outputRawFormat: json
        outputParameters:
        - name: result
          type: object
          value: $.
        inputParameters:
        - name: organisation
          in: path
          type: string
          description: The organisation ID
          required: true
        - name: body
          in: body
          type: object
          description: Request body (JSON).
          required: true
    - name: api-v3-organizations-organisation-ai-image-generation
      path: /api/v3/organizations/{organisation}/ai/image-generation
      operations:
      - name: imagegeneration
        method: POST
        description: Generate images with Amazon Nova Canvas
        outputRawFormat: json
        outputParameters:
        - name: result
          type: object
          value: $.
        inputParameters:
        - name: organisation
          in: path
          type: string
          description: The organisation ID
          required: true
        - name: body
          in: body
          type: object
          description: Request body (JSON).
          required: true
    authentication:
      type: bearer
      token: '{{env.QUANTCDN_API_KEY}}'
  exposes:
  - type: rest
    namespace: quantcdn-ai-inference-rest
    port: 8080
    description: REST adapter for QuantCDN API — AI Inference. One Spectral-compliant resource per consumed operation, prefixed
      with /v1.
    resources:
    - path: /v1/api/v3/organizations/{organisation}/ai/chat
      name: api-v3-organizations-organisation-ai-chat
      description: REST surface for api-v3-organizations-organisation-ai-chat.
      operations:
      - method: POST
        name: chatinference
        description: Chat inference via API Gateway (buffered responses) with multimodal support
        call: quantcdn-ai-inference.chatinference
        with:
          organisation: rest.organisation
          body: rest.body
        outputParameters:
        - type: object
          mapping: $.
    - path: /v1/api/v3/organizations/{organisation}/ai/chat/callback
      name: api-v3-organizations-organisation-ai-chat-callback
      description: REST surface for api-v3-organizations-organisation-ai-chat-callback.
      operations:
      - method: POST
        name: submittoolcallback
        description: Submit Client Tool Results (Callback)
        call: quantcdn-ai-inference.submittoolcallback
        with:
          organisation: rest.organisation
          body: rest.body
        outputParameters:
        - type: object
          mapping: $.
    - path: /v1/api/v3/organizations/{organisation}/ai/chat/executions/{identifier}
      name: api-v3-organizations-organisation-ai-chat-executions-identifier
      description: REST surface for api-v3-organizations-organisation-ai-chat-executions-identifier.
      operations:
      - method: GET
        name: getdurableexecutionstatus
        description: Get Durable Execution Status
        call: quantcdn-ai-inference.getdurableexecutionstatus
        with:
          organisation: rest.organisation
          identifier: rest.identifier
        outputParameters:
        - type: object
          mapping: $.
    - path: /v1/api/v3/organizations/{organisation}/ai/chat/stream
      name: api-v3-organizations-organisation-ai-chat-stream
      description: REST surface for api-v3-organizations-organisation-ai-chat-stream.
      operations:
      - method: POST
        name: chatinferencestream
        description: Chat inference via streaming endpoint (true HTTP streaming) with multimodal support
        call: quantcdn-ai-inference.chatinferencestream
        with:
          organisation: rest.organisation
          body: rest.body
        outputParameters:
        - type: object
          mapping: $.
    - path: /v1/api/v3/organizations/{organisation}/ai/embeddings
      name: api-v3-organizations-organisation-ai-embeddings
      description: REST surface for api-v3-organizations-organisation-ai-embeddings.
      operations:
      - method: POST
        name: embeddings
        description: Generate text embeddings for semantic search and RAG applications
        call: quantcdn-ai-inference.embeddings
        with:
          organisation: rest.organisation
          body: rest.body
        outputParameters:
        - type: object
          mapping: $.
    - path: /v1/api/v3/organizations/{organisation}/ai/image-generation
      name: api-v3-organizations-organisation-ai-image-generation
      description: REST surface for api-v3-organizations-organisation-ai-image-generation.
      operations:
      - method: POST
        name: imagegeneration
        description: Generate images with Amazon Nova Canvas
        call: quantcdn-ai-inference.imagegeneration
        with:
          organisation: rest.organisation
          body: rest.body
        outputParameters:
        - type: object
          mapping: $.
  - type: mcp
    namespace: quantcdn-ai-inference-mcp
    port: 9090
    transport: http
    description: MCP adapter for QuantCDN API — AI Inference. One tool per consumed operation, routed inline through this
      capability's consumes block.
    tools:
    - name: chat-inference-api-gateway-buffered
      description: Chat inference via API Gateway (buffered responses) with multimodal support
      hints:
        readOnly: false
        destructive: false
        idempotent: false
      call: quantcdn-ai-inference.chatinference
      with:
        organisation: tools.organisation
        body: tools.body
      outputParameters:
      - type: object
        mapping: $.
    - name: submit-client-tool-results-callback
      description: Submit Client Tool Results (Callback)
      hints:
        readOnly: false
        destructive: false
        idempotent: false
      call: quantcdn-ai-inference.submittoolcallback
      with:
        organisation: tools.organisation
        body: tools.body
      outputParameters:
      - type: object
        mapping: $.
    - name: get-durable-execution-status
      description: Get Durable Execution Status
      hints:
        readOnly: true
        destructive: false
        idempotent: true
      call: quantcdn-ai-inference.getdurableexecutionstatus
      with:
        organisation: tools.organisation
        identifier: tools.identifier
      outputParameters:
      - type: object
        mapping: $.
    - name: chat-inference-streaming-endpoint-true
      description: Chat inference via streaming endpoint (true HTTP streaming) with multimodal support
      hints:
        readOnly: false
        destructive: false
        idempotent: false
      call: quantcdn-ai-inference.chatinferencestream
      with:
        organisation: tools.organisation
        body: tools.body
      outputParameters:
      - type: object
        mapping: $.
    - name: generate-text-embeddings-semantic-search
      description: Generate text embeddings for semantic search and RAG applications
      hints:
        readOnly: true
        destructive: false
        idempotent: false
      call: quantcdn-ai-inference.embeddings
      with:
        organisation: tools.organisation
        body: tools.body
      outputParameters:
      - type: object
        mapping: $.
    - name: generate-images-amazon-nova-canvas
      description: Generate images with Amazon Nova Canvas
      hints:
        readOnly: false
        destructive: false
        idempotent: false
      call: quantcdn-ai-inference.imagegeneration
      with:
        organisation: tools.organisation
        body: tools.body
      outputParameters:
      - type: object
        mapping: $.