QuantCDN · Capability
QuantCDN API — AI Inference
QuantCDN API — AI Inference. 6 operations. Lead operation: Chat inference via API Gateway (buffered responses) with multimodal support. Self-contained Naftiko capability covering one Quantcdn business surface.
What You Can Do
POST
Chatinference
— Chat inference via API Gateway (buffered responses) with multimodal support
/v1/api/v3/organizations/{organisation}/ai/chat
POST
Submittoolcallback
— Submit Client Tool Results (Callback)
/v1/api/v3/organizations/{organisation}/ai/chat/callback
GET
Getdurableexecutionstatus
— Get Durable Execution Status
/v1/api/v3/organizations/{organisation}/ai/chat/executions/{identifier}
POST
Chatinferencestream
— Chat inference via streaming endpoint (true HTTP streaming) with multimodal support
/v1/api/v3/organizations/{organisation}/ai/chat/stream
POST
Embeddings
— Generate text embeddings for semantic search and RAG applications
/v1/api/v3/organizations/{organisation}/ai/embeddings
POST
Imagegeneration
— Generate images with Amazon Nova Canvas
/v1/api/v3/organizations/{organisation}/ai/image-generation
MCP Tools
chat-inference-api-gateway-buffered
Chat inference via API Gateway (buffered responses) with multimodal support
submit-client-tool-results-callback
Submit Client Tool Results (Callback)
get-durable-execution-status
Get Durable Execution Status
read-only
idempotent
chat-inference-streaming-endpoint-true
Chat inference via streaming endpoint (true HTTP streaming) with multimodal support
generate-text-embeddings-semantic-search
Generate text embeddings for semantic search and RAG applications
read-only
generate-images-amazon-nova-canvas
Generate images with Amazon Nova Canvas