Triton Inference Server · Capability
NVIDIA Triton Inference Server HTTP/REST API: Inference
Two operations. Lead operation: Triton Inference Server Run Inference on a Model. Self-contained Naftiko capability covering one Triton business surface.
What You Can Do
POST /v1/v2/models/{model-name}/infer
Modelinfer: Triton Inference Server Run Inference on a Model
POST /v1/v2/models/{model-name}/versions/{model-version}/infer
Modelversioninfer: Triton Inference Server Run Inference on a Specific Model Version
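A minimal sketch of how a client might build a request for either operation. The two path templates are copied from the listing above. Everything else is an assumption made for illustration: the base URL http://localhost:8000, the model name my_model, the tensor name INPUT0, and the request body, which follows the KServe v2 inference protocol that Triton's HTTP/REST API implements. The body this capability actually expects may differ.

```python
import json
import urllib.request
from urllib.parse import quote

# Path templates as they appear in the operation list.
INFER_PATH = "/v1/v2/models/{model-name}/infer"
INFER_VERSION_PATH = "/v1/v2/models/{model-name}/versions/{model-version}/infer"


def build_infer_request(base_url, model_name, inputs, model_version=None):
    """Return (url, body_bytes) for a POST to one of the two infer operations."""
    name = quote(model_name, safe="")
    if model_version is None:
        path = INFER_PATH.replace("{model-name}", name)
    else:
        version = quote(str(model_version), safe="")
        path = (INFER_VERSION_PATH
                .replace("{model-name}", name)
                .replace("{model-version}", version))
    # Assumed body layout: KServe v2 style {"inputs": [...]}.
    body = json.dumps({"inputs": inputs}).encode("utf-8")
    return base_url.rstrip("/") + path, body


def send_infer_request(url, body, timeout=10.0):
    """POST the JSON body and return the decoded JSON response (not called here)."""
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Hypothetical model and input tensor, for illustration only.
example_inputs = [
    {"name": "INPUT0", "shape": [1, 4], "datatype": "FP32",
     "data": [0.1, 0.2, 0.3, 0.4]},
]
url, body = build_infer_request(
    "http://localhost:8000", "my_model", example_inputs, model_version=2
)
print(url)
```

Omitting model_version targets the first operation and passing it targets the version-specific one. Building the request is kept separate from sending it so the URL and payload can be inspected without a running server.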
MCP Tools
triton-inference-server-run-inference: Triton Inference Server Run Inference on a Model
triton-inference-server-run-inference-2: Triton Inference Server Run Inference on a Specific Model Version