Triton Inference Server · Capability
NVIDIA Triton Inference Server HTTP/REST API: Model Repository
3 operations on the model repository: list models, load or reload a model, and unload a model. Lead operation: Triton Inference Server List Models in the Repository. Self-contained Naftiko capability covering one Triton business surface.
What You Can Do
POST /v1/v2/repository/index (Repositoryindex): Triton Inference Server List Models in the Repository
POST /v1/v2/repository/models/{model-name}/load (Modelload): Triton Inference Server Load or Reload a Model
POST /v1/v2/repository/models/{model-name}/unload (Modelunload): Triton Inference Server Unload a Model
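As a rough illustration of how these three operations might be called, the Python sketch below builds one POST request per path using only the standard library. The base URL, the empty JSON body, and the helper names (build_request, list_models, load_model, unload_model, send) are assumptions made for this example and are not part of the capability definition. Only the paths and the POST method come from the listing above.

```python
import json
import urllib.request
from typing import Optional
from urllib.parse import quote

# Assumption: the capability is served from this base URL; replace with your own.
BASE_URL = "http://localhost:8000"


def build_request(path: str, body: Optional[dict] = None) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying a JSON body."""
    # Assumption: an empty JSON object is an acceptable body when no options are given.
    data = json.dumps(body or {}).encode("utf-8")
    return urllib.request.Request(
        BASE_URL + path,
        data=data,
        method="POST",
        headers={"Content-Type": "application/json"},
    )


def list_models() -> urllib.request.Request:
    """Request for the 'List Models in the Repository' operation."""
    return build_request("/v1/v2/repository/index")


def load_model(model_name: str) -> urllib.request.Request:
    """Request for the 'Load or Reload a Model' operation."""
    # Percent-encode the name so it stays a single path segment.
    return build_request(f"/v1/v2/repository/models/{quote(model_name, safe='')}/load")


def unload_model(model_name: str) -> urllib.request.Request:
    """Request for the 'Unload a Model' operation."""
    return build_request(f"/v1/v2/repository/models/{quote(model_name, safe='')}/unload")


def send(req: urllib.request.Request):
    """Send a prepared request and decode a JSON response, if there is one."""
    with urllib.request.urlopen(req) as resp:
        raw = resp.read()
    return json.loads(raw) if raw else None
```

With a reachable server, send(list_models()) would issue the index call, and send(load_model("my-model")) would request a load, where "my-model" is a placeholder name. Keeping request construction separate from sending makes it easy to inspect the URL and method before anything goes over the network.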
MCP Tools
triton-inference-server-list-models (read-only): Triton Inference Server List Models in the Repository
triton-inference-server-load-reload: Triton Inference Server Load or Reload a Model
triton-inference-server-unload-model: Triton Inference Server Unload a Model
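An MCP client normally invokes a tool by sending a JSON-RPC 2.0 message with the method tools/call. The sketch below shows what such a message might look like for the tools listed above. The tool names come from this listing. The tool_call helper and the model_name argument key are hypothetical, since the listing does not document each tool's input schema.

```python
import json
from itertools import count
from typing import Optional

# Simple counter for JSON-RPC request ids (illustrative only).
_ids = count(1)


def tool_call(name: str, arguments: Optional[dict] = None) -> dict:
    """Build an MCP 'tools/call' request as a plain dict."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments or {}},
    }


# Read-only tool: no arguments assumed.
list_msg = tool_call("triton-inference-server-list-models")

# Hypothetical argument key "model_name"; check the tool's actual input schema.
load_msg = tool_call("triton-inference-server-load-reload", {"model_name": "my-model"})

print(json.dumps(load_msg, indent=2))
```

How the message is transported (stdio, HTTP, or another channel) depends on the MCP server setup and is outside the scope of this sketch.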