Scalable Inference Serving · Capability
KServe Open Inference Protocol API — Inference
KServe Open Inference Protocol API — Inference. 2 operations. Lead operation: Run Model Inference. Self-contained Naftiko capability covering one Scalable Inference Serving business surface.
What You Can Do
POST
Runinference
— Run Model Inference
/v1/v2/models/{model-name}/infer
POST
Runmodelversioninference
— Run Model Version Inference
/v1/v2/models/{model-name}/versions/{model-version}/infer
MCP Tools
run-model-inference
Run Model Inference
run-model-version-inference
Run Model Version Inference