Fastly AI Accelerator — Chat Completions
Fastly AI Accelerator semantic-caching proxy for LLM chat completions. Provides OpenAI- and Google Gemini-compatible endpoints served from the Fastly edge, returning cached responses for semantically similar prompts.
Fastly AI Accelerator — Chat Completions is a Naftiko capability published by Fastly, one of 73 capabilities the APIs.io network indexes for this provider. It bundles 3 operations across the POST method.
The capability includes 3 state-changing operations. Lead operation: Create OpenAI-compatible chat completion via Fastly AI Accelerator semantic cache. Can be deployed as a REST endpoint, MCP tool, or Agent Skill via Naftiko.
Tagged areas include Fastly, AI Accelerator, AI, LLM, and Semantic Caching.
What You Can Do
MCP Tools
openai-chat-completion
Create OpenAI-compatible chat completion via Fastly AI Accelerator semantic cache
gemini-generate-content
Generate Google Gemini content via Fastly AI Accelerator
openai-embeddings
Create OpenAI embeddings via Fastly AI Accelerator