Chat Completions API
Also known as: Chat API, Messages API, Conversational API
Definition
An API endpoint that accepts a conversation history (array of messages with roles) and returns the model's next response. This is the primary interface for conversational AI, supporting multi-turn dialogue, system prompts, and tool use. The format was introduced by OpenAI and adopted widely.
What this is NOT
- Not legacy completions API (that takes raw text, not messages)
- Not embedding API (different purpose)
- Not the model itself (this is the interface)
Alternative Interpretations
Different communities use this term differently:
llm-practitioners
The /v1/chat/completions endpoint (OpenAI) or equivalent that takes messages with roles (system, user, assistant, tool) and returns a completion. The standard interface for chat-based LLM interaction.
Sources: OpenAI Chat Completions documentation, Anthropic Messages API
Examples
- POST /v1/chat/completions with messages array
- Anthropic's /v1/messages (similar concept)
- Multi-turn conversation with alternating user/assistant messages
- System prompt + user message + tool definitions
Counterexamples
Things that might seem like Chat Completions API but are not:
- Legacy /v1/completions (raw text completion)
- Embedding API (different purpose)
- Image generation API
Relations
- specializes openai-api (Chat completions is part of the OpenAI API family)
- overlapsWith function-calling (Tool use is part of chat completions)
- overlapsWith streaming (Chat completions supports streaming)
Implementations
Tools and frameworks that implement this concept:
- Claude 3 primary
- Claude 3.5 primary
- Claude 4 primary
- Gemini primary
- GPT-4 primary
- GPT-4 Turbo primary
- GPT-4o primary
- Llama 3 secondary
- Mistral Large primary
- OpenAI primary