Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions docs/docs/providers/agents/index.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
description: |
Agents

APIs for creating and interacting with agentic systems.
APIs for creating and interacting with agentic systems.
sidebar_label: Agents
title: Agents
---
Expand All @@ -13,6 +13,6 @@ title: Agents

Agents

APIs for creating and interacting with agentic systems.
APIs for creating and interacting with agentic systems.

This section contains documentation for all available providers for the **agents** API.
24 changes: 12 additions & 12 deletions docs/docs/providers/batches/index.mdx
Original file line number Diff line number Diff line change
@@ -1,15 +1,15 @@
---
description: |
The Batches API enables efficient processing of multiple requests in a single operation,
particularly useful for processing large datasets, batch evaluation workflows, and
cost-effective inference at scale.
particularly useful for processing large datasets, batch evaluation workflows, and
cost-effective inference at scale.

The API is designed to allow use of openai client libraries for seamless integration.
The API is designed to allow use of openai client libraries for seamless integration.

This API provides the following extensions:
- idempotent batch creation
This API provides the following extensions:
- idempotent batch creation

Note: This API is currently under active development and may undergo changes.
Note: This API is currently under active development and may undergo changes.
sidebar_label: Batches
title: Batches
---
Expand All @@ -19,14 +19,14 @@ title: Batches
## Overview

The Batches API enables efficient processing of multiple requests in a single operation,
particularly useful for processing large datasets, batch evaluation workflows, and
cost-effective inference at scale.
particularly useful for processing large datasets, batch evaluation workflows, and
cost-effective inference at scale.

The API is designed to allow use of openai client libraries for seamless integration.
The API is designed to allow use of openai client libraries for seamless integration.

This API provides the following extensions:
- idempotent batch creation
This API provides the following extensions:
- idempotent batch creation

Note: This API is currently under active development and may undergo changes.
Note: This API is currently under active development and may undergo changes.

This section contains documentation for all available providers for the **batches** API.
4 changes: 2 additions & 2 deletions docs/docs/providers/eval/index.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
description: |
Evaluations

Llama Stack Evaluation API for running evaluations on model and agent candidates.
Llama Stack Evaluation API for running evaluations on model and agent candidates.
sidebar_label: Eval
title: Eval
---
Expand All @@ -13,6 +13,6 @@ title: Eval

Evaluations

Llama Stack Evaluation API for running evaluations on model and agent candidates.
Llama Stack Evaluation API for running evaluations on model and agent candidates.

This section contains documentation for all available providers for the **eval** API.
4 changes: 2 additions & 2 deletions docs/docs/providers/files/index.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
description: |
Files

This API is used to upload documents that can be used with other Llama Stack APIs.
This API is used to upload documents that can be used with other Llama Stack APIs.
sidebar_label: Files
title: Files
---
Expand All @@ -13,6 +13,6 @@ title: Files

Files

This API is used to upload documents that can be used with other Llama Stack APIs.
This API is used to upload documents that can be used with other Llama Stack APIs.

This section contains documentation for all available providers for the **files** API.
20 changes: 10 additions & 10 deletions docs/docs/providers/inference/index.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -2,12 +2,12 @@
description: |
Inference

Llama Stack Inference API for generating completions, chat completions, and embeddings.
Llama Stack Inference API for generating completions, chat completions, and embeddings.

This API provides the raw interface to the underlying models. Three kinds of models are supported:
- LLM models: these models generate "raw" and "chat" (conversational) completions.
- Embedding models: these models generate embeddings to be used for semantic search.
- Rerank models: these models reorder the documents based on their relevance to a query.
This API provides the raw interface to the underlying models. Three kinds of models are supported:
- LLM models: these models generate "raw" and "chat" (conversational) completions.
- Embedding models: these models generate embeddings to be used for semantic search.
- Rerank models: these models reorder the documents based on their relevance to a query.
sidebar_label: Inference
title: Inference
---
Expand All @@ -18,11 +18,11 @@ title: Inference

Inference

Llama Stack Inference API for generating completions, chat completions, and embeddings.
Llama Stack Inference API for generating completions, chat completions, and embeddings.

This API provides the raw interface to the underlying models. Three kinds of models are supported:
- LLM models: these models generate "raw" and "chat" (conversational) completions.
- Embedding models: these models generate embeddings to be used for semantic search.
- Rerank models: these models reorder the documents based on their relevance to a query.
This API provides the raw interface to the underlying models. Three kinds of models are supported:
- LLM models: these models generate "raw" and "chat" (conversational) completions.
- Embedding models: these models generate embeddings to be used for semantic search.
- Rerank models: these models reorder the documents based on their relevance to a query.

This section contains documentation for all available providers for the **inference** API.
4 changes: 2 additions & 2 deletions docs/docs/providers/safety/index.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
description: |
Safety
OpenAI-compatible Moderations API.
OpenAI-compatible Moderations API.
sidebar_label: Safety
title: Safety
---
Expand All @@ -13,6 +13,6 @@ title: Safety

Safety

OpenAI-compatible Moderations API.
OpenAI-compatible Moderations API.

This section contains documentation for all available providers for the **safety** API.
Original file line number Diff line number Diff line change
Expand Up @@ -244,6 +244,7 @@ async def create_response(self) -> AsyncIterator[OpenAIResponseObjectStream]:
messages=messages,
# Pydantic models are dict-compatible but mypy treats them as distinct types
tools=self.ctx.chat_tools, # type: ignore[arg-type]
parallel_tool_calls=self.parallel_tool_calls,
stream=True,
temperature=self.ctx.temperature,
response_format=response_format,
Expand Down
Loading