fix: enforce allowed_models during inference requests #4197
+126
−4
The `allowed_models` configuration was only being applied when listing models via the `/v1/models` endpoint, but the actual inference requests weren't checking this restriction. This meant users could directly request any model the provider supports by specifying it in their inference call, completely bypassing the intended cost controls.

The fix adds validation to all three inference methods (chat completions, completions, and embeddings) that checks the requested model against the `allowed_models` list before making the provider API call.
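A minimal sketch of what that guard could look like, assuming a provider config object with an optional `allowed_models` list; the names `ProviderConfig`, `check_model_allowed`, and `ModelNotAllowedError` are illustrative, not the exact identifiers in this PR:

```python
from dataclasses import dataclass
from typing import Optional


class ModelNotAllowedError(ValueError):
    """Raised when a request names a model outside allowed_models."""


@dataclass
class ProviderConfig:
    # None or an empty list means "no restriction", mirroring how the
    # /v1/models listing behaves when the option is not configured.
    allowed_models: Optional[list[str]] = None


def check_model_allowed(model: str, config: ProviderConfig) -> None:
    """Reject a requested model before the provider API call is made."""
    if config.allowed_models and model not in config.allowed_models:
        raise ModelNotAllowedError(
            f"Model '{model}' is not in allowed_models: {config.allowed_models}"
        )


if __name__ == "__main__":
    config = ProviderConfig(allowed_models=["small-model"])
    check_model_allowed("small-model", config)  # passes silently
    try:
        check_model_allowed("large-model", config)  # raises
    except ModelNotAllowedError as err:
        print(err)
```

Each of the three inference methods would call this guard before invoking the provider client, so a disallowed model is rejected even when the client never consults `/v1/models`.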
Test plan
Added unit tests
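A sketch of what such a test could look like, assuming the `check_model_allowed`, `ProviderConfig`, and `ModelNotAllowedError` names from the example above are importable (the tests added in this PR may be structured differently):

```python
import pytest


def test_disallowed_model_is_rejected():
    config = ProviderConfig(allowed_models=["allowed-model"])
    with pytest.raises(ModelNotAllowedError):
        check_model_allowed("other-model", config)


def test_unset_allowed_models_permits_any_model():
    # No allowed_models configured -> no restriction is enforced.
    check_model_allowed("any-model", ProviderConfig(allowed_models=None))
```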