Skip to content

Conversation

@bcallender
Copy link
Contributor

  • Add FenicEmbeddingsResponse type
  • Update CachedResponse to support both completions and embeddings
  • Add ResponseType enum for type safety
  • Implement embedding cache key computation
  • Update SQLite schema to store embeddings
  • Update ModelClient to handle embedding caching
  • Add comprehensive tests for embedding caching

Copy link
Contributor Author

bcallender commented Nov 20, 2025

Warning

This pull request is not mergeable via GitHub because a downstack PR is open. Once all requirements are satisfied, merge this PR as a stack on Graphite.
Learn more

This stack of pull requests is managed by Graphite. Learn more about stacking.

@bcallender bcallender mentioned this pull request Nov 20, 2025
…to ensure cache is being used on actual queries
- Add FenicEmbeddingsResponse type
- Update CachedResponse to support both completions and embeddings
- Add ResponseType enum for type safety
- Implement embedding cache key computation
- Update SQLite schema to store embeddings
- Update ModelClient to handle embedding caching
- Add comprehensive tests for embedding caching
@bcallender bcallender force-pushed the feat/embedding-caching branch from c17aa9b to ff01bf4 Compare November 20, 2025 02:00
@bcallender bcallender force-pushed the feat/llm-caching branch 10 times, most recently from 422b203 to f696333 Compare December 4, 2025 03:24
Base automatically changed from feat/llm-caching to main December 4, 2025 03:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants