semantic-caching

Here are 5 public repositories matching this topic...

zzbright1998 / SentenceKV

Official implementation of "SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching" (COLM 2025). A novel KV cache compression method that organizes cache at sentence level using semantic similarity.

natural-language-processing transformers memory-efficiency efficient-inference inference-optimization kv-cache llm semantic-caching colm2025

Updated Sep 29, 2025
Python

renswickd / semantic-prompt-cache

Star

This app leverages Semantic Caching to minimize inference latency and reduce API costs by reusing semantically similar prompt responses.

optimization ttl-cache rag mistral-api semantic-caching

Updated Jul 4, 2025
Python

redis-developer / redis-movies-searcher-workshop

Star

This is a hands-on workshop to create the application Redis Movies Searcher

java docker redis full-text-search riot cache-aside vector-database vector-similarity-search semantic-caching

Updated Jun 19, 2025
Java

sensoris / semcache-node

Star

Node SDK for the Semcache API

node js openai llm semantic-caching

Updated Jun 18, 2025
JavaScript

sensoris / semcache-python

Star

Python library for the Semcache API

python ai openai llm anthropic semantic-caching

Updated Jun 9, 2025
Python

Improve this page

Add a description, image, and links to the semantic-caching topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the semantic-caching topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

semantic-caching

Here are 5 public repositories matching this topic...

zzbright1998 / SentenceKV

renswickd / semantic-prompt-cache

redis-developer / redis-movies-searcher-workshop

sensoris / semcache-node

sensoris / semcache-python

Improve this page

Add this topic to your repo