spranab/contextcache
A persistent key-value cache using content-hash addressing, designed for tool-augmented LLMs. Deduplicates identical tool call results across sessions, reducing redundant API calls and latency. Supports TTL-based expiry and pluggable storage backends.
What's novel
Tool-augmented LLMs repeatedly call the same tools with identical inputs across conversations, wasting tokens and API quota. ContextCache solves this with content-addressed storage — the same input always maps to the same cache key regardless of session. This is a purpose-built caching layer for agentic workflows, not a generic Redis wrapper.
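The content-addressing idea can be sketched in a few lines: hash a canonical serialization of the tool call, so identical inputs always yield the same key in any session. This is a minimal illustration, not ContextCache's actual API; the `cache_key` helper and its field names are hypothetical.

```python
import hashlib
import json

def cache_key(tool_name: str, args: dict) -> str:
    """Derive a deterministic cache key from a tool call.

    Serializing with sorted keys makes the hash independent of
    argument order, so the same call maps to the same key
    regardless of session. (Illustrative helper, not the real API.)
    """
    payload = json.dumps({"tool": tool_name, "args": args}, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()

# Identical inputs produce identical keys, even with different dict ordering.
k1 = cache_key("weather", {"city": "Oslo", "units": "metric"})
k2 = cache_key("weather", {"units": "metric", "city": "Oslo"})
assert k1 == k2
```

Because the key depends only on content, a result cached in one conversation is a warm hit in the next, which is what lets deduplication work across sessions.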
Code Analysis
A high-performance agentic router that uses a small local LLM with a NoPE (No Positional Embedding) architecture and disk-based KV caching to route tool calls in under 200ms without positional bias errors.
Strengths
Innovative use of NoPE architecture to solve positional embedding issues in multi-tool routing; highly optimized performance via SHA-256 content-addressed disk caching for instant warm starts; clean separation between routing logic and synthesis LLM.
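The content-addressed disk caching pattern credited above can be sketched as follows, combined with the TTL-based expiry the project advertises. This is a hedged illustration under assumed semantics; the `DiskCache` class, its layout, and its method names are hypothetical, not the project's implementation.

```python
import hashlib
import json
import time
from pathlib import Path

class DiskCache:
    """Minimal sketch of a content-addressed disk cache with TTL expiry.

    Each entry is stored in a file named by the SHA-256 of its key,
    so a warm start is a single file lookup with no index to rebuild.
    (Illustrative only, not ContextCache's real storage backend.)
    """

    def __init__(self, root: str, ttl_seconds: float = 3600.0):
        self.root = Path(root)
        self.root.mkdir(parents=True, exist_ok=True)
        self.ttl = ttl_seconds

    def _path(self, key: str) -> Path:
        digest = hashlib.sha256(key.encode("utf-8")).hexdigest()
        return self.root / digest

    def set(self, key: str, value: dict) -> None:
        # Store the write timestamp alongside the value for TTL checks.
        record = {"ts": time.time(), "value": value}
        self._path(key).write_text(json.dumps(record))

    def get(self, key: str):
        path = self._path(key)
        if not path.exists():
            return None
        record = json.loads(path.read_text())
        if time.time() - record["ts"] > self.ttl:
            path.unlink()  # expired entry: evict and report a miss
            return None
        return record["value"]
```

A pluggable backend would expose the same `get`/`set` surface over Redis, SQLite, or S3 instead of the filesystem.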
Weaknesses
Limited test coverage (mostly integration tests, no unit tests for core router logic); error handling relies on generic exceptions without detailed logging or recovery strategies; dependency on external GGUF model files which complicates deployment.
Score Breakdown
Signal axes: Innovation · Craft · Traction · Scope · Evidence (individual scores not captured)
Commits: 17 · Contributors: 1 · Files: 109 · Active weeks: 3
Repository
Language: Python · Stars: 18 · Forks: 0 · License: NOASSERTION