All tools

Headroom

v0.25.0 · Apache 2.0

The context compression layer for AI agents. Compresses tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers.

Quick Start

$ pip install "headroom-ai[all]"
$ headroom wrap claude

Location in QuantWise OS:

quantwiseiq-tools/headroom/

Integration Modes

Library

compress(messages) in Python or TypeScript

Proxy

headroom proxy --port 8787, zero code changes

Agent Wrap

headroom wrap claude|codex|cursor|aider

MCP Server

headroom_compress, headroom_retrieve, headroom_stats

Token Savings (Real Workloads)

WorkloadBeforeAfterSavings
Code search (100 results)17,7651,40892%
SRE incident debugging65,6945,11892%
GitHub issue triage54,17414,76173%
Codebase exploration78,50241,25447%

Agent Support

Claude CodeCodexCursorAiderCopilot CLIOpenClawLangChainAgno

Key Features

  • • 60-95% token reduction, same accuracy
  • • Reversible compression (CCR)
  • • Cross-agent shared memory
  • • headroom learn — auto-fixes from failures
  • • MCP server integration
  • • 27.1k GitHub stars

Tech Stack

PythonRustTypeScriptONNXHF Transformers