Project
Inferoa
An inference-native agent harness for long-horizon tokenmaxxing.
- Protects prefix-cache discipline, bounded context, and deterministic tool schemas.
- Optimizes coding and research sessions through goal, plan, autoresearch, and tokenmaxxing modes.
- Routes across self-hosted vLLM models and external frontier models under explicit pressure.