What is Unity?
Unity is an AI agent you can interrupt mid-task, redirect without restarting, talk to while it works, and run several things on at once. It’s voice-native, with memory that compounds over time.What makes it different
Most agent frameworks give you one loop: the model picks a tool, calls it, reads the result, picks the next. If you want to change course, you cancel and start over. Unity gives every operation its own loop and returns a live handle you can steer. These handles nest — the user steers the ConversationManager, the ConversationManager steers the Actor, the Actor steers the managers. Corrections, pauses, and queries propagate through the full depth. In practice:- “Also include Q2 numbers” mid-way through a report → the agent adjusts without restarting
- “Pause that, something urgent” → work freezes and resumes exactly where it left off
- “How’s the flight search going?” → status update without disrupting the work
- Three tasks running at once, each independently steerable
Architecture at a glance
Deep dives
Steerable Handles
The universal protocol for mid-flight control. Pause, resume, interject, and query any running operation at any depth.
CodeAct
The agent writes Python programs over typed primitives — not flat JSON tool calls. Real variables, loops, and control flow.
Dual-Brain Voice
A slow deliberation brain and a fast real-time voice agent, communicating over IPC. The agent keeps talking while working.
State Managers
Distributed domain managers communicating through English-language APIs. Each runs its own LLM tool loop.
Memory
Every 50 messages, conversations are consolidated into structured, queryable tables — contacts, knowledge, tasks, policies.
Running Tests
Real LLMs, cached responses, parallel execution. Never mocked.
