
MushroomAgent

Distributed Embodied Intelligence

One mind, orchestrating everything. Thinking belongs to AI — sensing belongs to the world.

We break the physical boundaries of hardware with distributed architecture, orchestrating a cross-spatial "digital body" for AI.

Hear · Speak — real-time conversational context:
microphones and speakers form a responsive acoustic system for fluid agent interaction.

See · Remember — environmental semantic understanding:
cameras capture physical moments; AI interprets the environment in real time, giving intelligence the depth of "memory."

Act · Do — cross-device action orchestration:
hardware interfaces become virtual hands, turning cloud-based thought into precise physical action.

What is MushroomAgent?

MushroomAgent is a distributed agent runtime with two roles:

  • Agent — the decision side. It receives text, voice, video, device events, and other context, calls the model, decides what should happen next, and dispatches actions. The agent can run by itself on one machine with mushroom-agent start, so a separate node is optional.

  • Node — the device I/O side. A node collects information such as microphone audio, camera video, text, and local device events, forwards it to the agent, then executes actions returned by the agent. Those actions can be voice output, UI work, robot movement, or other device-specific behavior.
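The split between the two roles can be illustrated with a small in-process sketch. This is not MushroomAgent's actual API: the `Event`, `Action`, `Agent`, and `Node` classes and their methods are hypothetical names chosen for illustration, and the model call is replaced by a stub.

```python
from dataclasses import dataclass, field


@dataclass
class Event:
    """Input a node forwards to the agent (hypothetical shape)."""
    kind: str      # e.g. "text", "audio", "video", "device"
    payload: str


@dataclass
class Action:
    """Something the agent asks a node to do (hypothetical shape)."""
    kind: str      # e.g. "speak", "ui", "move"
    payload: str


class Agent:
    """Decision side: receives context and decides what happens next."""

    def decide(self, event: Event) -> list[Action]:
        # A real agent would call a model here; this stub only echoes text.
        if event.kind == "text":
            return [Action(kind="speak", payload=f"You said: {event.payload}")]
        return []


@dataclass
class Node:
    """Device I/O side: forwards input, then executes returned actions."""
    agent: Agent
    executed: list[Action] = field(default_factory=list)

    def handle_input(self, event: Event) -> None:
        for action in self.agent.decide(event):
            self.execute(action)

    def execute(self, action: Action) -> None:
        # Stand-in for voice output, UI work, or robot movement.
        self.executed.append(action)


node = Node(agent=Agent())
node.handle_input(Event(kind="text", payload="hello"))
print(node.executed[0].payload)
```

In this sketch the node never decides anything itself; it only forwards input and carries out whatever the agent returns, which mirrors the division of responsibility described above.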

MushroomAgent is for developers building systems that span platforms — chat bots, voice assistants, IoT controllers, and hardware-accelerated agents — without running separate instances per surface.

How it works

channel → communication → sensor → agent → think → skill

Input enters through channels: Feishu, Discord, HTTP, WebSocket. The sensor layer processes text, voice, and files into structured perception. The agent assembles context from conversation history, workspace files, tool results, and loaded Skills, then hands it to the think engine. The LLM decides what to do: reply, execute a tool, or dispatch a device action. Results flow back along the same path in reverse.
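The flow above can be sketched as a chain of small functions. This is a simplified illustration under stated assumptions, not the project's implementation: the function names, the `SKILLS` registry, and the "/name arg" command convention are all hypothetical, and the think step is a rule-based stand-in for the LLM.

```python
from typing import Callable

# Hypothetical skill registry: skill name -> callable.
SKILLS: dict[str, Callable[[str], str]] = {
    "echo": lambda arg: arg,
    "shout": lambda arg: arg.upper(),
}


def sensor(raw: str) -> dict:
    """Turn raw channel input into a structured perception."""
    return {"modality": "text", "content": raw.strip()}


def assemble_context(perception: dict, history: list[str]) -> dict:
    """Combine the new perception with prior conversation history."""
    return {"history": list(history), "input": perception["content"]}


def think(context: dict) -> dict:
    """Stand-in for the LLM: choose between a reply and a tool call."""
    text = context["input"]
    if text.startswith("/"):
        name, _, arg = text[1:].partition(" ")
        return {"type": "tool", "name": name, "arg": arg}
    return {"type": "reply", "text": f"Got: {text}"}


def run(raw: str, history: list[str]) -> str:
    """Push one input through sensor -> context -> think -> skill."""
    decision = think(assemble_context(sensor(raw), history))
    if decision["type"] == "tool":
        skill = SKILLS.get(decision["name"])
        result = skill(decision["arg"]) if skill else f"unknown skill: {decision['name']}"
    else:
        result = decision["text"]
    history.append(raw)  # remember the input for later turns
    return result
```

For example, `run("/shout hi", [])` takes the tool branch and dispatches to the `shout` entry, while plain text such as `run("hello", [])` takes the reply branch.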

Mode              Think location   Input/output location   Command
Standalone agent  This device      This device             mushroom-agent start
Agent + nodes     Agent host       Each attached node      mushroom-agent start on agent, mushroom-agent node attach on nodes


Quick start

Start with agent mode. The agent can run alone and is enough for local chat or local runtime use. Add node mode later when another device should collect voice/video/input and execute actions for that agent.

See Getting Started for the install command and the node-mode path.
