The Open Infrastructure for Physical Intelligence
Grounding AI Agents into the Physical World.
English • 中文 • Architecture • Quick Start • Docs
Teach Once. Embody Anywhere. Evolve Continuously.
ROSClaw is not another chatbot framework. It is not a thin LLM-to-ROS wrapper. It is not a collection of random robotics tools.
ROSClaw is an open infrastructure layer for Physical Intelligence: a runtime that connects AI agents, robot embodiments, simulation sandboxes, skill systems, multimodal providers, physical memory, and self-evolution loops into one coherent operating layer.
It is designed for the next generation of embodied agents that must not only reason, but also act safely, remember physically, recover from failure, and improve over time.
┌──────────────────────────────────────────────────────────────┐
│ External Cognitive Brains │
│ OpenClaw / Claude / GPT / Qwen / Custom Agents │
└───────────────────────────┬──────────────────────────────────┘
│ MCP / SDK / AgentContext
▼
┌──────────────────────────────────────────────────────────────┐
│ ROSClaw Runtime │
│ AgentContext │ TaskContext │ SkillContext │ Trace │
└───────────────────────────┬──────────────────────────────────┘
│
┌───────────────────┼───────────────────┐
▼ ▼ ▼
┌───────────────┐ ┌───────────────┐ ┌───────────────┐
│ Provider │ │ Sandbox │ │ Darwin │
│ Capability │ │ e-URDF / │ │ Benchmark / │
│ Router │ │ MuJoCo / │ │ Regression / │
│ │ │ Firewall │ │ Evaluation │
└───────┬───────┘ └───────┬───────┘ └───────────────┘
│ │
└───────────┬───────┘
▼
┌──────────────────────────────────────────────────────────────┐
│ Physical World / Simulator │
│ UR5e / G1 / Go2 / RealSense / IoT / MuJoCo │
└───────────────────────────┬──────────────────────────────────┘
│
▼
┌──────────────────────────────────────────────────────────────┐
│ Practice Capture │
│ Unified Timeline / MCAP / JSONL / Video / Events │
└───────────────────────────┬──────────────────────────────────┘
│
▼
┌──────────────────────────────────────────────────────────────┐
│ SeekDB Knowledge Plane │
│ Robot │ Skill │ Provider │ Episode │ Failure │ Evidence │
└───────────────┬────────────────────────────┬─────────────────┘
│ │
▼ ▼
┌───────────────────────┐ ┌───────────────────────────────┐
│ Memory │ │ Know │
│ Spatiotemporal │ │ Physical-AI Knowledge │
│ Failure / Success │ │ Compiler │
│ Pattern / Causal │ │ TaskCard / Pattern / Evidence│
└───────────┬───────────┘ └───────────────┬───────────────┘
│ │
└──────────────┬───────────────────┘
│
▼
┌──────────────────────────────┐
│ How ←→ Auto │
│ Runtime Intervention │
│ Self-Evolution Control │
│ Proposal / Patch / Champion │
└──────────────┬───────────────┘
│
▼
┌─────────────────┐
│ Skill Registry │
│ Versioned / │
│ Champion / │
│ Rollback-safe │
└─────────────────┘
Large language models can plan, write code, and reason over symbols. But physical intelligence requires more than tokens.
A physical agent must understand:
- What body it has;
- What sensors and actuators it owns;
- What actions are safe;
- What happened during execution;
- Why a skill failed;
- How to recover;
- How to improve the skill without breaking safety.
ROSClaw provides the missing infrastructure between high-level AI agents and the physical world.
Every physical action should be grounded, validated, recorded, remembered, and improved.
The full closed loop:
Physical Task
↓
Agent Intent
↓
Capability Provider
↓
Sandbox / Firewall Validation
↓
Runtime Execution
↓
Praxis Capture
↓
Spatiotemporal Memory
↓
Runtime Intervention (How)
↓
Knowledge Compilation (Know)
↓
Auto Evolution
↓
Champion Skill
↓
Safer Physical Task
graph TD
subgraph Agents[Any MCP-Compatible Agent]
CC[Claude Code]
OC[OpenClaw]
AC[AutoGen / Custom]
end
subgraph Runtime[ROSClaw Runtime]
MCP[MCP Hub]
EB[Event Bus]
FW[Digital Twin Firewall]
TM[Unified Timeline]
MEM[Spatiotemporal Memory]
SK[Skill Manager]
HO[How Intervention]
AU[Auto Evolution]
DW[Darwin Evaluator]
end
subgraph Infra[Infrastructure]
EURDF[e-URDF Zoo]
SKDB[SeekDB Knowledge Plane]
end
subgraph HW[Hardware Layer]
G1[Unitree G1]
UR5[UR5e Arm]
GO[Go2]
Other[Your Robot]
end
CC & OC & AC <-->|MCP| MCP
MCP <-->|Events| EB
EB <-->|Validate| FW
EB <-->|Record| TM
EB <-->|Store| MEM
EB <-->|Execute| SK
EB <-->|Intervene| HO
EB <-->|Evolve| AU
EB <-->|Evaluate| DW
FW <-->|Model| EURDF
MEM <-->|Persist| SKDB
SK -->|Dispatch| HW
AU -->|Promote| SK
DW -->|Report| AU
Key Insight: All modules communicate exclusively through the EventBus. No direct module-to-module calls. This ensures complete decoupling and enables any agent to connect without hardware-specific knowledge.
| Module | Role |
|---|---|
rosclaw-runtime |
The lifecycle manager for the whole system. Owns configuration, plugins, health checks, event routing, and runtime orchestration. |
e-urdf-zoo |
The Physical DNA Registry. Defines robot embodiment, kinematics, dynamics, sensors, safety limits, capabilities, and simulation assets. |
rosclaw-provider |
The capability provider layer. Turns LLMs, VLMs, VLAs, VLNs, world models, skill policies, critics, and embeddings into routable physical capabilities. |
rosclaw-sandbox |
The physical simulation, validation, replay, and safety layer. Its firewall mode validates actions before execution. |
rosclaw-practice |
The praxis capture engine. Records unified physical timelines with sensorimotor traces, model decisions, tool calls, events, MCAP, and replay artifacts. |
rosclaw-memory |
The spatiotemporal memory system. Stores physical failures, success patterns, scene memory, trajectory memory, and causal experience graphs. |
rosclaw-how |
The runtime intervention controller. Injects minimal, evidence-backed guidance when an agent is stuck, unsafe, or regressing. |
rosclaw-know |
The physical-AI knowledge compiler. Turns papers, code, logs, trajectories, benchmark traces, and failures into structured engineering knowledge. |
rosclaw-auto |
The self-evolution control plane. Generates proposals, patches skills, runs experiments, evaluates candidates, promotes champions, and records dead ends. |
rosclaw-darwin |
The evaluation and evolution arena. Provides benchmark pressure, multi-seed validation, regression tests, and skill evaluation. |
rosclaw-forge |
The embodied asset compiler. Turns SDKs, ROS 2 interfaces, docs, and e-URDF profiles into MCP servers, skills, provider manifests, and asset bundles. |
rosclaw-dashboard |
The observability layer for runtime health, traces, sandbox replay, memory, interventions, and skill evolution. |
ROSClaw does not expose raw robot APIs directly to an LLM. Every action is grounded through robot embodiment, capability schemas, safety limits, and runtime context.
Token Intent → Capability Request → Safety Validation → Physical Execution
ROSClaw treats robot embodiment as a first-class system primitive. An e-URDF profile defines:
- Robot structure;
- Joints, links, sensors, actuators;
- Safety envelopes;
- Tool frames;
- Workspace limits;
- Capabilities;
- Simulation assets;
- Benchmark metadata.
This allows the same skill to be adapted, validated, and transferred across different robot bodies.
The rosclaw-sandbox module provides a simulation-first validation layer. Its firewall mode can block risky actions before they reach hardware.
Possible decisions:
ALLOW
BLOCK
MODIFY
REQUIRE_HUMAN_CONFIRMATION
Example result:
{
"decision": "BLOCK",
"risk_score": 0.92,
"reason": "Predicted collision between wrist_link and table",
"violated_constraints": ["collision", "workspace_boundary"],
"replay_id": "sandbox://replays/firewall_00042"
}ROSClaw records physical execution as structured praxis, not just logs. A single run can include:
- Robot state;
- Sensor snapshots;
- Action trace;
- Provider trace;
- Sandbox decision;
- Skill execution;
- Critic result;
- MCAP;
- Replay;
- Failure report.
rosclaw-how acts as a runtime reflex layer. When an agent is stuck, unsafe, invalid-heavy, or regressing, it can provide minimal, evidence-backed interventions such as:
- Safety constraints;
- Feasibility repair;
- Stabilizing hints;
- Next experiment suggestions;
- Recovery instructions.
rosclaw-auto turns repeated failures into structured improvement cycles:
FailureCase
↓
Diagnosis
↓
Hypothesis
↓
Proposal
↓
Patch
↓
Sandbox Experiment
↓
Darwin Evaluation
↓
Champion / DeadEnd
A skill is not overwritten blindly. It is versioned, evaluated, promoted, and rollback-safe.
Skill promotion pipeline:
pick_cube@v1.0.0 baseline_champion
↓
pick_cube@candidate_0001 sandbox_passed
↓
pick_cube@v1.1.0 sim_champion
↓
pick_cube@v1.1.0 sandbox_champion
↓
pick_cube@v1.1.0 real_candidate
↓
pick_cube@v1.1.0 real_champion
git clone https://github.com/ros-claw/rosclaw.git
cd rosclawbash scripts/install.shOr install in editable mode:
pip install -e .See INSTALL.md for detailed instructions.
./rosclaw doctorExpected output:
runtime: HEALTHY
event_bus: HEALTHY
seekdb: HEALTHY
provider: HEALTHY
sandbox: HEALTHY
practice: HEALTHY
memory: HEALTHY
how: HEALTHY
auto: HEALTHY
darwin: HEALTHY
dashboard: HEALTHY
./rosclaw startOr programmatically:
from rosclaw.core import Runtime, RuntimeConfig
config = RuntimeConfig(
robot_id="ur5e",
robot_zoo_path="./e-urdf-zoo",
default_eurdf_robot="ur5e",
enable_firewall=True,
enable_memory=True,
enable_practice=True,
enable_how=True,
enable_auto=True,
enable_darwin=True,
)
runtime = Runtime(config)
runtime.initialize()
runtime.start()./rosclaw robot list
./rosclaw robot inspect ur5e./rosclaw sandbox validate ur5e
./rosclaw sandbox run --robot ur5e --world tabletop --task reach./rosclaw firewall check \
--robot ur5e \
--world tabletop \
--action examples/actions/unsafe_reach.jsonExample Claude Code MCP configuration:
{
"mcpServers": {
"rosclaw": {
"command": "python3",
"args": ["-m", "rosclaw.mcp.minimal_server"],
"env": {
"PYTHONPATH": "src"
}
}
}
}Exposes tools such as: move_joints, grasp, get_robot_state, validate_trajectory, emergency_stop, query_world_objects, get_scene_graph, cognitive_search, system.list_robots, system.run_sandbox_task, system.query_practice, system.query_memory.
./rosclaw demo tabletop-grasp --robot-id ur5eWhat happens:
1. Agent receives task: "pick up the red cup"
2. Provider routes to perception and skill capabilities
3. Memory retrieves similar grasping experience
4. Skill provider generates grasp plan
5. Sandbox validates candidate motion
6. Runtime executes safe action
7. Practice records the full physical timeline
8. Critic evaluates success or failure
9. Memory stores the result
10. How generates recovery guidance if needed
11. Auto proposes a skill improvement after repeated failures
12. Darwin evaluates the candidate skill
13. A champion skill is promoted if it passes all gates
ROSClaw follows a strict safety boundary:
No model output should directly control a robot.
All physical execution must pass through:
Provider Schema
↓
e-URDF Constraints
↓
Sandbox / Firewall
↓
Runtime Guard
↓
Robot Controller
Hard rules:
- VLA outputs are proposals, not raw motor commands.
- World models are neural previews, not safety proofs.
- MCP is an agent tool interface, not a real-time control bus.
- Auto-generated skills must pass sandbox validation before execution.
- Code patches require human approval before production use.
- Safety configuration patches require human approval.
- Every champion skill must be rollback-safe.
ROSClaw treats skills as versioned physical assets with full lineage tracking.
Promotion is gated by six evaluation gates:
| Gate | Check |
|---|---|
| Success Improvement | Candidate success rate > baseline + threshold |
| Safety Regression | No increase in collision or safety events |
| Multi-Seed Validation | Passes on seeds [0, 1, 2, ...] |
| Sandbox Clearance | Firewall decision == ALLOW |
| Regression Suite | No degradation on existing tasks |
| Human Approval | Required for code patches and safety config |
Example CLI:
# Initialize an auto task (required before running)
./rosclaw auto init --task pick_cube --skill reach --type skill_tuning
# Run auto evolution experiment
./rosclaw auto run --task pick_cube --rounds 50
# Check evolution status
./rosclaw auto status
# List current champions
./rosclaw skill champions list
# Show skill lineage
./rosclaw skill lineage pick_cube
# Rollback if needed
./rosclaw skill rollback pick_cube --to v1.0.0ROSClaw includes an embodied asset compiler:
SDK / ROS 2 Interfaces / Docs / e-URDF
↓
rosclaw-forge
↓
MCP Server + Skill Manifest + Provider Manifest + Tests + ClawHub Metadata
Example:
./rosclaw forge sdk-to-mcp \
--name unitree_go2 \
--sdk-docs ./docs/unitree_go2_sdk.md \
--output ./generated/unitree_go2_bundleValidate generated assets:
./rosclaw forge validate ./generated/unitree_go2_bundleInstall to staging:
./rosclaw forge install ./generated/unitree_go2_bundle --stagingrosclaw/
├── src/rosclaw/ # Core runtime, schemas, CLI, MCP gateway
│ ├── core/ # Runtime, EventBus, lifecycle
│ ├── schemas/ # Unified canonical dataclasses
│ ├── provider/ # Capability provider layer
│ ├── sandbox/ # MuJoCo simulation & firewall
│ ├── practice/ # Timeline capture & MCAP
│ ├── memory/ # Spatiotemporal memory
│ ├── how/ # Runtime intervention
│ ├── know/ # Knowledge compiler
│ ├── auto/ # Self-evolution control plane
│ ├── darwin/ # Benchmark & evaluation arena
│ ├── forge/ # Asset compiler
│ ├── dashboard/ # Observability & WebSocket
│ └── mcp/ # MCP server implementation
├── e-urdf-zoo/ # Physical DNA registry
├── docs/ # Architecture, RFCs, usage guides
├── examples/ # Robot and simulation examples
├── tutorials/ # Step-by-step tutorials
├── tests/ # Unit, integration, E2E, safety tests
├── benchmarks/ # Benchmark and evaluation tasks
├── acceptance/ # Release acceptance tests
├── scripts/ # Install and utility scripts
├── rosclaw.yaml # Default runtime config
├── docker-compose.yml # Optional local services
├── ARCHITECTURE.md # 14 Engineering Iron Rules
├── QUICKSTART.md # Quick start guide
└── INSTALL.md # Installation details
Example rosclaw.yaml:
runtime:
robot_id: ur5e
safety_level: strict
event_bus:
backend: local
knowledge_plane:
backend: seekdb
path: .rosclaw/seekdb
object_store:
backend: local
path: .rosclaw/artifacts
sandbox:
enabled: true
backend: mujoco
firewall_mode: true
provider:
enabled: true
practice:
enabled: true
mcap: true
memory:
enabled: true
how:
enabled: true
cooldown_window: 3
evidence_trace_enabled: true
auto:
enabled: true
allow_code_patch: false
require_human_approval: true
trigger_failure_threshold: 3
darwin:
enabled: true
seeds: [0, 1, 2]
episodes: 50
metrics: [success_rate, collision_rate, completion_time]- Runtime and plugin architecture
- e-URDF physical embodiment registry
- MCP-compatible agent runtime
- Capability provider layer
- MuJoCo sandbox and firewall mode
- Practice timeline capture (MCAP / JSONL)
- SeekDB-backed spatiotemporal memory
- How runtime intervention (v1.5)
- Know physical-AI knowledge compiler
- Auto self-evolution control plane
- Darwin benchmark & evaluation arena
- Skill Registry with champion/lineage/rollback
- Forge SDK-to-MCP asset compiler
- Dashboard observability (WebSocket + HTTP API)
- End-to-end physical intelligence demos
- Unified schema package (
rosclaw.schemas) - ARCHITECTURE.md — 14 Engineering Iron Rules
- Isaac Sim backend
- Multi-robot collaborative sandbox
- Advanced DDS reflex handshake
- LeRobot / RLDS dataset export
- OpenVLA and Cosmos provider integration
- Darwin benchmark leaderboard
- ClawHub skill and provider marketplace
- Real-world long-horizon inspection demos
- Embodied agent evaluation
- Skill learning and refinement
- Simulation-to-real validation
- Multi-modal robot memory
- Benchmark-driven evolution
- Robot skill packaging and versioning
- Safe LLM-to-robot execution
- Digital twin pre-validation
- Inspection and manipulation workflows
- Failure replay and root-cause analysis
- MCP-compatible physical tools
- Capability routing and abstraction
- Agent safety guardrails
- Runtime intervention
- Self-improving skill systems
Run tests:
PYTHONPATH=src pytest tests -vRun end-to-end pipeline:
PYTHONPATH=src pytest tests/test_e2e_full_pipeline.py -vRun architecture checks:
./rosclaw doctor --ros2ROSClaw welcomes contributors building the open infrastructure for Physical Intelligence.
Good first contribution areas:
- e-URDF profiles for new robots;
- MCP servers for robot SDKs;
- Capability providers for perception, action, navigation, and verification;
- Sandbox tasks and worlds;
- Skill packages;
- Benchmark tasks;
- Documentation and tutorials.
Please read CONTRIBUTING.md before submitting a pull request.
ROSClaw is research infrastructure for physical AI and embodied agents.
Always test in simulation before running on real hardware. Use emergency stop systems, workspace boundaries, safety-rated controllers, and human supervision when deploying to physical robots.
ROSClaw does not replace certified industrial safety systems.
If you use ROSClaw in your research, please consider citing:
@software{rosclaw2026,
title = {ROSClaw: Open Infrastructure for Physical Intelligence},
author = {ROSClaw Contributors},
year = {2026},
url = {https://github.com/ros-claw/rosclaw}
}If you use the Genesis simulator with ROSClaw, please also cite:
@article{genesis2026,
title = {Genesis: A Generative Physics Engine for General Purpose Robotics},
author = {Genesis Authors},
journal = {arXiv preprint},
year = {2026},
url = {https://arxiv.org/abs/2604.04664}
}This project is released under the MIT License. See LICENSE.
- Website: https://www.rosclaw.io/
- GitHub: https://github.com/ros-claw/rosclaw
- Documentation: docs/
- Quick Start: QUICKSTART.md
- Architecture: ARCHITECTURE.md