Skip to content
Research Hub/Autonomous Agent Runtimes & Claws

Autonomous Agent Runtimes & Claws

Persistent AI entities that live on your machine and act for days

TL;DR

Claws sit above agents in Karpathy's stack: persistent, identity-stable, sandboxed runtimes. OpenClaw hit 350K+ stars in 60 days; founder Steinberger joined OpenAI Feb 14, 2026, the foundation took stewardship. SOUL.md + MEMORY.md are the identity primitives. NVIDIA NemoClaw and Trend Micro's threat model define the security frontier.

Updated 2026-06-2223 sources validated

Research briefs like this — one per week. Validated sources, no filler.

Subscribe

350K+

OpenClaw GitHub stars (Apr 2026 snapshot)

Star History / The New Stack

60 days

Time to surpass React's 10-year star record

The New Stack

23+

LLM providers supported by SwarmClaw runtime

swarmclawai/swarmclaw

~4K vs ~434K

NanoClaw lines vs OpenClaw lines — the forkability tradeoff

Karpathy / nanoclaw.dev

01

What Is a Claw (and Why It's Not Just an Agent)

Karpathy's March 2026 framing on the No Priors podcast and follow-up X threads places claws as a distinct layer above LLM agents — the way agents sit above raw LLMs. A claw owns its sandbox, persists identity across sessions, schedules its own work, and communicates over messaging protocols rather than a chat UI. Where an agent executes a task, a claw resides on a machine. Karpathy's own claw "Dobby" runs his smart home over WhatsApp. The framing is still settling — "claw" was popularized by OpenClaw (originally Clawdbot, renamed January 2026 after a trademark complaint), and Karpathy adopted and abstracted the term.

LLM

Base

Stateless next-token prediction

Agent

Layer 2

Task-scoped, ephemeral execution with tools

Claw

Karpathy Mar 2026

Persistent, identity-stable, sandboxed runtime with memory, scheduling, and messaging interface

Forkable claw

Design philosophy

Karpathy's preference — minimal core (~4K LOC) plus skills the agent uses to modify its own repo

02

OpenClaw — The Anchor Project

Peter Steinberger (Austrian developer, ex-PSPDFKit) first published the project November 2025 as Clawdbot, renamed to OpenClaw in late January 2026. The architecture: a local-first Node.js daemon (the "Gateway") that maintains persistent connections to 20+ messaging platforms, routes channels to isolated per-agent workspaces, and reads two human-editable markdown files on every wake — SOUL.md (the agent's character spec / behavioral context) and MEMORY.md (an append-only continuity log). On Feb 14, 2026 Steinberger announced he was joining OpenAI to drive next-gen personal agents; the OpenClaw Foundation was established to hold stewardship and OpenAI committed to sponsoring it as MIT-licensed OSS. Star trajectory: 9K on launch day, 60K by day 3, 190K within two weeks, 250K by Mar 3, 2026, 350K+ by Apr 8 — faster than React, Linux, or Kubernetes.

SOUL.md

Pattern

Identity primitive — plain markdown the agent injects into the system prompt on every wake. Personality, opinions, behavioral constants. Persistent context file, not a consciousness claim.

MEMORY.md

Pattern

Append-only continuity log; human-readable, Git-trackable, ~100-line curated tier-1 context.

Messaging UI

Interface

Signal, Telegram, WhatsApp, Discord, Slack, iMessage, Matrix, IRC, Teams (~20 total).

Pluggable LLM

Runtime

Provider-agnostic. Claude, GPT, DeepSeek, Gemini, OpenRouter, Ollama for local.

OpenClaw Foundation

Governance

Stewardship non-profit established Feb 2026; MIT license; OpenAI is named sponsor.

03

The Claw Ecosystem

The OpenClaw moment spawned a sub-ecosystem in weeks. Identity files split into specialized concerns; runtimes forked into lighter and heavier variants; vendors layered enterprise controls on top.

aaronjmars/soul.md

Identity

Personality builder. Pipeline ingests blogs, podcasts, X posts and distills into SOUL.md + STYLE.md + MEMORY.md the agent embodies.

swarmclawai/swarmclaw

Runtime

Self-hosted multi-agent runtime; 23+ LLM providers; MCP gateway with tool namespacing; npm and Docker install.

TechNickAI/openclaw-config

Skill pack

OpenClaw on Claude Code. Three-tier memory (MEMORY.md tier-1, daily journals tier-2, semantic vector store tier-3), /fleet command for multi-machine.

ClaudeClaw (Craddock)

Pattern

Medium playbook (Mar 2026) using `claude -p` headless mode + session resume to map messaging threads onto Claude Code processes.

NanoClaw

Minimal core

Karpathy-endorsed ~4K-LOC minimal claw. Skills modify the repo to add features rather than ship them.

NVIDIA NemoClaw

Vendor stack

Enterprise security layer announced at GTC Mar 16, 2026. Kernel sandbox, Rust egress policy engine, privacy router that keeps sensitive context on-device.

memsearch (Zilliz/Milvus)

Memory

Open-sourced Mar 12, 2026. Extracted OpenClaw's memory subsystem into a standalone library. Markdown logs + hybrid vector search + Git.

clawdocs.org

Docs

Community docs. SOUL.md / IDENTITY.md / USER.md / AGENTS.md / TOOLS.md file-concern separation guide.

04

Adjacent: Autonomous Coding Agents

Claws share DNA with autonomous coding agents but the scope is different. A coding agent is task-bounded: open a PR, fix a bug, ship a feature, then exit. A claw resides — it answers messages on Sunday morning, runs cron jobs Tuesday night, edits its own MEMORY.md before sleep. Compare on persistence and interface, not just capability.

Claude Code + Agent SDK

Task-scoped

Anthropic's CLI / SDK. Task-scoped by default but ClaudeClaw and openclaw-config wrap it into claw shape.

OpenHands (ex-OpenDevin)

Task-scoped

Open-source agent platform; ~72% SWE-bench Verified with Claude under CodeAct. Self-hostable.

Aider

Inline

Pair-programmer at the terminal. Inline, interactive — explicitly not autonomous.

Cursor Composer / Windsurf / Cline

IDE

IDE-embedded agents. Human-in-the-loop per decision.

Cognition Devin

Task-scoped

Commercial autonomous SWE; sandboxed browser/terminal/editor; ~$500/mo for teams.

Princeton SWE-agent

Research

Research framework optimized for benchmark eval, not daily use.

Replit Agent

IDE

Cloud-native build agent inside the Replit IDE.

05

The SOUL.md Pattern as Architectural Primitive

The through-line across the ecosystem: agent state as plain human-editable markdown, version-controllable with Git, separated by concern. SOUL.md holds the character spec (what the agent embodies internally); IDENTITY.md holds the public-facing name/avatar (3-5 lines max); USER.md holds user facts; AGENTS.md holds rules and guardrails; TOOLS.md holds environment notes; MEMORY.md holds learned patterns and continuity. The pattern is load-bearing because it makes the agent's behavior auditable and debuggable by a human reading a file — not a vector store inspection ritual. memsearch then indexes those files into Milvus for semantic recall without ever becoming the source of truth. The files stay primary, the vectors are a derived index.

06

Risks & Security Posture

The same properties that make claws useful — persistence, messaging ingestion, tool access, autonomy — invert into the attack surface. Trend Micro's March 2026 "CISOs in a Pinch" analysis and the ISACA "Security Claw" piece (Richard Beck, April 2026) both flag the gap between deployment pace and control regimes. Documented vectors: the "Good Morning" attack (innocent-looking link contains hidden instructions the agent ingests as context), persistent memory poisoning (malicious prompt embedded in a benign email that triggers weeks later), emotional manipulation / guilt-tripping the model into self-sabotage, supply-chain compromise of skills, and exposed Gateway instances with RCE. Karpathy publicly noted hesitation about running OpenClaw specifically because of the "vibe-coded 400K lines being actively attacked at scale." NVIDIA's NemoClaw responds with kernel-level sandboxing and a Rust policy engine that lives outside the agent's reach — guardrails that prompt injection cannot rewrite.

07

Sovereign Angle — Your Claw on Your Silicon

A claw running on hardware you own, with memory in markdown files you control, talking to local models via Ollama, is the convergence point with the other three domains: home-ai-labs (the rig), ai-chips-silicon (the inference substrate), sovereign-ai-compute (the policy stack). The privacy router pattern NemoClaw introduced — keep sensitive context on-device, only reach for frontier models when policy allows — formalizes the hybrid. memsearch makes the memory layer portable. SOUL.md makes the identity layer portable. The claw becomes the user-facing surface of sovereign AI.

Key Findings

1

OpenClaw passed 250K GitHub stars by Mar 3, 2026 — beating React's 10-year record in 60 days; 350K+ by April.

2

Founder Peter Steinberger joined OpenAI on Feb 14, 2026; an MIT-licensed OpenClaw Foundation took stewardship with OpenAI as sponsor.

3

Karpathy's March 2026 framing positions claws as a layer above agents — persistent, identity-stable, sandboxed — but he publicly prefers minimal forkable cores (NanoClaw ~4K LOC) over the OpenClaw monolith (~434K LOC).

4

The SOUL.md + MEMORY.md pattern (plain markdown, Git-trackable, human-editable) is becoming the de facto identity primitive across the ecosystem — adopted by aaronjmars/soul.md, TechNickAI/openclaw-config, ClaudeClaw, and the clawdocs.org community docs.

5

Vendor consolidation moved within 60 days: NVIDIA NemoClaw (GTC Mar 16, 2026) for enterprise security; Zilliz memsearch (Mar 12, 2026) for the memory layer; SwarmClaw for multi-agent runtime with 23+ providers.

6

Trend Micro and ISACA both published 2026 threat assessments — "Good Morning" link-injection, persistent memory poisoning, emotional manipulation, and exposed Gateway RCE are the documented vectors. Karpathy himself flagged supply-chain risk in OpenClaw skills.

7

The category gap is persistence: Claude Code, OpenHands, Aider, Devin, Cursor Composer are task-scoped agents. A claw is what you get when an agent resides on a machine, holds identity across days, and uses messaging as the UI.

Research Transparency

Limitations

  • The field is six months old. Star counts, foundation governance, and vendor positioning are still moving weekly.
  • Terminology drift is unresolved. "Claw" was coined by Steinberger's project, abstracted by Karpathy, and is now applied to runtimes that share little architecture beyond a messaging UI and a persistent file.
  • Karpathy-vs-Steinberger framing ambiguity. Karpathy publicly prefers NanoClaw's minimal forkable core and has flagged security concerns about running OpenClaw specifically — the spec is still being contested by its most prominent commentators.
  • Many sources are vendor blogs, viral-moment journalism, or Medium pieces with measurable substance but limited peer review. Cross-referencing matters.

What We Don't Know

  • ?Long-term reliability of self-editing markdown memory under months of autonomous append-only writes — does identity drift compound?
  • ?How claws hold up under sustained adversarial inbox traffic. Trend Micro's threat model is theoretical-plus-demos; we lack longitudinal incident data.
  • ?Jurisdictional and liability frameworks for autonomous-spending claws (payments, scheduling, contract signing). Foundation governance is set; legal regimes are not.
Evidence Grade:Grade B(Industry reports from credible firms)

Frequently Asked Questions

An agent is task-scoped — it executes a job and exits. A claw is a persistent runtime that lives on a machine, holds identity and memory across sessions, schedules its own work, and uses messaging apps as the primary interface. Karpathy's March 2026 framing places claws as a distinct layer above agents in the AI capability stack.

Sources & References

23 validated sources · Last updated 2026-06-22

[1]
OpenClaw
WikipediaNews2026
[2]
[3]
OpenClaw, OpenAI and the future
Peter SteinbergerBlog / Analysis2026-02
[5]
OpenClaw founder Steinberger joins OpenAI
Yahoo Finance / ReutersNews2026-02-15
[6]
Karpathy on Code Agents, AutoResearch, and the Loopy Era
Podcast TranscriptBlog / Analysis2026-03
[7]
karpathy-x-2026-autoresearch-and-claws
karpathy-wikiBlog / Analysis2026-03
[8]
On running OpenClaw
Karpathy (X via xcancel)Blog / Analysis2026
[13]
NVIDIA Announces NemoClaw
NVIDIA NewsroomOfficial Docs2026-03-16
[17]
SOUL.md Guide
clawdocs.orgOfficial Docs2026
[18]
CISOs in a Pinch: A Security Analysis of OpenClaw
Trend MicroIndustry Report2026-03
[19]
[20]
[22]
NanoClaw has no features
NanoClawBlog / Analysis2026