← Back to Blog

From LangChain to OpenClaw: Ship Your First File-Based Agent in One Evening

OpenAgents.mom · 2026-05-31 · 8 min read

You built your first agent in LangChain. It ran. You felt good. Then you tried to debug it three weeks later, and the behavior you wanted was buried inside a chain of Python decorators, a few environment variables you half-remember setting, and a runnable.py file that has grown to 400 lines.

That's not a LangChain problem specifically — it's a pattern problem. When your agent's behavior lives in code, every change is a deployment. Every rollback is a git revert. Every "why did it do that?" is a stack trace.

File-based agent configs take a different approach: the agent's personality, memory rules, tool permissions, and operational constraints live in plain text files that you can read, diff, version, and hand off. OpenClaw is built around this model. If you've been running LangChain in production and want something you can self-host with less ceremony, this walkthrough covers the migration in practical terms.

What You're Actually Moving

Before touching any config, it helps to inventory what a typical small LangChain agent is actually doing:

System prompt: usually a string literal in Python or an f-string pulling from env vars
Tool definitions: decorated Python functions with docstrings the LLM reads
Memory: a ConversationBufferMemory or a vector store, wired in at runtime
Model config: ChatOpenAI(model="gpt-4o", temperature=0.2) scattered wherever the chain is instantiated
Guard rails: often nothing, or a custom output parser that throws on bad output

All of this becomes explicit files in an OpenClaw workspace. Nothing is inferred from code.

The File Layout You're Targeting

An OpenClaw workspace bundle is a directory. A minimal one looks like this:

my-agent/
├── AGENTS.md          # workspace-level instructions (the agent reads this)
├── SOUL.md            # personality, tone, hard limits
├── tools.yaml         # what tools the agent can call, and under what conditions
├── memory.yaml        # memory backend, summarization rules, retention policy
├── model.yaml         # model selection, temperature, fallbacks
└── .env.example       # what env vars are expected (secrets stay out of the bundle)

If you've seen why file-based agent configs beat black-box AI builders, the reasoning is the same here: the diff between two versions of SOUL.md tells you exactly what changed about your agent's behavior. A diff between two Python chain files usually doesn't.

Translating Your System Prompt → SOUL.md

In LangChain you might have this:

system_prompt = """You are a devops assistant. You help engineers triage
incidents and write runbooks. You do not modify production infrastructure
directly. Always ask for confirmation before suggesting irreversible steps."""

In OpenClaw, that becomes SOUL.md:

# Identity
You are a devops assistant. You help engineers triage incidents and write runbooks.

# Hard Limits
- Do not modify production infrastructure directly.
- Always ask for confirmation before suggesting irreversible steps.
- Do not speculate about root causes without surfacing your confidence level.

# Tone
Direct. No jargon. If you don't know something, say so.

The difference is structure. Hard limits are a named section, not a sentence buried in a paragraph. When you read the file six months from now, you know where to look.

Translating Tools → tools.yaml

A LangChain tool definition looks like this:

@tool
def run_kubectl(command: str) -> str:
    """Run a kubectl command against the current cluster context."""
    return subprocess.run(command.split(), capture_output=True, text=True).stdout

In tools.yaml:

tools:
  - name: run_kubectl
    description: Run a kubectl command against the current cluster context.
    allowed_commands:
      - "kubectl get"
      - "kubectl describe"
      - "kubectl logs"
    blocked_commands:
      - "kubectl delete"
      - "kubectl apply"
    require_confirmation: false
    timeout_seconds: 30

Notice what you gained: allowed_commands and blocked_commands are explicit config, not logic inside your tool function. A security reviewer can read this file without understanding Python. You can audit it in a pull request without running the agent.

For more on locking down what your agent can actually touch, the OpenClaw security checklist covers tool-level restrictions in detail.

Common Mistakes

Copying tool docstrings verbatim. LangChain docstrings are written for the LLM to read at runtime. Your tools.yaml description should be written for a human reviewer first. Rewrite it.
Skipping blocked_commands. If you only list allowed_commands, you're trusting the agent to stay in bounds. Block the dangerous ones explicitly.
Leaving timeout_seconds unset. Without a timeout, a hung subprocess keeps your agent waiting indefinitely. Set it low and raise it only if you have a measured reason.

Translating Memory → memory.yaml

LangChain's ConversationBufferMemory keeps everything in RAM and dies when the process dies. If you were using a vector store for longer-term memory, you had to wire that up yourself.

In memory.yaml:

memory:
  backend: sqlite          # or postgres, chroma, etc.
  path: ./memory.db
  max_turns_in_context: 20
  summarize_after_turns: 40
  summarization_model: gpt-4o-mini
  retention_days: 90

This is declarative. You can swap sqlite for postgres without touching any application code. You can change retention_days and the agent picks it up on next start.

Translating Model Config → model.yaml

In LangChain, model config is usually instantiation code:

llm = ChatOpenAI(model="gpt-4o", temperature=0.2, max_tokens=2048)

In model.yaml:

model:
  primary: gpt-4o
  temperature: 0.2
  max_tokens: 2048
  fallback:
    - gpt-4o-mini
    - claude-3-5-haiku-20241022
  fallback_on: [rate_limit, timeout]

The fallback block is the part LangChain doesn't give you out of the box. If your primary model times out, the agent tries the next one. You configure this once; you don't write retry logic.

Setting Up AGENTS.md

AGENTS.md is the workspace-level file the agent reads at the start of every session. Think of it as the operational brief: what's this agent for, what environment is it running in, what's the escalation path.

# Workspace: Production Incident Triage Agent

## Purpose
This agent helps on-call engineers triage production incidents.
It has read access to kubectl, CloudWatch logs, and the incident runbook repo.

## Environment
- Cluster: prod-us-east-1
- Runbook repo: /mnt/runbooks
- On-call Slack channel: #oncall-eng

## Escalation
If the agent cannot determine root cause within 10 minutes, recommend paging the platform team.

This file is the first thing you update when the agent moves to a new environment. No code change. No redeploy.

Security Guardrails

Don't put secrets in AGENTS.md or SOUL.md. Reference env var names instead ($SLACK_WEBHOOK_URL). The .env.example file documents what's needed without storing values.
Version-control the whole workspace directory. The files are the source of truth. If someone edits them on the server without committing, you lose your audit trail.
Restrict write access to tools.yaml. It controls what the agent can execute. Treat it like you'd treat a sudoers file.

What LangChain Still Does Better

This is a fair comparison, so here's where LangChain has the edge:

Ecosystem breadth. LangChain has integrations with more tools, vector stores, and APIs than any file-based runtime today. If you need a niche connector, LangChain probably has it; you'll build it yourself in OpenClaw.
Python-native composability. If your agent logic is genuinely complex — conditional branching, dynamic tool loading, multi-step reasoning chains — Python gives you more expressive power than YAML.
LangGraph for stateful flows. If you're building a multi-agent graph with complex state transitions, LangGraph is purpose-built for that. OpenClaw workspaces are simpler, which is a tradeoff.

The honest summary: OpenClaw workspaces are a good fit for agents with stable, well-defined behavior. If your agent's logic changes constantly, or you need dynamic tool composition at runtime, a code-first framework gives you more flexibility.

For a broader look at where different frameworks sit in 2026, navigating the AI frameworks ecosystem lays out the tradeoffs without a vendor slant.

Running the Migration in One Evening

Here's a realistic timeline:

Time	Task
0:00–0:30	Inventory your existing LangChain agent (system prompt, tools, memory, model config)
0:30–1:00	Write `SOUL.md` and `AGENTS.md`
1:00–1:45	Write `tools.yaml` and `memory.yaml`
1:45–2:15	Write `model.yaml`, set up `.env`, run the agent for the first time
2:15–3:00	Test against real inputs, fix gaps in SOUL.md, commit the workspace to git

If you're starting from scratch rather than migrating, you can skip the inventory step and use the OpenClaw workspace wizard to generate the initial files. That cuts the first hour down to about fifteen minutes.

Most of your time will be in SOUL.md — writing explicit hard limits is harder than it sounds when you're used to implicit behavior from a system prompt string.

Once you have a working workspace, turning it into a daily driver covers the operational side: logging, health checks, and making sure the agent behaves consistently outside your dev machine.

Is This the Right Move for You?

If you're running a LangChain agent in production and you're happy with it, don't migrate for the sake of it. The file-based approach wins on auditability, portability, and operator clarity — not on raw capability.

But if you've spent more than an hour debugging agent behavior that you couldn't read out of a config file, or if you want to self-host without managing a Python environment on every target machine, an OpenClaw workspace bundle is worth the evening it takes to set up.

The files are the agent. Put them somewhere you can read them.

Generate Your File-Based Agent Workspace From Scratch

Answer a few questions about your agent's purpose and tools — the wizard outputs a ready-to-run OpenClaw workspace bundle you can drop into git and run tonight.

Generate My Workspace