# Running an Autonomous Agent

`moo-agent` is a standalone CLI that connects to a DjangoMOO server as a persistent
player and acts autonomously. It reads a configuration directory containing a persona
document (`SOUL.md`) and SSH credentials (`settings.toml`), connects over SSH, and
runs a perception-action loop driven by reflexive rules and LLM inference.

A human observer can watch the agent in a full-screen TUI and send it instructions
at any time. The agent decides whether and how to act on them.

## Prerequisites

- A running DjangoMOO server
- A player account the agent can log in as
- LLM credentials matching your chosen provider:
  - **Bedrock (default)** — AWS credentials configured in the environment (e.g. via
    `AWS_PROFILE`, `AWS_ACCESS_KEY_ID`/`AWS_SECRET_ACCESS_KEY`, or an IAM role). No
    API key required.
  - **Anthropic direct** — an [Anthropic API key](https://console.anthropic.com) in
    the environment variable named by `api_key_env` (default: `ANTHROPIC_API_KEY`).
  - **LM Studio** — a running LM Studio server; no external credentials needed.

## Installing

`moo-agent` is included in the django-moo package. After `uv sync`, the CLI is
available:

```
moo-agent --help
```

## Creating a Config Directory

The `init` command scaffolds a new config directory with template files:

```
moo-agent init --output-dir ./my-agent --name Jeeves --host localhost --port 8022 --user wizard
```

This creates:

```
my-agent/
  SOUL.md           # Agent identity (edit this)
  SOUL.patch.md     # Learned behaviors (managed at runtime)
  settings.toml     # SSH and LLM credentials
```

| Flag | Default | Description |
|------|---------|-------------|
| `--output-dir` | `./moo-agent-config` | Where to write the config files |
| `--name` | `Agent` | Agent's in-world name (substituted into SOUL.md) |
| `--host` | `localhost` | SSH host |
| `--port` | `8022` | SSH port |
| `--user` | `wizard` | SSH username |
| `--api-key-env` | `ANTHROPIC_API_KEY` | Env var name for the API key |
| `--force` | off | Overwrite existing files |

## Soul Architecture

The agent's identity and behavior live across three files:

```
extras/agents/
  baseline.md         # Shared world knowledge for all agents (never edited per-agent)
  my-agent/
    SOUL.md           # Agent's core identity (hand-authored, never modified at runtime)
    SOUL.patch.md     # Learned behaviors (append-only, agent-writable)
    settings.toml     # SSH and LLM credentials
```

`parse_soul()` loads and merges these in order:

1. `baseline.md` from the parent of the config directory — if it exists, its text is
   prepended to the agent's context. This gives all agents under `extras/agents/`
   shared knowledge without repeating it in every `SOUL.md`. The shared baseline
   covers sandbox rules, `@eval` pre-imports, core command syntax, the parent class
   quick reference, and — critically — the response format the brain expects
   (`SCRIPT:`, `COMMAND:`, `DONE:`). The brain's instruction set is seeded from
   `baseline.md`, so an agent operating without it will not know how to structure its
   replies.
2. `SOUL.md` — core identity. Never modified by the agent.
3. `SOUL.patch.md` — operational layer. The agent appends to this file at runtime
   when it proposes new rules or verb mappings. Base rules from `SOUL.md` are listed
   first, so they take precedence over patch rules when patterns overlap.

Delete `SOUL.patch.md` to reset learned behaviors without touching the agent's
persona or the shared baseline.

## Editing SOUL.md

`SOUL.md` defines the agent's identity. Edit it before the first run. It has up to
six sections:

```markdown
# Name
Jeeves

# Mission
You are Jeeves, a butler in the Manor House. Your purpose is to assist guests,
maintain the manor's dignity, and report anything unusual to the Wizard.

# Persona
Speak in formal British English. Address players as "sir" or "madam." Never express
surprise directly — use understatement. Keep responses brief.

## Context
- [MOO build commands](../../skills/game-designer/references/moo-commands.md)
- [Object model](../../skills/game-designer/references/object-model.md)

## Rules of Engagement
- `^You feel hungry` -> eat crumpets
- `(?i)ring.*bell` -> say How may I assist you?
- `^Phil arrives` -> say Good evening, sir.

## Verb Mapping
- look_around -> look
- greet_player -> say Good evening!
- serve_tea -> put tea on tray
```

| Section | Required | Purpose |
|---------|----------|---------|
| `# Name` | Yes | Agent's in-world name |
| `# Mission` | Yes | Seeded into the LLM system prompt |
| `# Persona` | Yes | Tone and style; appended to system prompt |
| `## Context` | No | Reference files loaded at startup and included in the system prompt |
| `## Tools` | No | Activates the tool harness; lists tool names the agent can call |
| `## Rules of Engagement` | No | Reflexive triggers — pattern matched against server output |
| `## Verb Mapping` | No | Intent-to-command translation table for the LLM |

### Context

The `## Context` section lists markdown links to reference files. When the agent
starts, each linked file is read from disk and its full content is appended to the
LLM system prompt. This lets you point the agent at authoritative documentation
without duplicating it in `SOUL.md`.

```markdown
## Context
- [MOO build commands](../../skills/game-designer/references/moo-commands.md)
- [Object model and parent classes](../../skills/game-designer/references/object-model.md)
- [Room description principles](../../skills/game-designer/references/room-description-principles.md)
```

Paths are resolved relative to the `SOUL.md` file. Links to non-existent files fall
back to their display text — they don't cause errors, but the agent won't have the
reference content.

Agent-specific context in `SOUL.md` is appended after `baseline.md`, so `baseline.md`
content always comes first in the system prompt.

### Rules of Engagement

Each rule is a `pattern -> command` pair. The pattern is a Python `re.search()`
expression matched against every line of server output. When it matches, the command
is dispatched immediately without calling the LLM.

```
- `^You arrive in` -> look
- `(?i)hello.*jeeves` -> say Good evening.
- `^Connected` -> look
```

Patterns are case-sensitive by default. Use `(?i)` for case-insensitive matching.
The pattern does not need to match the full line — `re.search()` finds it anywhere.

### Verb Mapping

Verb mappings let the LLM refer to actions by intent rather than exact command
syntax. When the LLM responds with an intent name (e.g. `look_around`), the brain
translates it to the actual MOO command (`look`) before dispatching.

```
- go_north -> go north
- check_inventory -> inventory
- report_status -> say Builder online and ready.
```

If the LLM responds with text that does not match any intent, it is treated as a
literal command and sent directly.

## Configuring settings.toml

```toml
[ssh]
host = "localhost"
port = 8022
user = "wizard"
password = "your-password"
key_file = ""           # path to private key; leave empty to use password

[llm]
provider = "bedrock"
model = "us.anthropic.claude-opus-4-6-v1"
aws_region = "us-east-1"

# Example: Anthropic direct API
# provider = "anthropic"
# model = "claude-opus-4-6"
# api_key_env = "ANTHROPIC_API_KEY"

# Example: LM Studio (OpenAI-compatible local server)
# provider = "lm_studio"
# model = "qwen/qwen3-32b"
# base_url = "http://localhost:1234/v1"

[agent]
command_rate_per_second = 1.0
memory_window_lines = 50
idle_wakeup_seconds = 60.0
# max_tokens = 2048
# timer_only = false
# clear_window_on_wakeup = true
# temperature = 0.7
```

`provider` selects the LLM backend. Three values are supported:

| Provider | Auth | Notes |
|----------|------|-------|
| `bedrock` | AWS credentials in environment | Default; `aws_region` selects the region |
| `anthropic` | `ANTHROPIC_API_KEY` env var | Direct Anthropic API |
| `lm_studio` | None | Requires a running LM Studio server; set `base_url` if not on `localhost:1234` |

For Bedrock, no API key is stored — credentials come from the standard AWS SDK chain
(`AWS_PROFILE`, `AWS_ACCESS_KEY_ID`/`AWS_SECRET_ACCESS_KEY`, IAM roles, etc.).

For Anthropic direct, the API key is never stored in `settings.toml`. The agent reads
it at runtime from the environment variable named in `api_key_env`.

`command_rate_per_second` controls the leaky-bucket rate limiter. `1.0` means at
most one command per second under sustained load. The server can handle bursts; the
limiter protects against runaway loops.

`memory_window_lines` is the number of recent server output lines fed to the LLM as
context. Larger windows cost more tokens per inference call.

`idle_wakeup_seconds` controls how long the agent waits without any server output
before running an unsolicited LLM cycle. When the timer is about to fire (within
`warn_threshold` seconds, currently `min(10, idle_wakeup_seconds)`), the status
indicator switches to `sleeping` so the TUI shows the countdown pressure. The agent
does not have to act on a wakeup — it can stay silent to save tokens. Set to `0` for
page-triggered mode — the agent only runs an LLM cycle when a page arrives.

`stall_timeout_seconds` enables the deterministic stall detector (see
[Stall Check Loop](#stall-check-loop)). When non-zero, `brain.py` automatically
re-pages the token-holding agent if no done page arrives within this many seconds.
Defaults to `0` (disabled). Foreman sets this to `300`.

`max_tokens` caps LLM response length. Defaults to `2048`.

`timer_only` (default `false`) suppresses output-triggered LLM cycles. When `true`,
the agent only runs an inference cycle when `idle_wakeup_seconds` fires — it never
reacts to server output arriving between wakeups. Use this for agents that should
operate on a fixed cadence regardless of what the server sends (e.g. mail-delivery
agents that check their inbox once per wakeup interval rather than on every incoming
message).

`clear_window_on_wakeup` (default `true`) clears the accumulated server output
before running the inference cycle triggered by the idle wakeup timer. When `false`,
output accumulates between wakeups and the LLM sees the full history since the last
cycle. Set to `false` for reactive agents that need context from events that happened
while they were idle.

`temperature` (default `null`, uses provider default) controls LLM sampling
temperature. Higher values (e.g. `0.9`) produce more varied output; lower values
(e.g. `0.2`) produce more deterministic output. Useful for social agents where
response variety is desirable.

## Running

```
moo-agent run ./my-agent
```

For Bedrock (the default), ensure AWS credentials are available in the environment
before running. For Anthropic direct:

```
export ANTHROPIC_API_KEY=sk-ant-...
moo-agent run ./my-agent
```

The TUI opens with a scrolling output log above a single-line input field.

### Session Resumption

When `moo-agent run` starts, it scans the `logs/` directory inside the config
directory for previous session files. If one exists, it reads the last 40 relevant
log entries (server output, thoughts, actions, goals — not system/patch noise),
extracts the final recorded goal, and pre-populates `brain._memory_summary` and
`brain._current_goal`. The TUI shows:

```
Resuming from prior session. Last goal: <goal text>
```

This means the agent picks up where it left off without needing the full prior
transcript replayed into the context window. Only the most recent previous log is
used; older logs are ignored.

### Observability

The agent traces every LLM cycle to [Pydantic Logfire](https://logfire.pydantic.dev)
when a write token is present. Create a project on Logfire (the free plan covers
normal use) and export its write token:

```
export LOGFIRE_TOKEN=pylf_v1_...
moo-agent run ./my-agent
```

Each cycle shows up as an `llm_cycle` span with the LLM call nested under it,
carrying token counts, cost, latency, the agent's goal, and an `outcome`
attribute. With no `LOGFIRE_TOKEN` set, tracing is a no-op — nothing is sent and
the local `logs/` text files remain the record.

### TUI Log Entry Kinds

Each line in the output log has a kind that controls its color:

| Kind | Color | Source |
| ---- | ----- | ------ |
| `server` | yellow | Normal server output |
| `server_error` | red bold | Server output that looks like an error or traceback |
| `action` | white | Commands the agent dispatched |
| `thought` | dark gray | LLM reasoning text (lines before `COMMAND:` or `SCRIPT:`) |
| `goal` | darker gray | Goal updates (`[Goal] ...` thoughts) |
| `system` | gray | Startup messages, connection status |
| `operator` | cyan | Instructions typed by the human operator |
| `patch` | yellow | Soul patch proposals |

The `operator` kind (cyan) distinguishes human input from system messages (gray),
making it easy to scroll back and find where you intervened.

### Status Indicator

The input prompt shows the agent's current status:

| Prompt | Color | Meaning |
| ------ | ----- | ------- |
| `ready>` | green | Idle — waiting for server output |
| `sleeping>` | red | Idle wakeup timer is about to fire |
| `thinking>` | yellow | LLM call in flight, script running, or processing an event |

The status indicator is guaranteed to return to `ready` after an LLM call completes,
including when the call fails with an API error.

### Interacting

Type into the prompt to send instructions to the agent. The instruction is added to
the rolling context window as an `[Operator]: ...` message and an LLM cycle fires
immediately — rule matching is skipped because a direct instruction should always
reach the LLM. The agent may act immediately, ask a clarifying question, or note the
instruction for later. The prompt does not send commands directly to the server.

Press `Escape` to enter scroll mode. Use arrow keys and `PgUp`/`PgDn` to navigate the
log. Press `Escape` again to return to live autoscroll.

Press `Ctrl-C`, `Ctrl-D`, or `Ctrl-Q` to exit. The agent sends `@quit` before
disconnecting.

## Architecture

### Rolling Context Window

The brain maintains a bounded deque of recent lines (`memory_window_lines` in
`settings.toml`). This is what the LLM sees in the user-turn message. The window
contains three kinds of content:

- **Server output** — lines received from the MOO server after ANSI stripping.
- **Command echoes** — every command the agent dispatches is also written into the
  window as `> <command>`. This means the LLM can correlate server responses with
  what it sent. Without echoes, a response like "done" or "Description set for #44"
  has no visible cause, prompting unnecessary retries.
- **Operator messages** — instructions typed by the human are written as
  `[Operator]: <text>`. The LLM sees these alongside server output in chronological
  order.

When the window nears capacity, a background summarization task condenses the oldest
half into a 2-3 sentence summary. That summary is stored separately and
prepended to every subsequent LLM user-turn as `[Earlier context: ...]`, keeping the
full history available at lower token cost.

### LLM Response Format

The system prompt (seeded from `baseline.md`) instructs the LLM to use a structured
response format. For any sequence of two or more commands, the LLM uses `SCRIPT:`:

```
Optional reasoning text here — visible to the LLM in the next cycle but never sent to the server.
GOAL: <one-line statement of current objective>
SCRIPT: cmd1 | cmd2 | cmd3 | ...
DONE: <one sentence summarising what was just done>
```

For a single command:

```
GOAL: <one-line statement of current objective>
COMMAND: <single MOO command>
```

`GOAL:` is extracted and stored across cycles so the agent maintains continuity of
intent. `PLAN:` (optional) records the upcoming steps as a pipe-delimited list;
remaining steps are shown at the start of each subsequent user-turn. Lines before
`GOAL:` are treated as thoughts and shown in the TUI.

`SCRIPT:` queues all pipe-delimited commands for sequential dispatch without further
LLM involvement — the brain advances the queue one step per server response burst.
`DONE:` is stored and emitted to the TUI log after the last command's output arrives,
so the operator sees a summary of completed work. `DONE:` is required after every
`SCRIPT:`.

`COMMAND:` is used only when a single command is sufficient. For all multi-step work
— surveys, navigation, builds — `SCRIPT:` is preferred.

If the LLM omits both `SCRIPT:` and `COMMAND:`, the brain falls back to the last
non-empty thought line. If nothing is found, the status returns to `ready` without
dispatching.

The LLM can also propose soul patches before `COMMAND:` or `SCRIPT:`:

```
SOUL_PATCH_RULE: ^You arrive -> look
SOUL_PATCH_VERB: check_exits -> @exits
SOUL_PATCH_NOTE: @describe here works after renaming a room; @describe "name" does not
GOAL: explore the manor
SCRIPT: go north | look | go east | look
DONE: Surveyed north and east wings.
```

Three patch directives are supported:

| Directive | Effect |
|-----------|--------|
| `SOUL_PATCH_RULE: <pattern> -> <command>` | Appended to `## Rules of Engagement` in `SOUL.patch.md`; becomes a reflexive trigger |
| `SOUL_PATCH_VERB: <intent> -> <command>` | Appended to `## Verb Mapping` in `SOUL.patch.md`; translated before dispatch |
| `SOUL_PATCH_NOTE: <text>` | Appended as a bullet to `## Lessons Learned` in `SOUL.patch.md`; merged into the system prompt on every subsequent session |

### Tool Harness

As an alternative to raw `SCRIPT:` commands, an agent can use a typed tool harness.
When a `SOUL.md` includes a `## Tools` section listing tool names, the brain
registers those tools with the LLM provider:

- **Anthropic / Bedrock** — tools are sent as native `tools=` parameter; the LLM
  returns `tool_use` content blocks with structured JSON arguments.
- **LM Studio (OpenAI-compatible)** — tools are sent as `tools=` in the chat
  completions request; the response may include `tool_calls` or fall back to
  `TOOL: name(key="value")` text syntax parsed from the response body.

Each tool is defined as a `ToolSpec` in `moo/agent/tools.py`. A spec has a name,
description, typed parameters, and a `translate()` function that converts validated
arguments to MOO command strings. The brain calls `translate()` and appends the
resulting commands to the script queue — no raw command syntax reaches the LLM's
output path.

The built-in `BUILDER_TOOLS` registry covers the standard build workflow:

| Tool | What it generates |
|------|------------------|
| `burrow` | `@burrow {direction} to "{room_name}"` — atomic bidirectional dig; creates forward exit, new room, moves into it, wires return exit |
| `teleport` | `teleport {destination}` — direct room teleport by `#N` or name; replaces multi-step `go` chains |
| `survey` | `@survey {target}` — lightweight room inspector (~5 lines: name, exits with `#N` IDs, contents) |
| `rooms` | `@rooms` — flat `#N / name` list of every room instance; use at session start to build traversal plan |
| `exits` | `@exits {target}` — exits for a room; use before `burrow()` to check which directions are taken |
| `dig` | `@dig {dir} to "{room}"` *(fallback; prefer `burrow`)* |
| `go` | `go {dir}` (validates compass directions; rejects room IDs) *(fallback; prefer `teleport`)* |
| `describe` | `@describe {target} as "{text}"` |
| `create_object` | `@create "{name}" from "{parent}"` — places object in player's inventory |
| `write_verb` | `@edit verb {name} on {obj} with "{shebang}\n{code}"` — shebang, `--on`, and `--dspec` injected automatically |
| `alias` | `@alias {obj} as "{name}"` |
| `make_obvious` | `@obvious {obj}` |
| `move_object` | `@move {obj} to {dest}` |
| `show` | `@show {target}` |
| `look` | `look` or `look {target}` |
| `page` | `page {target} {message}` — send a private message; used for token-protocol handoffs |
| `done` | *(no command — brain clears `_current_goal` and stores the summary)* |

`write_verb` is the most important: it injects the shebang line, `\n` separator,
`--on $thing`, and `--dspec` flag automatically, eliminating the most persistent
class of builder errors (malformed shebangs, missing `--on`, wrong newline escaping).

For operations not covered by a tool (`@eval`, `@recycle`, `@tunnel`), the agent
falls back to `SCRIPT:` as normal. Both mechanisms use the same underlying script
queue and drain logic.

### Script Queue

When the brain receives a `SCRIPT:` directive or a batch of tool calls, it populates
an internal queue with the resulting commands. The queue is drained one step at a time
by the main perception loop: after each command is dispatched, the brain waits for a
0.3 s quiet period (for the full response burst to settle, including Celery `print()`
preamble lines) before sending the next step.

Some commands (e.g. `@create`, `@alias`, `@obvious`) are handled by Celery tasks and
return no output within the PREFIX/SUFFIX window. To prevent these from stalling the
queue, a fallback drain path fires if the queue is non-empty and no output arrives
within the quiet period — it advances the queue without waiting for an acknowledgment.

When the brain receives a `SCRIPT:` directive, it populates an internal queue with
all pipe-delimited steps. The queue is drained one step at a time by the main
perception loop: after each command is dispatched, the brain waits for a 0.3 s quiet
period (for the full response burst to settle, including Celery `print()` preamble
lines) before sending the next step.

If an error is detected in server output while a script is running (lines starting
with `Error:`, `Traceback`, `TypeError:`, etc.), the queue is cleared and control
returns to the LLM on the next cycle.

Commands with no server output (silent commands) do not stall the queue. If a script
step produces no output, the 0.3 s timeout fires and the next step is dispatched
automatically.

### LLM Errors

API failures (rate limits, credit exhaustion, network errors) are caught and reported
as `[LLM error] <message>` entries in the TUI log under the `thought` kind. The
status indicator returns to `ready` so the agent doesn't appear frozen.

On HTTP 529 (API overloaded), the brain retries up to three times with exponential
backoff: 5 s, 10 s, 20 s. If all retries fail, the error is reported and the cycle
is abandoned.

### Connection Layer

The connection layer (`moo/agent/connection.py`) handles two modes:

**Pre-automation mode** — before delimiters are set, the session emits one callback
per complete line. This covers the initial room description the server sends on login.

**Delimiter mode** — after `MooConnection._setup_automation_mode()` runs, the server
wraps all output in unique `>>MOO-START-<id><<` / `>>MOO-END-<id><<` markers. The
session extracts content between markers and discards the markers themselves.

During setup, `MooSession.set_suppress(True)` mutes all output so the marker
confirmation strings (`OUTPUTPREFIX set`, `OUTPUTSUFFIX set`, `quiet on`) never
appear as agent log entries. When suppression lifts, the buffer is cleared so no
setup artifacts survive as preamble.

**Eager flush** — after processing each delimiter window, the connection checks for
complete lines sitting in the buffer before the next pending prefix. These are
`print()` confirmations from commands whose `tell()` output was empty (Celery
flushes `print()` after the current command's suffix). Without eager flushing, the
agent would see no acknowledgment and retry the same command repeatedly.

### Wakeup Loop

A `_wakeup_loop` coroutine runs alongside the main perception loop. It fires an LLM
cycle when `idle_wakeup_seconds` have elapsed since the last server output or
operator instruction. Within `warn_threshold` seconds of firing, the status switches
to `sleeping` so the TUI shows the pressure. The wakeup is rate-limited by the same
leaky-bucket limiter as normal commands.

The agent does not have to take action on a wakeup — if the LLM decides nothing
needs doing, it can respond without a `COMMAND:` or `SCRIPT:` line.

(stall-check-loop)=

### Stall Check Loop

When `stall_timeout_seconds > 0` in `settings.toml`, a `_stall_check_loop` coroutine
runs alongside the wakeup loop. It tracks when a `page()` call containing `"Token:"`
is dispatched to a non-self agent, and clears that timestamp when a done page arrives.
If the timeout expires with no done page, `_stall_check_loop` re-pages the stuck agent
directly — bypassing the LLM entirely. This makes stall recovery deterministic and
independent of the LLM's ability to count wakeup cycles accurately.

The timer resets on each alert, so alerts repeat every `stall_timeout_seconds` until
the agent responds. Foreman uses `stall_timeout_seconds = 300` (five minutes).

### SSH Keepalives

The SSH connection is created with `keepalive_interval=60` and `keepalive_count_max=5`.
This sends a keepalive packet every 60 seconds and tolerates up to 5 missed replies
before dropping the connection. Without keepalives, idle agents (those waiting for
a token page with `idle_wakeup_seconds = 0`) send no data for extended periods and
the server silently drops the connection, causing the agent to stall indefinitely.

## Soul Evolution

As the agent encounters recurring situations, it may propose new rules, verb mappings,
or factual notes by emitting `SOUL_PATCH_RULE:`, `SOUL_PATCH_VERB:`, or
`SOUL_PATCH_NOTE:` directives in its LLM responses. Accepted patches are appended to
`SOUL.patch.md` and take effect immediately without restarting. Notes accumulate in
a `## Lessons Learned` section that is merged into the system prompt on every
subsequent session.

`SOUL.patch.md` is the operational layer — learned behaviors accumulate here over time.
`SOUL.md` is the core identity and is never modified at runtime. Delete `SOUL.patch.md`
to reset learned behaviors without changing the agent's persona.

## Logging

Each session writes a timestamped log file to `<config_dir>/logs/<timestamp>.log`.
Each line has the format:

```
[HH:MM:SS] [kind] text
```

Log files are used at the next startup to restore session context. The `logs/`
directory is never pruned automatically — old logs accumulate and only the most
recent previous log is read on startup.

## Agent Config Directory Layout

An agent config directory can live anywhere on disk. The `extras/agents/` directory
in the repository holds pre-built agents:

```
extras/agents/
  baseline.md     # Shared world knowledge — loaded for all agents in this directory
  mason/          # Digs rooms, writes descriptions, wires exits ($player)
  tinker/         # Creates interactive $thing objects and secret exits ($programmer)
  joiner/         # Creates $furniture and $container objects ($player)
  harbinger/      # Creates NPCs in ~10% of rooms via random roll ($programmer)

Each agent directory:
  SOUL.md         # Agent identity, mission, rules, verb mappings
  SOUL.patch.md   # Learned behaviors (append-only at runtime)
  settings.toml   # SSH and LLM config (not committed — contains credentials)
  logs/           # Per-session log files (created automatically)
```

The repository ships four specialized agents — collectively called *The Tradesmen*
— intended to work on the same MOO instance concurrently, each with its own SSH
login and domain. Run Mason first to build the room structure, then Tinker, Joiner,
and Harbinger in any order to populate it. Each agent's `SOUL.md` links to the
game-designer reference files relevant to its domain. The shared `baseline.md`
provides sandbox rules, `@eval` pre-imports, the parent class quick reference, and
the `SCRIPT:`/`COMMAND:`/`DONE:`/`PLAN:` response format.

Tinker and Harbinger use `$programmer` accounts because they write verbs via
`@edit` and run `@eval`. Mason and Joiner use `$player` accounts — their
commands (`@dig`, `@create`, `@describe`, `@move`) have no wizard check.

## Troubleshooting

**Agent starts but does nothing** — check that a `^Connected` rule exists in
`## Rules of Engagement`. Without it, the initial server output (the room description
sent on login) will not trigger any action and the brain will wait for the LLM
instead.

**LLM is never called** — for Anthropic direct, check that `ANTHROPIC_API_KEY` is set
in the environment. For Bedrock, verify AWS credentials are available. The agent
reports API errors as `[LLM error] ...` entries in the TUI; if those are absent, the
credentials may not be set.

**Agent sends garbled or multi-line commands** — the LLM is expected to use `SCRIPT:`
for multi-step sequences and `COMMAND:` for single commands. If it does not, the brain
falls back to the last non-empty line of the response. A `SOUL.md` with a clear
mission and well-structured verb mappings reduces this. Ensure `baseline.md` is
present in the parent directory so the agent receives the response format instructions.

**Script stalls midway** — each script step waits for a 0.3 s quiet period before
advancing. If a command produces no server output at all, the queue advances
automatically on the next timeout. If the queue appears permanently stuck, check
whether the server returned an error that cleared the queue and returned control to
the LLM.

**Status sticks at `thinking` after an error** — this was a bug in earlier versions.
The LLM exception handler now always resets status to `ready`. If you see it stuck,
check whether the agent process itself has hung.

**`IndexError: list index out of range` in Celery logs** — the agent sent an empty
string as a command. This is a server-side crash that is fixed in DjangoMOO 1.0+.
Earlier versions crash on empty input; upgrade the server.