Auctor Agent

A White Paper on Bounded Autonomy for Content Operations

Executive Summary

OpenClaw is useful to Auctor not because Auctor should become a clone of it, but because OpenClaw names a set of agent primitives that are easy to miss when we focus only on prompts, models, or UI.

The core lesson is this:

Smarter agents do not come primarily from bigger prompts. They come from better runtime primitives: durable state, scoped memory, proactive observation, constrained action, skill loading, and tight feedback loops.

For Auctor, the right response is not to become a general-purpose personal daemon. The right response is to become a domain daemon for content operations.

That means:

more durable than a chat session
more proactive than a request-response tool
more stateful than a workflow runner
more governed than an autonomous shell agent

The thesis of this paper is:

Auctor should evolve into a site-scoped, database-first, policy-governed content operator whose agents can continuously observe, reason, prepare artifacts, request approvals, and learn from outcomes without abandoning the existing stack of Next.js, FastAPI, Mastra, Supabase/Postgres, and Electron.

This paper translates OpenClaw's ideas into Auctor-compatible primitives and argues for a specific architecture: bounded autonomy over canonical content state.

Why This Matters Now

Today Auctor already has several important ingredients:

a canonical content domain model
typed workflows for strategy, briefs, and drafts
a Mastra harness with modes and subagents
structured grounding via documents, extractions, summaries, and graph edges
audit logs and sync runs
a read-only workspace layer
an Electron shell with scheduled cycles and approval gates

That is already much more than a chat wrapper.

But the current system still behaves like a collection of workflow features rather than a coherent agent runtime. The gaps are not mainly "better prompts." They are runtime gaps:

memory is fragmented across threads, runs, tables, and files
scheduled autonomy is prompt-driven rather than state-driven
workspace artifacts are present but not yet central to cognition
tool policy exists but is not yet a unified capability model
agents can act inside workflows, but they do not yet maintain a durable operating posture across time
the system can log actions, but it does not yet learn systematically from outcomes

OpenClaw is a good foil because it is explicit about these missing primitives.

The Constraints We Should Preserve

Any proposal that ignores Auctor's existing constraints will be elegant on paper and wrong in practice.

The architecture should preserve the following:

1. Database-first canonical state

The source of truth for operational entities should remain the ContentEngine database:

documents
extractions
summary nodes
graph edges
plan items
briefs
drafts
reviews
validations
workflow runs
audit logs

Files should remain mirrors, casefiles, playbooks, or working sets.

2. Human approval as a product feature

Auctor's value is not that it removes humans from the loop. Its value is that it makes the loop legible, faster, and better grounded.

Publish actions, major state transitions, and non-obvious decisions should continue to pass through explicit approval surfaces.

3. Existing framework boundaries

We should keep:

Next.js App Router as the operator surface
FastAPI as the LangExtract sidecar
Mastra as the workflow and harness substrate
Supabase/Postgres as primary persistence
Electron as the local always-on shell where autonomy is desired

The goal is to deepen the runtime, not replace it.

4. Domain specificity over generic agent breadth

Auctor is a content operator, not a universal personal assistant.

We should resist:

multi-channel messaging sprawl
generic shell-first autonomy
arbitrary plugin execution
file-first canonical state
open-ended personal productivity scope

5. Site scoping

The system is architecturally multi-site, even if consul is the dominant default today.

Every durable primitive should remain site-aware.

Current Auctor Anchors

The proposed primitives are not starting from zero. Auctor already contains partial versions of several of them.

Current anchors in the codebase include:

frontend/src/mastra/harness.ts
- mode-aware orchestration, shared state, subagents, workspace hookup
frontend/src/mastra/workflows/content-engine-workflows.ts
- typed strategy, brief, and draft workflows with suspend and resume points
frontend/src/content-engine/services/workflow-runner.ts
- persistent run execution and resume semantics
frontend/src/content-engine/services/agent-runner.ts
- model execution, cost accounting, and audit logging
frontend/src/mastra/tools/content-engine-tools.ts
- a typed tool surface over canonical content data and publishing actions
frontend/src/mastra/workspace.ts
- a read-only searchable workspace with BM25 and filesystem access
frontend/app/api/harness/chat/route.ts
- a streaming operator control plane for conversational orchestration
desktop/src/workspace/scheduler.ts
- scheduled autonomy with morning, midday, evening, and weekly cycles
desktop/src/claude/tool-policy.ts
- approval-gated tool use and workspace-scoped write restrictions

This means the architectural question is not "how do we bolt agents onto Auctor?"

It is:

How do we turn these isolated agent features into a coherent agent runtime?

What OpenClaw Teaches, and How Auctor Should Respond

The most useful way to interpret OpenClaw is primitive-by-primitive.

OpenClaw concept	Why it matters	Auctor response
Persistent gateway daemon	Agents become ambient and stateful rather than ephemeral	Adapt. Use the existing harness plus Electron runtime as an Auctor control plane, not a multi-channel gateway
Markdown-first operating files	Agent behavior becomes inspectable and editable	Adopt selectively. Use workspace playbooks and operating docs, but keep DB state canonical
Durable sessions	Context persists beyond one request	Adapt. Persist thread summaries, run journals, and artifact-linked memory rather than only chat history
Hybrid memory search	Retrieval becomes broader than recent context	Adopt. Combine structured DB retrieval with workspace search and run summaries
Skills loaded on demand	Capabilities stay modular without bloating every prompt	Adopt. Implement Auctor skill packs or playbooks for domain tasks
Heartbeat / proactive checks	Agent can notice and surface issues before being asked	Adopt. Convert scheduled cycles into state-driven observation loops
Lane queueing	Avoid conflicting concurrent action in the same context	Adapt. Add per-site and per-artifact execution lanes
Messaging platform adapters	Makes the agent present where people already are	Avoid for now. Auctor does not need channel sprawl to become smarter
General plugin ecosystem	Fast extension, but high trust surface	Constrain heavily. Prefer internal adapters and approved skill packs
Broad shell/tool access	Increases raw capability and risk	Avoid as a primary design choice. Prefer typed tools and approved actions

The pattern is clear:

Auctor should borrow OpenClaw's runtime ideas, not its product surface.

The Auctor Agent Thesis

The right mental model for Auctor is not "chatbot with workflows" and not "desktop AI OS."

It is:

A site-scoped content operator with durable memory, observation loops, typed capabilities, and artifact-centered reasoning layered on top of canonical workflow state.

This implies five first principles.

1. Perception

The agent must continuously observe:

research freshness
competitor changes
ranking movement
analytics anomalies
blocked drafts
pending reviews
publish readiness

Without perception, autonomy degrades into blind scheduling.

2. Memory

The agent must remember more than the current thread:

durable site directives
prior operator decisions
workflow outcomes
repeated editorial corrections
successful and unsuccessful publish patterns

Without memory, every run starts from zero.

3. Deliberation

The agent must decompose work into distinct reasoning roles:

observe
prioritize
plan
draft
critique
validate

Without decomposition, the system becomes one large prompt with weak guarantees.

4. Action

The agent must be able to mutate the world in bounded ways:

create artifacts
update workflow state
prepare casefiles
request review
publish only behind policy and approval

Without action, intelligence never leaves the prompt.

5. Governance

The agent must act under policies, budgets, and approval rules.

Without governance, autonomy becomes operational risk.

Reference Architecture

Proposed Primitives

The rest of this paper defines the primitives that would make Auctor measurably smarter while remaining inside the current architecture.

Primitive 1: Agent Operating Profile

Definition

A durable, inspectable operating profile that defines how an Auctor agent should think and behave for a given site, mode, or workflow layer.

Why it matters

Today Auctor instructions live in a mix of:

hard-coded agent strings
harness mode prompts
strategy directives
workspace templates

That works, but it fragments agent identity across code and storage.

An operating profile would consolidate:

site mission
audience and brand constraints
editorial rules
escalation rules
approval boundaries
workflow defaults
"never do this" constraints

Auctor-compatible implementation

Represent the profile as a layered combination of:

DB-backed site directives
workspace markdown docs
agent and mode-level instructions
runtime policy settings

The profile should be loadable by Mastra orchestrators and visible in the operator UI.

What it unlocks

consistent agent behavior across chat, workflows, and scheduled runs
easier inspection and revision of agent posture
less prompt drift between surfaces

Primitive 2: Attention Queue

Definition

A durable queue of agent-relevant issues that need judgment, review, or action.

Why it matters

Right now the system has many raw events and statuses, but not a unified "attention model."

The agent should think in terms of:

stale research
competitor spike
AI overview appearance for tracked keyword
brief awaiting review
draft blocked by SEO
publish-ready draft awaiting approval
integration degraded

Without an attention queue, proactive behavior turns into repeated full scans of the whole system.

Auctor-compatible implementation

Promote system events and workflow conditions into normalized attention items with:

site key
item type
severity
owning mode or layer
linked artifact IDs
status
created and resolved timestamps

This can be surfaced in the web UI and consumed by both scheduled loops and interactive orchestrators.

What it unlocks

proactive work with bounded scope
better triage
less redundant reasoning
an operator inbox that matches how the agent already thinks

Primitive 3: Observation Loops

Definition

Scheduled or event-driven runs that inspect the system, convert raw state into attention items, and only escalate when something actually requires action.

Why it matters

The desktop scheduler already runs morning, midday, evening, and weekly cycles, but those cycles are prompt-driven and broad.

A smarter system should be state-driven:

check for stale competitor crawl data
detect ranking drops for strategic keywords
find drafts stuck in review
flag repeated editorial corrections on the same agent
identify publish-ready items with missing metadata

Auctor-compatible implementation

Use the existing scheduler and workflow infrastructure, but shift the contract from "run a long generic prompt" to "execute a typed observation pass."

Observation passes should:

read canonical DB state
compute findings deterministically where possible
create attention items
optionally generate short casefile notes
notify the operator only when thresholds are crossed

What it unlocks

proactive autonomy without chat spam
less token waste
a transition from scheduled prompts to continuous monitoring

Primitive 4: Casefile Workspace

Definition

A durable artifact layer that captures the evidence, reasoning context, and output bundle for each important run or content artifact.

Why it matters

The RFC already points toward casefiles, and OpenClaw is right about one thing: agents become more capable when they can inspect and build artifact bundles over time.

For Auctor, casefiles should not replace the database. They should make the database legible.

Each casefile should make it easy to answer:

what did the agent see
what evidence mattered
what it proposed
what the operator changed
what finally shipped
how the outcome performed

Auctor-compatible implementation

Extend the existing workspace into run-scoped and artifact-scoped bundles:

manifest.json
evidence summary
selected extractions
plan rationale
brief package
draft package
validation report
publish payload
reflection note

The DB remains canonical. The casefile is the inspection and export layer.

What it unlocks

higher trust
better debugging
better long-term learning inputs
cleaner human handoff

Primitive 5: Hybrid Memory Stack

Definition

A layered retrieval model that combines structured data, artifact files, and synthesized run memory.

Why it matters

Auctor's strongest asset versus generic agent systems is that it already has structured retrieval:

documents
extractions
summary nodes
graph edges
workflow state

The mistake would be to throw that away and replace it with generic vector memory.

The right move is to add memory layers above it.

Auctor-compatible implementation

Use a three-tier memory model:

Canonical structured retrieval from the database
Workspace and casefile retrieval through BM25 and optional embeddings
Reflection memory composed of run summaries, operator corrections, and learned patterns

The agent should query these in order of trust:

DB facts
current casefiles and workspace docs
distilled historical reflections

What it unlocks

stronger contextual recall
less duplication of mistakes
richer grounding without abandoning structured data

Primitive 6: Skill Packs and Playbooks

Definition

On-demand capability bundles that package domain instructions, templates, retrieval hints, and allowed tools for a specific class of work.

Why it matters

OpenClaw is directionally right that large prompts do not scale. Auctor should not try to stuff every brand rule, workflow nuance, SEO heuristic, and publishing checklist into every model call.

Skill packs let the agent load context only when needed.

Examples:

competitor teardown playbook
high-trust B2B briefing playbook
comparison-page drafting playbook
internal linking remediation playbook
publish-readiness playbook
post-publish monitoring playbook

Auctor-compatible implementation

Represent skill packs as workspace-backed markdown plus structured metadata:

scope
owning mode
required tools
optional templates
examples
approval requirements

Mastra orchestrators can select them explicitly, and workflows can declare defaults by content type or step.

What it unlocks

modular agent behavior
less prompt sprawl
easier continuous improvement
domain specialization without changing code for every behavior tweak

Primitive 7: Capability Profiles and Policy Engine

Definition

A unified model for what each agent, mode, and workflow step is allowed to read, write, mutate, or publish.

Why it matters

Auctor already has the beginnings of this:

different tool sets per agent
desktop approval gates for publish tools
explicit publish-target state

But the policies are split across code paths and surfaces.

To become more autonomous safely, the system needs a single capability language.

Auctor-compatible implementation

Define capability profiles by:

mode
agent
workflow step
environment
site

Each profile should express:

allowed tools
allowed write scopes
approval-required actions
cost or step budgets
model class
escalation rules

This policy engine should be shared conceptually across:

Mastra tools
workflow runner
desktop runtime
publish surfaces

What it unlocks

safer autonomy
more consistent operator expectations
easier reasoning about what an agent may do in each context

Primitive 8: Execution Lanes and Conflict Control

Definition

A concurrency model that prevents multiple agent runs from stepping on the same artifact or objective at the same time.

Why it matters

OpenClaw's lane queue is not interesting because it is clever. It is interesting because it acknowledges that concurrency without coordination is destructive.

For Auctor, the danger is not two chat messages racing. It is:

two runs revising the same draft
a scheduled loop mutating a plan item while a human is reviewing it
publish and SEO remediation colliding
multiple agents writing overlapping casefiles

Auctor-compatible implementation

Add execution lanes scoped to:

site
plan item
brief
draft
workflow run

A lane should support:

serialize
defer
merge
require operator choice

The workflow runner, scheduled loops, and desktop runtime should all honor the same lane model.

What it unlocks

safer autonomy
fewer artifact conflicts
cleaner mental model for the operator

Primitive 9: Reflection and Learning

Definition

A structured post-run learning loop that turns outcomes into reusable memory rather than leaving them buried in audit logs.

Why it matters

Auctor already logs what happened. That is not the same as learning from what happened.

The system should be able to capture:

which plan items were approved quickly
which briefs repeatedly triggered changes
which draft issues appear over and over
which SEO blocks were common
which published formats performed best
which agent configurations correlated with higher quality

Auctor-compatible implementation

After major runs, generate a small reflection artifact with fields like:

objective
outcome
operator corrections
failure modes
reusable lessons
confidence for reuse

Only high-signal reflections should be promoted into durable memory or playbook updates.

What it unlocks

cumulative intelligence
fewer repeated mistakes
better agent prompts and playbooks over time

Primitive 10: Model Routing and Deliberation Topology

Definition

A formal map of which models and reasoning roles should be used for which tasks.

Why it matters

Auctor already has partial specialization:

strategy agents
writer
editor
validator

That should become a deliberate topology, not an incidental one.

Different tasks want different properties:

observation wants low cost and consistency
strategy wants synthesis
writing wants voice and composition
editing wants critique
validation wants determinism

Auctor-compatible implementation

Keep the current agent layering, but formalize a topology like:

observer
researcher
strategist
brief architect
writer
critic
validator
publisher
reflector

Model routing should consider:

cost ceiling
task criticality
step budget
need for tool use
tolerance for creative variance

What it unlocks

lower operating cost
clearer agent responsibilities
better repeatability

Primitive 11: Operator Inbox and Escalation Surface

Definition

A single place where the agent can surface decisions, approvals, blocked items, and proposed next actions.

Why it matters

The current system already has:

plan approval flows
question responses
suspended workflows
draft and brief review surfaces

These should become one coherent operator inbox rather than several unrelated interruptions.

Auctor-compatible implementation

Normalize all escalations into operator-facing decision objects:

approval required
question asked
conflict detected
review requested
anomaly detected
publish-ready notification

The inbox should link directly to the relevant casefile or artifact.

What it unlocks

better human-agent collaboration
cleaner autonomy boundaries
easier review of what the system is waiting on

Primitive 12: Autonomy Budget

Definition

A set of hard limits that define how much autonomous work the system may do before it must stop, summarize, or escalate.

Why it matters

The more proactive Auctor becomes, the more it needs bounded stopping conditions.

Otherwise the system risks:

burning cost budgets
creating too many low-value artifacts
over-notifying the operator
mutating too many entities without review

Auctor-compatible implementation

Budgets can be expressed per run or per cadence:

max steps
max cost
max artifacts created
max unattended mutations
max time window
mandatory escalation checkpoints

These budgets should be visible to the operator and enforced by policy.

What it unlocks

safer long-running autonomy
better predictability
easier trust calibration

Operating Model: What a Smarter Auctor Actually Does

If these primitives are implemented, Auctor changes from a workflow-triggered app into a bounded operating loop:

Observe system state and external signals.
Convert signals into attention items.
Prioritize the highest-value items based on site strategy and budgets.
Load the relevant operating profile, playbooks, memory, and casefiles.
Delegate to specialized agents or workflows.
Write outputs to canonical state and artifact bundles.
Escalate when policy requires human review.
Reflect on outcomes and store only high-value lessons.

The important change is not "more autonomy." It is closed-loop autonomy.

What We Should Not Copy from OpenClaw

OpenClaw is strongest when it names useful primitives. It is weakest when copied too literally into the wrong domain.

Auctor should not copy:

1. Messaging-platform sprawl

Auctor does not need to live in WhatsApp, Telegram, Slack, Discord, and Signal to become intelligent.

Its natural surfaces are:

the operator web app
the agent window
the desktop shell
structured inbox and review surfaces

2. Generic shell-first power

Open-ended local shell capability is not the right center of gravity for Auctor.

Auctor should prefer:

typed tools
typed workflows
scoped mutations
explicit publish approvals

3. Plugin permissiveness

A wide-open plugin marketplace would create trust, maintenance, and safety problems faster than it would add value.

Domain-specific, approved adapters and playbooks are a better fit.

4. File-first truth

Files are useful for legibility and inspection. They should not outrank the ContentEngine database.

5. Personal assistant scope

Auctor should stay focused on content operations, research operations, and publishing operations.

That focus is a strength.

A Practical Adoption Path

This paper is theoretical, but the primitives can be introduced incrementally.

Phase 1: Consolidate agent posture

Goals:

introduce operating profiles
unify instructions across harness, workflows, and scheduled runs
define capability profiles

Expected effect:

more consistent agent behavior

Phase 2: Build attention and observation

Goals:

add attention items
convert scheduled cycles into typed observation passes
introduce site-scoped execution lanes

Expected effect:

proactive but bounded autonomy

Phase 3: Make artifacts first-class

Goals:

implement casefiles for major runs
connect workspace browsing to actual workflow artifacts
attach operator decisions to casefiles

Expected effect:

higher transparency and better memory inputs

Phase 4: Add playbooks and reflections

Goals:

load skill packs on demand
promote high-value reflections into durable memory
connect repeated corrections back into playbooks

Expected effect:

cumulative improvement over time

Phase 5: Tighten the autonomous loop

Goals:

introduce autonomy budgets
formalize model routing and agent topology
unify inbox and escalation handling

Expected effect:

smarter autonomous operation without loss of control

The Deeper Theory

The deepest lesson from OpenClaw is not about daemons, chat apps, or markdown files.

It is that useful agents need a runtime that can answer six questions continuously:

What do I know right now?
What has changed since the last time I looked?
What deserves attention next?
What am I allowed to do about it?
What evidence will justify my recommendation?
What should I remember if this happens again?

Today Auctor can answer pieces of those questions.

The primitives in this paper would let it answer them coherently.

That is what would make Auctor feel smarter:

not more verbose
not more agent-branded
not more "autonomous" in the abstract

But more situationally aware, historically grounded, policy-bounded, and operationally continuous.

Conclusion

The right future for Auctor is not a generic AI operating system.

It is a bounded content operations intelligence layer built on:

canonical workflow state
artifact-centered reasoning
proactive observation
modular playbooks
explicit policy
durable learning

OpenClaw shows the shape of the missing runtime primitives.

Auctor's opportunity is to implement those primitives in a way that respects its own architecture:

database-first
workflow-native
human-gated
site-scoped
operationally auditable

If we do that, Auctor stops being "an app with agents" and becomes "an agent with a real operating model."