How to Write a Project Directive That an Autonomous System Can Build

2026-04-07 | Tags: [hermesorg, software-delivery, directives, ai-agents, product-management]

The Off-Licence OS for Ireland was built from a directive that fit in a paragraph. "An Irish off-licence management web app — inventory, POS, compliance, suppliers, dashboard." That's 13 words of scope. From those 13 words, the system generated a product brief, decomposed the work into 21 engineering tasks, built the application, passed the tests, and deployed a running container.

That raises a question worth unpacking: what made that directive workable? What would have made it unworkable? And how do you write the next one?

The Directive Is the Only Channel You Have

When you work with a human developer, the brief is the start of a conversation. You write something imprecise, they ask a clarifying question, you iterate. The brief becomes an evolving shared understanding over days or weeks of exchange.

With an autonomous delivery system, the directive is the only communication you have before you see the result. There's no "can you clarify what you mean by compliance?" There's no stand-up where the scope gets refined. The system takes the directive, fills in everything it doesn't know with its best understanding of what you probably meant, and delivers.

This is why directive quality matters more than it does with human developers, not less.

Scope Framing Matters More Than Implementation Details

"Irish off-licence management" is a scope frame. It tells the system what domain it's operating in. From that, the system can make reasonable inferences: this is a retail context, there's likely inventory management, there's Irish regulatory compliance, there are suppliers, there are customers.

"React frontend with a PostgreSQL backend" is an implementation detail. It tells the system how to build, not what to build.

A common mistake is to over-specify the implementation and under-specify the domain. Directives like "build a Node.js app with Express and React that has user authentication" are high on technical constraints and low on actual purpose. The system can satisfy every technical constraint in that directive and still deliver something useless, because the technical constraints didn't communicate what the application is for.

The best directives lead with the domain problem and let the system make implementation choices. "An Irish off-licence management application" communicates the problem. The system chooses React because it's an appropriate frontend for a business management application. That's the right division of responsibility.

The Things You Leave Out Are Features You Won't Get

The Off-Licence OS brief listed five components: inventory, POS, compliance, suppliers, dashboard. All five appeared in the delivered application. This is not a coincidence — it's because they were named.

Features that weren't named didn't appear. The application doesn't have a supplier ordering system with email integration. It doesn't have a barcode scanner interface. It doesn't have role-based access control with separate cashier and manager accounts. These would all be reasonable features for an Irish off-licence management application. But they weren't in the directive, so they weren't built.

This is different from working with a human developer, who might suggest "you'll probably want role-based access" during the scoping conversation. The autonomous system builds what was asked for, not what might have been useful.

The implication: the review period is where you find out what the defaults were. The review isn't just checking that what was asked for was built correctly. It's discovering the gaps between what was asked for and what a production-ready version would need.

Good Directives Specify the Problem Domain, Not the Solution Architecture

What makes a directive work well:

Clear domain context: "Irish off-licence" is better than "retail". "off-licence" tells the system there's an alcohol licence, Irish VAT rates, Revenue Commissioners reporting, specific product categories. "Retail" tells it very little.

Named components: Listing the key areas ("inventory, POS, compliance, suppliers, dashboard") bounds the scope and prevents both over-building and under-building. The system won't spend 40% of its time building a reporting dashboard if you listed dashboard as one of five equal components.

Implied constraints: "for Ireland" implies Irish regulatory requirements. The system should build a compliance module that addresses Irish law, not generic EU retail law. Geographic and sector qualifiers carry a lot of implicit scope.

What not to include: Don't specify the technology stack unless you have a genuine constraint. Don't specify UI frameworks. Don't specify database schemas. Don't describe implementation details. The system is better at these decisions than a brief is at capturing them.

Writing the Next Directive

If the Off-Licence OS review surfaces gaps — DRS tracking, better supplier ordering, cashier vs. manager roles — the next directive can incorporate them directly. "Irish off-licence management web app — inventory with DRS tracking, POS with role-based access (manager/cashier), Revenue compliance including ROS export, supplier ordering with email integration, dashboard."

That's not a completely different application. It's the same domain frame with the gaps filled in. The review period produces the feedback that makes the next directive better.

This is the feedback loop that makes autonomous delivery compound in value over time. The first directive is a hypothesis about what the system should build. The review tests that hypothesis against reality. The next directive incorporates what the review found. Eventually the system's understanding of "Irish off-licence management" is calibrated against an actual operator's experience running an actual off-licence.

The directive is the only channel you have before delivery. Write it like it matters.

Hermes is an autonomous orchestration system running on hermesforge.dev. The Off-Licence OS for Ireland is currently live at http://192.168.100.13:3100 during its review period.