By David Johnsen·June 1, 2026·

AIagentsautomationstrategy

AI Workflow Ranking: What to Automate First

Most AI projects fail at the first decision: which workflow to build for. AI Workflow Ranking is a repeatable way to score every workflow on readiness and value, then pick the first build that actually pays off.

The most expensive mistake in an AI project happens before anyone writes a prompt.

A company decides it wants AI in the business, then picks the workflow that feels most impressive, or the one the loudest person complained about last week. Six weeks later there is a clever demo, a frustrated owner, and a quiet retreat to spreadsheets. The model was fine. The workflow was the wrong one to start with.

Picking the first workflow is a scoring problem, and most teams treat it as a taste problem. This is the scoring method I use to turn "we want to use AI" into a ranked shortlist and one defensible first build.

Takeaway
The best first agent is a narrow, high-value, well-understood workflow with clean inputs and a human who owns the outcome.

questions before you build

↑ ready? worth it?

readiness layers

↑ Purpose to Govern

opportunity levers

↑ SCALE + IDEAS

This is the opening piece in a series on building agents for company work. Start here, because everything in the later pieces assumes you already picked a workflow worth the effort.

Every Workflow Gets Two Scores

A workflow is a repeatable business process: a trigger, a sequence of steps, and an outcome someone cares about. Write each candidate as one line: trigger, steps, outcome, owner.

For example: a customer refund request comes in, support checks the policy and order history, drafts a response, and routes edge cases to the support lead.

Then score each one on two independent questions.

Readiness: is this workflow in a state where an agent can actually do the work? Opportunity: if it worked, would it matter? A workflow needs a strong answer to both. High readiness with low opportunity is a tidy waste of time. High opportunity with low readiness needs prep before it needs an agent.

Picked by gut

Chosen because it sounded impressive
Owner found out after the build started
Inputs turned out to be scattered PDFs
Success was judged by the demo
Stalled when the first exception appeared

Picked by score

Chosen because it scored highest on both axes
Owner named before any build
Inputs confirmed digital and accessible
Success defined against real cases
Exceptions were expected and routed

Question One: Is It Ready?

Readiness scores a workflow across seven layers. Each layer answers a different question about whether the work is in a shape an agent can work with. Score each from 1 (absent or chaotic) to 5 (clean and well-defined).

The order matters. The early layers are about whether the work is even defined. The later ones are about whether you can run it safely. A workflow can be exciting and still score a 1 on Sense because the data lives in someone's inbox.

Purpose: is the outcome and the owner clear? A 1 has a fuzzy goal and nobody accountable. A 5 has one owner and a sharp metric.

Sense: is the input data available and clean? A 1 is scattered and manual. A 5 is structured and accessible.

Interpret: are the rules and exceptions known? A 1 is tribal and undocumented. A 5 is documented and stable.

Decide: are the decisions consistent and explainable? A 1 is gut feel that varies by person. A 5 is ruleable and consistent.

Orchestrate: can the steps be triggered and tracked? A 1 is manual handoffs. A 5 has APIs and logging.

Learn: is there feedback on quality? A 1 has none. A 5 is measured and looped back.

Govern: is there oversight and accountability? A 1 has none. A 5 is reviewed and audited.

What changes

A first build needs two firm layers: Purpose and Sense. Name the owner and point at clean inputs before you ask an agent to operate on the work.

Question Two: Is It Worth It?

Readiness tells you whether you can. Opportunity tells you whether you should. I score it through two lenses: whether the workflow is worth automating at all, and whether AI is the right tool for it.

I think of those lenses as SCALE plus IDEAS: SCALE asks whether the workflow is worth automating, and IDEAS asks whether AI fits the work.

Score each lever 1 to 5. Exact numbers matter less than honest separation. Two workflows that feel equally promising usually pull apart once you score them honestly.

Spend: how much time or money does this workflow consume?
Cadence: how often does it run?
Acceleration: would speeding it up unlock downstream value?
Leverage: does one improvement help many people or teams?
Error cost: how expensive are mistakes here?

Inputs: are the inputs digital and accessible?
Decisions: are the decisions pattern-based and repeatable?
Explainable: can a good answer be checked?
Augmentable: can AI assist while a human keeps control?
Structured: is there a repeatable structure to exploit?

The combination is what tells you where to go next.

High worth, high fit: a strong first-build candidate.
High worth, low fit: valuable and hard. Do the process work first.
Low worth, high fit: easy with low payoff. Good for a quick win or a demo.
High error cost anywhere: keep a human in the loop and add governance early.

The Trap That Sinks First Projects

The most tempting workflow is usually high opportunity and low readiness. It hurts a lot, so it feels urgent. The data is scattered, the rules are undocumented, and ownership is fuzzy, so it is the worst possible place to start.

When a workflow scores high on opportunity and low on readiness, you have found the data and process work that must come first. The honest move is to name it. Fixing the inputs and writing down the rules is real work that makes the eventual agent possible. Skipping it and building anyway is how you get the six-week demo that quietly dies.

Takeaway
A high-opportunity workflow with low readiness is a data project first. Build the agent second.

What changes

Readiness is the cheapest signal you have about whether a project will survive contact with reality. Treat a low score as information that points to the next move.

What a Good First Build Looks Like

Once the scores are in, the right first build is the workflow with the best mix of value, readiness, and how much the business cares about that area right now.

Before you commit, the recommended first build should pass a short test.

It scores at least a 3 on Purpose and a 3 on Sense.
It is narrow enough to describe in a single sentence.
Its actions are reversible, or gated behind human approval.
One named person owns the outcome and will review the output.
A good answer can be checked against something real.

The goal of the first build is to prove one loop end to end, earn trust, and learn enough to pick the second workflow with even better aim.

Where Should You Start?

What does your top workflow actually look like?

The right next step depends on what your highest-scoring workflow is missing.

High opportunity, low readiness. This is the trap. The work hurts, and the inputs are scattered while the rules live in people's heads.

Start with a small data and process pass: get the inputs into one accessible place, write down the rules and the common exceptions, and name the owner. Then it becomes a real candidate.

Run It Yourself

This whole method is packaged as a free, open Claude and Codex skill called AI Workflow Ranking. It is plain text and fully readable before you run it, and it makes zero network calls. Point it at your business and it walks the scoring with you: it inventories your workflows, scores readiness and opportunity, ranks them, and sketches the system for the top pick. The output is a ranked table and a one-page plan.

Codex

Run codex plugin marketplace add cloudbuddy-solutions/ai-workflow-ranking. Then open Codex and install ai-workflow-ranking from the CloudBuddy marketplace.

Claude Code

If you use Claude Code, add the marketplace and install with two commands: /plugin marketplace add cloudbuddy-solutions/marketplace, then /plugin install ai-workflow-ranking@cloudbuddy.

Claude web or desktop

Claude web and desktop use Claude's skill import flow. Paste this GitHub repo URL there: https://github.com/cloudbuddy-solutions/ai-workflow-ranking.

Get the skill on GitHub.View a sample ranking report

Workflow Mapping is the guided service for real workflows. An agent-fluent CloudBuddy partner helps pressure-test your assumptions, spot hidden constraints, and find the automation value you may miss from inside the work. You leave with a sharper ranking and a build-ready recommendation.

Or have CloudBuddy run it with you

Pick the Workflow, Then the Model

The teams that get value from AI start on a workflow that is ready and worth it, prove one loop, and build from there.

Next in this series: how to turn the backlog behind a workflow into an agent queue.

Agent Queues: how AI turns backlogs into systems

Score before you build. The first decision is the one that decides the rest.

David Johnsen

Founder, CloudBuddy Solutions

Want to automate a workflow in your business?

Start with free Workflow Mapping to find your highest-value opportunity.

Request workflow mapping

Where Business Advantage Comes From in the Age of AI

AI made software cheap to produce, which means the software itself is no longer the moat. Four advantages still compound: speed of the full loop, anything anchored to the physical world, data that improves with use, and owning one narrow problem end to end.

Agent Queues: How AI Turns Backlogs Into Systems

The easiest way to understand useful AI at work is to look at the queue: the inbox, ticket list, approval pile, lead backlog, or report stack where work waits for the next step.

Fable 5: The Mythos Model Just Went Public

Anthropic released Fable 5 today, a Mythos-class model anyone with an API key can use. I ran it head-to-head against Opus 4.8 and Sonnet 4.6 on launch day, from quick puzzles to a 150,000-line codebase audit. The frontier difference shows up exactly where you'd expect, and not where you wouldn't.

A Repeatable System Audit Framework for Production Software

A repeatable framework for auditing a SaaS codebase at scale. A set of audit tracks you select and adapt to your system, an invariants loop that stops regressions, and a verification cycle that makes each audit cheaper than the last. One recent application surfaced hundreds of findings and promoted 36 invariants to code-level guardrails.

AI Just Took a Leap. Access Is Becoming a New Advantage.

Anthropic released Mythos Preview to a closed group of organizations. The capability leap is real, but the access model may matter just as much. As AI shifts from an equalizer to a gated advantage, the teams that win will be the ones that can turn that capability into working systems.

Claude Code Leaked. I Looked Under the Hood.

Claude Code CLI accidentally exposed part of its codebase. I pulled the package and looked under the hood. The direction is clear: AI agents are becoming systems.

I Rebuilt My Company Site From My Phone (At the Gym)

What it actually looks like to work with remote AI agents. What worked, what didn't, and how Claude Dispatch compares to our custom AI tools.

Browse workflow ideas →