AI & Machine Learning

How to Build a Virtual Agent Team for Faster Shipping: A Step-by-Step Guide from Docker's Coding Agent Sandboxes Team

2026-05-01 23:58:30

Introduction

Imagine having a team of seven AI agents that test your product, triage issues, post release notes, and even fix bugs—all running autonomously in your CI pipeline. That’s exactly what the Coding Agent Sandboxes team at Docker built with their Fleet. This guide walks you through the same approach so you can create your own virtual agent team. You’ll learn how to move from traditional scripts to autonomous agents that use judgment, not just instructions.

How to Build a Virtual Agent Team for Faster Shipping: A Step-by-Step Guide from Docker's Coding Agent Sandboxes Team
Source: www.docker.com

What You Need

Step-by-Step Guide

Step 1: Define Agent Roles and Responsibilities

Before writing any code, decide what roles your virtual team needs. The Fleet uses roles like:

Each role should have a clear persona, a set of responsibilities, and boundaries on what tools it may use. For example, the build engineer can run make and compile binaries but should not modify source code unless authorized.

Step 2: Create Skill Files for Each Role

A skill file is a markdown document that describes the agent’s role, knowledge, and decision-making process. It is not a step-by-step script; it’s a persona description. For instance:

# /cli-tester

You are an exploratory tester for the sbx CLI tool.
You know how to build, install, and run sbx on Linux, macOS, and Windows.
Your job is to test all CLI commands: create, start, stop, remove, configure networking, mount workspaces.
When you find an issue, investigate its cause and write a detailed report.
You may escalate to the build engineer if needed.

Store these files in a skills/ directory in your repository. The same file will be invoked both locally and in CI—no separate versions needed.

Step 3: Develop Locally First (Local First, CI Second)

This is the core design principle. Always iterate on your skill files on your laptop before wiring them into CI. Why? Because debugging in CI means commit-push-wait-read-logs cycles that take minutes. Locally, you see the agent think in real time, spot confusion, and fix the skill in seconds.

How to do this:

  1. Install the sandbox tool (sbx) and the AI agent (Claude Code) on your machine.
  2. Create a sandbox environment for the agent to work in.
  3. Invoke the skill locally: claude --skill skills/cli-tester.md
  4. Watch the agent build the binaries, run commands, and produce reports.
  5. If the agent misunderstands, edit the skill file and re-invoke. Iterate until the behavior matches your expectations.

Your local machine becomes the primary development environment. CI is just another runtime for the exact same skill.

Step 4: Wire Skills into CI Workflows

Once your skill works locally, add it to your CI system. For example, in a GitHub Actions workflow:

jobs:
  test-cli:
    runs-on: ${{ matrix.os }}
    strategy:
      matrix:
        os: [ubuntu-latest, macos-latest, windows-latest]
    steps:
      - uses: actions/checkout@v3
      - name: Setup environment
        run: ...
      - name: Run CLI tester skill
        run: claude --skill skills/cli-tester.md

Notice: the same skills/cli-tester.md file is called. No translation, no “CI version.” The workflow only sets up the environment and checks out code. The agent does the rest.

How to Build a Virtual Agent Team for Faster Shipping: A Step-by-Step Guide from Docker's Coding Agent Sandboxes Team
Source: www.docker.com

Make separate workflows for different roles. For example, run the /cli-tester nightly on all three OSes, and the /triage agent after each issue update.

Step 5: Run Autonomous Tasks and Review Reports

Now let your Fleet work. Each agent will:

All reports are written to a shared location (GitHub issues, comments, or a dedicated folder). The team reviews them daily.

Step 6: Iterate and Refine Based on Failures

Autonomous agents will make mistakes. When a test fails unexpectedly, a script stops. A role (agent) investigates. But you still need to improve the skill files over time.

After each CI run, examine the agent’s output:

Update the corresponding skill file locally, test it, then commit the change. The same iterative cycle applies: local first, then CI.

Tips for Success

Building a virtual agent team is not about replacing engineers—it’s about freeing them from repetitive tasks so they can focus on higher-level problems. Start small, iterate locally, and watch your shipping velocity increase.

Explore

Steam Controller Launch Day Gets Instant Mounting Ecosystem: Mechanism Basegrip Unveiled Inside AWS’s 2026 Roadmap: Q&A on the Biggest Agentic AI Announcements A Blueprint for Collaborative Design Leadership: Balancing People and Craft How to Evaluate a Surgeon General Nominee: A Closer Look at Nicole Saphier's Stance on MAHA Health Topics Exploring AI's Role in Accessibility: Opportunities and Cautions