Aaron: Artificial Intelligence

Sandboxing Kiro CLI

Aaron — Mon, 08 Jun 2026 22:59:26 GMT

When an AI agent like Kiro executes commands, modifies files, or runs scripts, it does so with whatever permissions your local environment provides. For workflows involving sensitive data, production credentials, or complex automation, this introduces meaningful risk. Sandboxing defines explicit boundaries around what Kiro can and cannot do, reducing that risk without sacrificing productivity.

This guide is written for developers using Kiro on macOS. It covers what sandboxing is, why it matters, and three practical methods for applying it to your workflow.

What Is Sandboxing?
Why Sandboxing Is Needed
Can Hooks Replace a Sandbox?
How to Sandbox Kiro on macOS
Feature Comparison
Advantages of Sandboxing
Limitations of Sandboxing
Conclusion

What Is Sandboxing?

Sandboxing means running an AI agent, or the commands it issues, inside a restricted and isolated environment. The agent retains the ability to perform actions, but only within carefully controlled boundaries.

Consider the analogy of a workshop with a locked tool cabinet. A craftsperson can work freely within the workshop, using whatever tools are available, but cannot access anything locked away or work outside the designated space. Kiro operates in a similar fashion: it can execute commands, modify files, and run scripts, but only within the limits you define.

Why Sandboxing Is Needed

Without defined boundaries, an AI agent operates with the same access rights as the developer running it. In practice, this means it could accidentally:

delete important files
expose API keys or credentials stored in shell profiles
damage the operating system
modify files outside the intended project scope
access private data stored elsewhere on disk

Consider a command as simple as:

rm -rf /

Without restrictions, this would destroy an entire macOS system. Sandboxing prevents this class of accident by restricting what the agent is permitted to touch before any command executes.

To see this in practice: suppose Kiro needs to run python app.py. With sandboxing, execution is constrained to /home/user/project. It cannot reach system folders, modify directories outside the project, or read sensitive files elsewhere. The operation completes normally, but the damage from any mistake is contained.

Can Hooks Replace a Sandbox?

Kiro supports hooks, which are rules that inspect and can block actions before they are executed. A common question is whether hooks provide sufficient security on their own. The short answer is: they help, but they are not a substitute.

A useful way to frame the distinction is:

Hooks are specific checks
A sandbox is the boundary itself

Hooks work by inspecting commands, checking for dangerous patterns, and allowing or blocking behavior accordingly. They are a valuable layer of defense. However, hooks are implemented in software and carry the same vulnerabilities as any software system. If a hook fails to match the exact command invocation used, the command proceeds. A sandbox restricts access at the environment level regardless; the operating system enforces the boundary, not the hook.

Hooks and sandboxing are complementary, not competing. Hooks provide targeted checks; sandboxes provide the hard boundary that catches what hooks miss.

How to Sandbox Kiro CLI on macOS

There are three primary approaches, each suited to a different level of isolation and workflow complexity.

1. Use a Virtual Machine (UTM)

A virtual machine (VM) provides the most complete form of isolation available. It creates an entirely separate macOS environment, with its own kernel, filesystem, and user profile, running as a guest on your physical machine. Kiro, installed inside the VM, has no visibility into your host machine’s files, credentials, or shell configuration.

For macOS on Apple Silicon, UTM is the recommended choice. It is free, and well-suited to running macOS guest environments.

Setup steps:

Download and install UTM from the official site (mac.getutm.app) or from the Mac App Store.
Open UTM and select Virtualize from the start screen.
Choose macOS 12+ as the operating system.
Import or download the macOS installer. If UTM offers the option to continue without selecting an IPSW file, it will use the installer on your boot partition.
Set RAM and CPU limits appropriate to your host machine.
Set a disk size for the virtual environment.
Review and save the configuration on the Summary screen. The VM will appear in the left sidebar.
Launch the VM using the play button. Initial setup will take several minutes to complete.

Once the guest environment is running and kiro-cli is installed inside it, the agent operates in complete isolation. Your host machine’s Documents folder, Desktop, credentials, and shell profiles remain invisible to it.

2. Use a Container (Docker)

Docker containers offer a lighter-weight alternative to a full VM. They share the host macOS kernel but isolate the filesystem, processes, and network from the host environment. If a script runs incorrectly inside the container, only the container is affected; the host machine remains unchanged.

Setup steps:

Create a Dockerfile:

FROM node:lts-slim
RUN apt-get update && apt-get install -y git curl python3 build-essential
WORKDIR /workspace
RUN curl -fsSL https://cli.kiro.dev/install | bash

Build and tag the image:

docker build -f Dockerfile -t kiro-sandbox .

Start the container, mounting only the specific project directory Kiro should access:

docker run -it --rm \
  -v $(pwd):/workspace \
  kiro-sandbox bash

3. Use Application-Level Isolation (SRT)

For developers who prefer to work directly on their local machine without a VM or container, application-level isolation provides a practical middle ground. Anthropic’s Sandbox Runtime (srt) wraps kiro-cli in a declarative sandbox, enforcing filesystem and network boundaries using Apple’s native Seatbelt security framework on macOS.

It’s worth noting that srt is an experimental tool. Its configuration and API may evolve over time.

Setup steps:

Install the Sandbox Runtime via npm:

npm install -g @anthropic-ai/sandbox-runtime

Create the configuration file at ~/.srt-settings.json. The paths listed under allowWrite must include the application’s data directory, or the tool will not start correctly:

{
  "allowPty": true,
  "enableWeakerNestedSandbox": true,
  "enableWeakerNetworkIsolation": true,
  "network": {
    "allowedDomains": [
      "*.kiro.dev",
      "*.amazonaws.com",
      "*.awsapps.com",
      "*.aws.dev"
    ]
  },
  "filesystem": {
    "allowWrite": [
      "./workspace",
      "~/Library/Application Support/kiro-cli/",
      "~/.kiro"
    ]
  }
}

Launch kiro-cli inside the sandbox:

srt kiro-cli chat

On macOS, srt hooks into Apple’s Seatbelt framework to enforce these boundaries at the kernel level. Any attempt by Kiro to write outside the allowWrite list or reach an unlisted domain is blocked before it executes.

Feature Comparison

The three approaches differ not just in setup complexity, but in where the isolation boundary sits in the computing stack. The table below compares them across eight technical dimensions.

Advantages of Sandboxing

Blast Radius Containment: If something goes wrong, damage is confined to the sandbox; the host machine is unaffected and the sandbox can simply be discarded.
Secure Execution of Untrusted Code: Safely test unfamiliar scripts or third-party tools without auditing every line in advance.
Protection Against Zero-Day Exploits: Restricting what an application is permitted to do limits potential damage from vulnerabilities that have not yet been disclosed or patched.
Clean, Reproducible Environments: Each instance starts from a known state, free of stale configuration or leftover artifacts from previous sessions.

Limitations of Sandboxing

Performance and Resource Overhead: Isolation boundaries require computational resources. Virtual machines in particular introduce meaningful CPU, RAM, and startup time costs that may affect developer experience.
Sandbox Escape Vulnerabilities: No isolation mechanism is perfect. A flaw in the sandbox implementation can allow a process to break containment. External data sources, such as repositories containing hidden Unicode payloads, represent a less obvious escape vector.
Context Blindness and Friction: The sandbox may block Kiro’s access to local files or internal tools it legitimately needs. Some configuration is necessary to restore that access without reopening the boundaries the sandbox is meant to enforce.
Sandbox-Aware Behavior: Some malicious software detects when it is running inside a test environment and suppresses its harmful behavior, only revealing itself once it reaches an unrestricted host.
Trusted-Channel Data Poisoning: Allowlisting domains like github.com or google.com grants network access but cannot sanitize the content returned. A prompt injection payload in a repository file, or adversarially crafted search results, can manipulate Kiro’s behavior from within the sandbox. The boundary controls what the agent can reach, not what it reads or how it interprets that content.
No Defense Against Agentic Attack Vectors: Sandboxing operates at the system and network level and does not address the behavioral attack surface of an AI agent. The OWASP Top 10 for LLM Applications identifies risks sandboxing cannot mitigate: goal hijacking, tool misuse, identity abuse, unexpected code execution, and context poisoning. These attacks target model reasoning, not the host OS, and pass through sandbox boundaries undetected.

Conclusion

Sandboxing controls what Kiro can access at the system and network level; it does not govern how the agent reasons about or responds to the content it retrieves. For macOS developers, the three approaches covered here represent a progression from maximum isolation to minimum friction.

A virtual machine is the right choice when the risk profile demands the strongest possible guarantee, such as working with sensitive data or long-running agents.
A container is a sensible default for most development work, offering solid isolation with familiar tooling and low overhead.
Application-level isolation via srt is well-suited to developers who need to stay close to their local environment while still enforcing meaningful boundaries around what Kiro can access.

AWS Kiro Custom Agents: Your First Agent in 15 Minutes

Aaron — Sun, 24 May 2026 17:52:01 GMT

Kiro CLI ships with a default agent (kiro_default), but custom agents let you go further. A custom agent is a named configuration that gives an LLM a specific role, a defined set of tools, and context loaded automatically at startup. Rather than repeating the same prompt setup every session, a custom agent captures it once and makes it instantly available to you and your team.

This tutorial will walk through building a code-reviewer agent to review code with controlled tool access and automatically load a project README on startup. By the end, you will have a working local agent file you can activate, swap to, and commit to version control.

What you will build: a code-reviewer agent that reads files and runs shell commands without prompting, loads your project README automatically, and greets you when activated.

Time: ~15 minutes

Prerequisites

Kiro CLI installed and authenticated (version >= 2.1)
An active chat session (kiro-cli chat)
A project directory with a README.md under version control (git)

Step 1 — Understand where agents live

Kiro looks for agents in two places:

Location

.kiro/agents/ in your project — Scope: only available for the project
~/.kiro/agents/ — Scope: available everywhere

When both locations contain an agent with the same name, the local version takes precedence. This makes local agents a good choice when you want behavior tailored to a specific project, while global agents are better suited for general-purpose assistants you reach for everywhere.

For this tutorial, a local agent keeps things contained to your project.

To begin, create the agents directory in the project:

mkdir -p .kiro/agents

With the directory in place, the next step is creating the configuration file.

Step 2 — Create the agent file

Create .kiro/agents/code-reviewer.json with the following content:

{
  "name": "code-reviewer",
  "description": "Reviews code changes. Reads files and runs git commands without prompting.",
  "prompt": "You are a thorough code reviewer. Focus on correctness, clarity, and security. Be concise.",
  "tools": ["read", "shell"],
  "allowedTools": ["read", "shell"],
  "resources": [
    "file://README.md"
  ],
  "welcomeMessage": "Ready to review. Share a file path or paste a diff."
}

What each field does:

tools — declares what the agent can use
allowedTools — declares what runs without a permission prompt
resources — files loaded into context when the agent starts
welcomeMessage — shown when you switch to this agent

Save the file. Kiro detects new agent files automatically, no restart is required for the agent to appear in the list

Note on config changes: adding a new agent file takes effect immediately. Changes to an existing agent’s configuration, however, take effect the next time you activate the agent (via /agent swap). A running session does not reload mid-conversation.

Step 3 — Activate the agent

Start a chat session:

kiro-cli chat

Inside the session, swap to your new agent:

 /agent

Select code-reviewer from the list. You will see:

✔ Choose one of the following agents · code-reviewer
Ready to review. Share a file path or paste a diff.
code-reviewer · auto

Your README.md is already loaded in context. To confirm, ask the agent something about it:

code-reviewer · auto
What does this project do?

The agent answers using the README content. No file-reading prompt appears because read is in allowedTools and runs silently by design.

Step 4 — Test tool permissions

To see the permission boundary in action, ask the agent to inspect recent changes:

code-reviewer · auto 
What files have changed?

The agent runs git status without prompting, because shell is pre-approved within allowedTools. Now try something outside its approved list:

code-reviewer · auto 
Write a summary to NOTES.md

Kiro will prompt you for permission before writing, because write is not listed in allowedTools. This is the security boundary working as intended.

Step 5 — Restrict write access with toolsSettings

You decide the agent should be able to write, but only to a reviews/ directory. Exit Kiro and update the config to add write capability to both tools and allowedTools:

{
  "name": "code-reviewer",
  "description": "Reviews code changes. Reads files and runs git commands without prompting.",
  "prompt": "You are a thorough code reviewer. Focus on correctness, clarity, and security. Be concise.",
  "tools": ["read", "write", "shell"],
  "allowedTools": ["read", "shell", "write"],
  "toolsSettings": {
    "write": {
      "allowedPaths": ["reviews/**"]
    }
  },
  "resources": [
    "file://README.md"
  ],
  "welcomeMessage": "Ready to review. Share a file path or paste a diff."
}

Create the directory:

mkdir reviews

Start a new session with kiro-cli chat (config changes take effect on next chat activation), and swap to the code-review agent with commands from Step 3:

# activate new session
kiro-cli chat
# activate the code-review agent
/agent

Now ask the agent to write a review:

code-reviewer · auto 
Review project files and save findings to reviews/main-review.md

The agent writes to reviews/main-review.md without prompting. An attempt to write anywhere else will still require confirmation.

Troubleshooting

Agent does not appear in the `/agent` list

Check that the file is valid JSON — a missing comma or bracket will silently prevent the agent from loading. A JSON linter or jq . .kiro/agents/code-reviewer.json can surface syntax errors quickly.

Resource file not found warning

Kiro resolves file:// paths relative to the project root. If README is in a subdirectory, update the path to match: file://docs/REAME.md

Config changes not taking effect

Changes to an existing agent require re-activation. Run /agent to change agents and then swap back to reload the config in the current session.

Conclusion

Agent files live in .kiro/agents/ (local) or ~/.kiro/agents/ (global)
tools declares availability; allowedTools removes the permission prompt
toolsSettings constrains what allowed tools can touch (e.g., allowedPaths for write operations)
resources pre-load files into context at startup

Next steps

Move the prompt to a separate file: "prompt": "file://./prompts/code-reviewer.md" for easier editing
Commit .kiro/agents/code-reviewer.json to version control so teammates get the same agent automatically
Read the official configuration reference for all available fields

AI in the SDLC

Aaron — Fri, 15 May 2026 02:33:32 GMT

The Open Group IT4IT lifecycle of digital products

The conversation around AI-driven software development has never been louder. From Amazon’s opinionated AI-Driven SDLC1 to the growing body of work around Spec-Driven Development2 , the industry is moving quickly to position AI as the definitive solution to an age-old challenge: how do we build software faster, more reliably, and at greater scale?

These frameworks offer genuine value. The productivity gains for teams that adopt them are real, and the examples are compelling. However, the discourse consistently makes the same mistake: it scopes “the SDLC” to mean the build phase and little else.

The true software development lifecycle is far broader. It encompasses Ideation, Architecture, Planning, Build, Operations, Fixes, and Retirement, a complete arc from the first spark of a product idea to the deliberate decommissioning of a system. When we evaluate AI’s role through this wider lens, a more honest and more complicated picture emerges. AI is not a replacement for the SDLC process. It is, at its best, a powerful accelerator but only when its integration into the lifecycle is deliberately and carefully architected.

Why Small Teams Don’t Prove the Case

It is worth acknowledging what AI-first development gets right. For solopreneurs, small startups, and lean product teams, an AI-first SDLC can be genuinely transformative. The context window of a modern AI system is sufficient to hold the full scope of a small codebase. One or two engineers can move with a speed that would have been impossible a few years ago. The gains are real.

The challenge is that these successes are being used to justify adoption at an entirely different scale, large enterprises with hundreds or thousands of engineers, hundreds of services, and decades of accumulated architectural decisions. The properties that make AI effective for a small team do not transfer cleanly to this environment.

At enterprise scale, the assumptions break down. A single AI agent cannot hold the full context of a distributed system spanning dozens of teams and hundreds of services. The clean feedback loop between a developer and an AI assistant becomes a tangled web of dependencies, competing priorities, and organizational constraints. Understanding why requires examining the most fundamental problem with AI at scale: non-determinism.

The Non-Determinism Problem

Non-determinism is not a quirk of current AI systems that will eventually be engineered away. It is an inherent property of the probabilistic models that power them. For small-scale development tasks (writing a function, generating a test suite, summarizing documentation), this is largely acceptable. The cost of variation is low, and a human reviewer catches the drift.

At enterprise scale, the cost of non-determinism compounds rapidly.

Consider a large organization running hundreds of microservices across multiple teams. Each service represents a distinct bounded context, typically owned by a small team. Features and epics orchestrate work across these boundaries at a higher level. If an AI planning agent is responsible for generating specifications across multiple epics simultaneously, it must do so consistently (not just within a single session), but across repeated invocations, different teams, and changing business context.

This is where Spec-Driven Development begins to strain. Ask an AI agent to define the acceptance criteria for a given feature, and it will produce a reasonable answer. Ask it again tomorrow, with slightly different phrasing, and the answer will shift. At small scale, this is manageable. Across hundreds of services and dozens of teams, this drift accumulates into inconsistency that is difficult to detect and expensive to correct.

The deeper issue is accountability. Human developers navigate ambiguity through judgment, context, and professional accountability. When a decision leads to a poor outcome, there is a person who made that call and can learn from it. When an AI agent makes the same decision, a decision from a system that is non-deterministic by design, accountability becomes diffuse. Who owns the output? Who is responsible when acceptance criteria shift between sprints and the resulting system fails to meet business needs? These are not rhetorical questions. They are organizational challenges that must be answered before AI can be safely integrated at scale.

Architecture Consistency at Scale

The non-determinism problem is most consequential when it touches architectural decisions. For large organizations, technical architecture is not a creative exercise, it is a discipline. Architecture principles, patterns, and guidance must be applied consistently across the landscape if that landscape is to remain manageable over time.

Consider what happens when architecture is delegated to an AI without constraints. Asked to design a new service, the AI might select a microservices approach. Asked again for a different service with similar requirements, it might favor a modular monolith. Asked a third time, it might propose an event-driven architecture. While each choice may be individually defensible, collectively they produce a fragmented landscape where every service is a unique artifact, each with its own operational patterns, its own failure modes, and its own runbooks.

This fragmentation also has direct operational and financial consequences. Operations teams cannot apply generalized expertise across services that each behave differently. Incident response becomes slower because runbooks cannot be standardized. Cloud costs become difficult to manage because cloud resource selections (Lambda versus EC2 versus ECS versus EKS), vary by service rather than following a consistent decision framework. FinOps programs, which depend on predictable patterns to optimize spend, are undermined by this inconsistency.

The solution is not to exclude AI from architectural decisions, but to constrain the space in which it operates. Guardrails, prescriptive pattern sets, and architectural governance frameworks give AI agents a bounded set of valid choices. Within that bounded space, AI can accelerate architectural work significantly. Outside it, the long-term costs outweigh the short-term gains.

Operations: The Hidden Cost of Upstream Decisions

Operational complexity does not emerge at deployment time. It is designed in (or more accurately, it is neglected) at the planning and architecture phases. Every decision made upstream about how a service is structured, what data it produces, and how it communicates with its neighbors has a direct consequence for how it will operate in production.

This is the hidden cost that AI-SDLC frameworks consistently underestimate.

For AI agents to participate meaningfully in operations (like detecting anomalies, diagnosing failures, triggering remediations) they require rich, consistent observability signals. Logs must be structured and semantically meaningful. Metrics must cover the right indicators. Distributed traces must propagate correctly across service boundaries. These are not implementation details that can be added after the fact. They must be part of the initial planning phase; rather than discovered as gaps during the first production incident.

This introduces a critical architectural requirement: the feedback loop between operational agents and planning agents must be explicitly designed. When an operational agent encounters a failure it cannot diagnose because the necessary signals are missing, that information must flow back to the planning and architecture layers. The planning agent that generated the original specification must be capable of receiving and incorporating this feedback. Without this loop, the system learns nothing from production, and the same observability gaps are reproduced in every subsequent service.

Furthermore, operational agents cannot be generic. A service built on a Lambda-based event-driven pattern has fundamentally different failure modes than a service built on a long-running container. Effective operational AI requires specialization, agents that understand the specific patterns they are operating in, not agents that reason from first principles about every incident. This in turn, reinforces the argument for architectural consistency: a landscape with fewer distinct patterns requires fewer specialized agents and produces more predictable operational outcomes.

The Phases Nobody Talks About

The build phase receives the majority of attention in AI-SDLC. This is understandable because it is where the most visible productivity gains occur, and it is the phase most responsive to automation. However, a lifecycle that begins at planning and ends at deployment is not a lifecycle ... it is a fragment.

Ideation is where software begins. Before a line of code is written, before an architecture is selected, business context must be translated into product requirements. This is an activity that involves stakeholder negotiation, market understanding, strategic judgment, and organizational politics. AI can assist with this phase: synthesizing research, generating initial requirement drafts, identifying gaps in specifications. However, the judgment about what to build and why remains a human responsibility. AI-SDLC frameworks that begin at the planning phase are implicitly assuming that ideation has already been resolved, which is rarely true in practice.

Fixes represent a continuous parallel track to feature development. Bug reports, production incidents, and security vulnerabilities do not pause while the planning agent generates the next sprint’s epics. AI agents that operate in the fixes track face a different set of constraints than those operating in the feature track: they must reason from incomplete information, work against time pressure, and frequently operate on legacy code that predates any AI involvement. Integrating fixes into an AI-SDLC requires explicit tooling for incident context ingestion, prioritization logic, and safe rollback mechanisms, none of which are addressed in current frameworks.

Retirement and migration may be the most neglected phase of all. Every dependency within a service has a lifecycle of its own. Programming languages release new versions, and libraries reach end-of-life. When a core technology in a service’s stack loses community support or vendor maintenance, the cost of inaction compounds over time with security exposure, incompatibility with adjacent services, and eventual forced migrations under time pressure.

A complete AI-SDLC must account for this. It requires a dedicated monitoring capability with an agent or set of agents whose responsibility is tracking the dependency health of every service in the portfolio. When a dependency approaches end-of-life, the system should surface that signal to the planning layer before it becomes a crisis ... not after. This is not an unexpected requirement; rather, it is the operational reality of maintaining software at scale, and it is almost entirely absent from current AI-SDLC thinking.

What a Real Enterprise AI-SDLC Requires

Having examined the gaps, it is possible to sketch out an AI-SDLC that actually works at enterprise scale. It is not a single agent, a single framework, or a single vendor’s platform. It is a system of systems and a set of specialized, constrained agents operating within a governance structure that preserves human accountability at the strategic layer.

The foundational principle is a clear division of responsibility. Human orchestrators own the “what” and the “why.” They define business strategy, set architectural principles, establish governance guardrails, and make the calls that carry accountability. AI agents own the “how”, by executing within the boundaries that human orchestrators define, accelerating the tedious and dependency-heavy work of implementation, and surfacing information that humans need to make better decisions.

This division only functions if the following components are in place:

Guardrails and constrained pattern sets. Architectural AI agents must operate within a defined set of approved patterns. The set should be small enough to maintain consistency and large enough to cover legitimate variation. Deviations from approved patterns should require human approval, not AI discretion.

Observability-first specifications. Planning agents must generate specifications that include explicit observability requirements, like what logs the service must produce, what metrics it must expose, what traces it must propagate. These requirements are not optional and must be validated before a service is considered complete.

Explicit feedback loops. Operational agents must have a defined channel to communicate signal gaps and failure patterns back to planning and architecture agents. This loop closes the connection between what was designed and what is actually happening in production.

Dependency monitoring. A dedicated agent or capability must track the health and lifecycle status of every dependency across the portfolio, surfacing EOL risks to the planning layer on a continuous basis.

Human accountability checkpoints. Non-deterministic outputs in architectural decisions, acceptance criteria, and migration plans must pass through human review before they are committed to. AI generates the options; humans make the call.

Together, these components address the non-determinism problem not by eliminating it, but by containing it. AI operates freely within bounded, reversible, low-stakes decisions. Human judgment intervenes at the high-stakes, high-consequence points where accountability matters.

Conclusion: The Real Revolution

The most important shift in enterprise software development is not the adoption of AI. It is the recognition that AI adoption requires architectural thinking of the same rigor and care as any other major system integration.

Organizations that treat AI-SDLC as a tool swap by replacing the old process steps with new AI-driven equivalents, will encounter the compounding costs described throughout this article: Fragmented architectures, Inconsistent observability, Untracked dependencies, and Accountability gaps that surface at the worst possible moments.

Organizations that succeed will be those that design the integration deliberately: defining the boundaries within which AI operates, building the feedback loops that keep the system honest, and preserving human judgment at the points where it matters most.

The question facing enterprise software leaders is not “how do we adopt AI in our SDLC?” It is a more precise and more demanding question: “how do we architect a system in which AI and human judgment each do what they do best, across the full lifecycle of every service we operate?”

That question does not have a simple answer. It requires the same systems thinking, strategic clarity, and organizational discipline that have always separated organizations that manage complexity well from those that are managed by it. AI does not change that requirement. Rather, it makes meeting that requirement more achievable than it has ever been.

https://aws.amazon.com/blogs/devops/ai-driven-development-life-cycle/

https://www.thoughtworks.com/radar/techniques/spec-driven-development

Aaron: Artificial Intelligence

Sandboxing Kiro CLI

Table of Contents

What Is Sandboxing?

Why Sandboxing Is Needed

Can Hooks Replace a Sandbox?

How to Sandbox Kiro CLI on macOS

1. Use a Virtual Machine (UTM)

2. Use a Container (Docker)

3. Use Application-Level Isolation (SRT)

Feature Comparison

Advantages of Sandboxing

Limitations of Sandboxing

Conclusion

AWS Kiro Custom Agents: Your First Agent in 15 Minutes

Prerequisites

Step 1 — Understand where agents live

Step 2 — Create the agent file

Step 3 — Activate the agent

Step 4 — Test tool permissions

Step 5 — Restrict write access with toolsSettings

Troubleshooting

Agent does not appear in the /agent list

Resource file not found warning

Config changes not taking effect

Conclusion

Next steps

AI in the SDLC

Why Small Teams Don’t Prove the Case

The Non-Determinism Problem

Architecture Consistency at Scale

Operations: The Hidden Cost of Upstream Decisions

The Phases Nobody Talks About

What a Real Enterprise AI-SDLC Requires

Conclusion: The Real Revolution

Agent does not appear in the `/agent` list