AI Deploy Assistant

⚠️ Sample / Illustration repository — NOT production-ready.

This codebase is published as a reference implementation to illustrate a multi-agent deployment-assistant pipeline. It is not intended for direct production use. Before deploying anywhere beyond a sandbox, you are expected to conduct your own security review, complete the deferred posture items listed in Security & Known Posture Items, harden IAM scoping, validate the prompt-injection surface, and add the operational controls (auth, audit, rate limiting, observability, DR) appropriate to your environment. The maintainers make no warranty as to fitness for any particular purpose.

A generalized, AI-powered product deployment assistant. Uses a multi-agent pipeline (Strands + Amazon Bedrock) to guide users through deploying any product catalog on AWS: gathering requirements via interview, generating architecture designs, producing Infrastructure-as-Code (CloudFormation), and creating documentation.

The system is product-agnostic — the product catalog, interview fields, deployment patterns, and validation rules are all driven by configuration files (config.yaml + catalog.lock.yaml), not hardcoded logic.

Quick Start

# Install dependencies
cd backend && uv sync && cd ..
cd frontend && pnpm install && cd ..

# Configure environment
cp backend/.env.sample backend/.env
# Edit backend/.env — set AI_DEPLOY_AWS_REGION to your Bedrock-enabled region

# Start both services
./dev.sh

Backend: http://localhost:8000/ping | Frontend: http://localhost:3000

See Local Development Guide for detailed setup.

Architecture

Browser → Next.js Frontend (wizard UI)
           ↓ API calls
         FastAPI Backend (orchestrator)
           ├── Interview Planner (Sonnet 4.5) → plans questions from KB + catalog
           ├── Interview Executor (Haiku 4.5) → gathers requirements from user
           ├── Design Agent (Sonnet 4.5) → generates 3 architecture options
           ├── IaC Agent (Sonnet 4.5) → produces CloudFormation templates
           └── Documentation Agent (Sonnet 4.5) → diagrams + user guide
                    ↕
              Knowledge Base (Bedrock KB or Local files)
              + Catalog Lock File (deterministic field schema)

Core Concepts

Two Config Files

| File | Purpose | Maintained by |
|------|---------|---------------|
| `config.yaml` | Product identity + KB connection + policy overrides (~10 lines) | Developer (hand-edited) |
| `catalog.lock.yaml` | Full product schema: use cases, fields, patterns, appliance config | Generated from KB, reviewed + committed |

4-Stage Pipeline

1. Interview: AI-guided requirements gathering (fields from catalog)
2. Design: 3 architecture options grounded in KB documents
3. IaC: CloudFormation generation (parameterize, compose, or generate)
4. Documentation: Architecture diagram and user guide
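
The stage ordering above can be sketched as a small enum with a helper that returns the following stage. This is only an illustration of the sequence listed in this section. The names `Stage` and `next_stage` are hypothetical and are not taken from the backend code.

```python
from enum import Enum
from typing import Optional


class Stage(Enum):
    """The four pipeline stages, in the order this README lists them."""

    INTERVIEW = "interview"
    DESIGN = "design"
    IAC = "iac"
    DOCUMENTATION = "documentation"


def next_stage(current: Stage) -> Optional[Stage]:
    """Return the stage that follows `current`, or None after the last one."""
    stages = list(Stage)  # Enum iteration preserves definition order
    index = stages.index(current)
    return stages[index + 1] if index + 1 < len(stages) else None
```

For example, `next_stage(Stage.INTERVIEW)` yields `Stage.DESIGN`, and `next_stage(Stage.DOCUMENTATION)` yields `None`.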

Knowledge Base Provider

The system supports three KB modes (auto-selected by config):

| Mode | When | How |
|------|------|-----|
| Bedrock | `AI_DEPLOY_KNOWLEDGE_BASE_ID` is set | AWS Bedrock KB API with vector search |
| Local | `knowledge_base.local_path` in config.yaml | TF-IDF search over local markdown files |
| Null | Neither configured | Graceful no-op (LLM uses built-in knowledge) |
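
A minimal sketch of how the mode selection in the table could work. The function name `select_kb_mode` is hypothetical. Checking the Bedrock ID before the local path is an assumption, inferred from the note elsewhere in this README that an empty `AI_DEPLOY_KNOWLEDGE_BASE_ID` selects local mode. The actual provider factory may differ.

```python
from typing import Optional


def select_kb_mode(knowledge_base_id: Optional[str], local_path: Optional[str]) -> str:
    """Pick a KB mode from the two settings named in the table above.

    Assumption: a non-empty Bedrock KB ID wins over a local path.
    """
    if knowledge_base_id:  # None and "" both count as "not set"
        return "bedrock"
    if local_path:
        return "local"
    return "null"  # graceful no-op provider
```

With this sketch, `select_kb_mode("", "knowledge-base/")` selects `"local"`, and `select_kb_mode(None, None)` falls through to `"null"`.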

Project Structure

├── config.yaml              # Product identity (hand-edited)
├── catalog.lock.yaml        # Generated product schema (committed)
├── compose.yaml             # Docker Compose for Floci (local AWS emulator)
├── dev.sh                   # One-command local development startup
├── knowledge-base/          # Local KB documents (dev fallback)
│   ├── realtime-inference/  # use_case/deployment_type/doc_type.md
│   ├── batch-inference/
│   └── training/
├── scripts/                 # Setup and utility scripts
│   ├── setup-local.sh       # Provision Floci resources (DynamoDB, S3, SQS, Cognito)
│   ├── setup-bedrock-kb.sh  # Create/configure Bedrock Knowledge Base
│   └── generate-kb-metadata.sh  # Generate KB sidecar metadata files
├── backend/
│   ├── lambdas/ws/          # WebSocket Lambda handlers (authorizer, connect, etc.)
│   └── src/
│       ├── config/          # Settings, auth, AWS client, metrics, observability
│       ├── agents/          # LLM agent implementations (interview, design, iac, docs)
│       ├── models/          # Pydantic data models
│       ├── services/        # Business logic (catalog loader, KB provider, plan cache, etc.)
│       ├── storage/         # DynamoDB+S3 storage layer
│       ├── tools/           # Agent tools (KB search, save_artifact, mermaid validator)
│       ├── prompts/         # Template prompt files ({product_name} variables)
│       ├── validation/      # cfn-lint, cfn-guard, checkov pipeline
│       ├── workers/         # SQS Lambda workers + local async worker
│       └── routes/          # FastAPI endpoints
├── frontend/                # Next.js wizard UI
└── infra/                   # AWS CDK infrastructure (ECS, Lambda, DynamoDB, etc.)

Configuration Reference

See Configuration Guide for full schema documentation.

Environment Variables

All env vars use the AI_DEPLOY_ prefix. Key variables:

| Variable | Required | Description |
|----------|----------|-------------|
| `AI_DEPLOY_AWS_REGION` | Yes | AWS region for Bedrock and services (default: `us-west-2`) |
| `AI_DEPLOY_KNOWLEDGE_BASE_ID` | No | Bedrock KB ID (omit for local KB) |
| `AI_DEPLOY_DYNAMODB_TABLE` | No | DynamoDB table name (default: `ai-deploy-table`) |
| `AI_DEPLOY_S3_ARTIFACTS_BUCKET` | No | S3 bucket for artifacts (default: `ai-deploy-artifacts`) |
| `AI_DEPLOY_PRIMARY_MODEL_ID` | No | Bedrock model for design/planning (default: Sonnet 4.5) |
| `AI_DEPLOY_LIGHTWEIGHT_MODEL_ID` | No | Bedrock model for interview execution (default: Haiku 4.5) |
| `AI_DEPLOY_COGNITO_USER_POOL_ID` | No | Cognito pool ID (omit for local dev) |
| `AI_DEPLOY_AGENTCORE_MEMORY_ID` | No | AgentCore Memory ID for cross-session memory |
| `AI_DEPLOY_DEBUG` | No | Enable debug mode (default: `false`) |

See backend/.env.sample for the complete list.
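
As an illustration of the prefix convention, the sketch below reads a few of the variables above with the defaults given in the table. It uses only the standard library. The `Settings` dataclass, `load_settings`, and the set of strings accepted as "true" are assumptions for this example and do not describe the backend's actual settings module.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

PREFIX = "AI_DEPLOY_"


@dataclass(frozen=True)
class Settings:
    aws_region: str
    knowledge_base_id: Optional[str]
    dynamodb_table: str
    s3_artifacts_bucket: str
    debug: bool


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Build Settings from AI_DEPLOY_-prefixed variables in `env`."""

    def get(name: str, default: Optional[str] = None) -> Optional[str]:
        # Treat an empty value the same as an unset variable.
        value = env.get(PREFIX + name, "")
        return value if value else default

    return Settings(
        aws_region=get("AWS_REGION", "us-west-2"),
        knowledge_base_id=get("KNOWLEDGE_BASE_ID"),
        dynamodb_table=get("DYNAMODB_TABLE", "ai-deploy-table"),
        s3_artifacts_bucket=get("S3_ARTIFACTS_BUCKET", "ai-deploy-artifacts"),
        # Assumption: these spellings count as "true"; anything else is false.
        debug=get("DEBUG", "false").lower() in {"1", "true", "yes"},
    )
```

Passing a plain dict instead of `os.environ` makes the loader easy to exercise in tests, for example `load_settings({"AI_DEPLOY_DEBUG": "true"})`.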

Development

# Run backend only
cd backend && uv run uvicorn src.main:app --reload

# Run tests
cd backend && uv run pytest tests/ -q

# Run frontend
cd frontend && pnpm dev

Local Knowledge Base

For development without AWS Bedrock access, place documents in knowledge-base/:

knowledge-base/
  {use_case}/
    {deployment_type}/
      {document_type}.md    # architecture, sizing, configuration, etc.

The local KB provider indexes these files and performs TF-IDF text search with the same metadata filtering as Bedrock. To use local mode, leave AI_DEPLOY_KNOWLEDGE_BASE_ID empty (AI_DEPLOY_KNOWLEDGE_BASE_ID="") and set knowledge_base.local_path in config.yaml.
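
To make the idea concrete, here is a small standard-library sketch of TF-IDF search with metadata filtering derived from the directory layout above. It is not the project's provider. The function names, the smoothed IDF formula, and the choice to compute IDF over the filtered documents only are all assumptions made for illustration.

```python
import math
import re
from collections import Counter
from pathlib import PurePosixPath
from typing import Dict, List, Optional, Tuple


def metadata_from_path(path: str) -> Dict[str, str]:
    """Derive metadata from {use_case}/{deployment_type}/{document_type}.md."""
    parts = PurePosixPath(path).parts
    return {
        "use_case": parts[-3],
        "deployment_type": parts[-2],
        "document_type": PurePosixPath(parts[-1]).stem,
    }


def tokenize(text: str) -> List[str]:
    return re.findall(r"[a-z0-9]+", text.lower())


def search(
    docs: Dict[str, str],
    query: str,
    filters: Optional[Dict[str, str]] = None,
) -> List[Tuple[str, float]]:
    """Rank documents (path -> text) by TF-IDF score, best match first."""
    filters = filters or {}
    # Keep only documents whose path metadata matches every filter.
    candidates = {
        path: tokenize(text)
        for path, text in docs.items()
        if all(metadata_from_path(path).get(k) == v for k, v in filters.items())
    }
    n = len(candidates)
    # Document frequency: in how many candidate documents each term appears.
    df = Counter(term for tokens in candidates.values() for term in set(tokens))
    results = []
    for path, tokens in candidates.items():
        if not tokens:
            continue
        tf = Counter(tokens)
        score = sum(
            (tf[t] / len(tokens)) * (math.log((1 + n) / (1 + df[t])) + 1.0)
            for t in tokenize(query)
            if t in tf
        )
        if score > 0:
            results.append((path, score))
    return sorted(results, key=lambda item: item[1], reverse=True)
```

A call such as `search(docs, "sizing guidance", {"use_case": "realtime-inference"})` would restrict the ranking to documents under `knowledge-base/realtime-inference/`.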

Security & Known Posture Items

This repository is a sample. The list below tracks known posture items that are intentionally deferred or suppressed with rationale. Treat this as a starting point for your own security review, not a sign-off.

Known deferred refactors

Bedrock Knowledge Base ARN scope — The Lambda and ECS task IAM grants for bedrock-agent-runtime:Retrieve / RetrieveAndGenerate use a knowledge-base/* resource wildcard scoped to the deployment account+region. This is because the policy is generated before the KB ID is known. The recommended hardening is to thread the actual KB ID (or its CfnOutput-derived ARN) into the IAM grant so the wildcard is replaced with knowledge-base/<id>. Findings are suppressed in infra/lib/lambda.ts and infra/lib/ecs.ts with reference to this section. Tracked as a follow-up.
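
The hardening described above amounts to swapping the wildcard for a concrete ID in the resource ARN. The sketch below shows that substitution as plain string building in Python. The real grants live in the CDK code under infra/lib/. The helper name `knowledge_base_arn` is hypothetical, and the ARN layout (`arn:<partition>:bedrock:<region>:<account>:knowledge-base/<id>`) is stated here as an assumption to verify against the AWS documentation.

```python
from typing import Optional


def knowledge_base_arn(
    region: str,
    account_id: str,
    knowledge_base_id: Optional[str] = None,
    partition: str = "aws",
) -> str:
    """Return a KB resource ARN, falling back to a wildcard when no ID is known.

    Assumed layout: arn:<partition>:bedrock:<region>:<account>:knowledge-base/<id>
    """
    kb_part = knowledge_base_id if knowledge_base_id else "*"
    return f"arn:{partition}:bedrock:{region}:{account_id}:knowledge-base/{kb_part}"
```

Called without an ID, the helper reproduces the current wildcard scope. Called with one, it produces the narrower `knowledge-base/<id>` resource that the follow-up aims for.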

Suppressed cdk-nag findings (with rationale at the suppression site)

AwsSolutions-APIG4 on WebSocket $disconnect and subscribe routes — API Gateway WebSocket APIs only accept authorizers on $connect (AWS platform constraint). Auth context propagates via requestContext.authorizer and the subscribe handler enforces tenant isolation. See infra/lib/websocket.ts and backend/lambdas/ws/ws_subscribe.py.
AwsSolutions-IAM4 on Lambda service roles — the AWS-managed AWSLambdaBasicExecutionRole and AWSLambdaVPCAccessExecutionRole are the canonical platform integrations. See infra/lib/lambda.ts.
AwsSolutions-IAM5 action wildcards on kms:GenerateDataKey* and kms:ReEncrypt* — these expand within the kms: namespace only and are resource-bound to the customer-managed key on the same statement. See infra/lib/ecs.ts.
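
For the APIG4 item above, the following sketch shows what a tenant-isolation check in a subscribe handler might look like when the authorizer context arrives via `requestContext.authorizer`. It is not the code in backend/lambdas/ws/ws_subscribe.py. The `tenantId` key, the message body shape, and the response format are assumptions chosen for illustration.

```python
import json
from typing import Any, Dict


def handle_subscribe(event: Dict[str, Any]) -> Dict[str, Any]:
    """Reject a subscribe request whose tenant differs from the caller's."""
    # Context set by the $connect authorizer (hypothetical key: tenantId).
    authorizer = event.get("requestContext", {}).get("authorizer") or {}
    caller_tenant = authorizer.get("tenantId")

    try:
        body = json.loads(event.get("body") or "{}")
    except json.JSONDecodeError:
        return {"statusCode": 400, "body": "invalid JSON"}
    if not isinstance(body, dict):
        return {"statusCode": 400, "body": "invalid message"}

    # Deny when no tenant is attached to the connection or tenants differ.
    if not caller_tenant or body.get("tenantId") != caller_tenant:
        return {"statusCode": 403, "body": "forbidden"}
    return {"statusCode": 200, "body": "subscribed"}
```

The check fails closed: a connection with no tenant in its authorizer context is refused even if the message body names one.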

Deployment

See AWS Deployment Guide for CDK-based production deployment.

Authors

Ryan Dsilva, AI Acceleration Architect at AWS

Ragib Ahsan, AI Acceleration Architect II at AWS
