Welcome to the Video Agents Foundry Solution, designed to help businesses use AI agents to automate complex video analysis tasks. It provides an end-to-end framework for deploying AI-powered video analysis at the edge using Azure Video Indexer with Azure Arc, combined with intelligent agents for real-time insights and automated decision-making. The solution enables processing of live and recorded video streams at the edge, extracting insights like face detection, object tracking, and action recognition, while using AI agents to automate workflows with low latency and full data sovereignty.
In order to successfully deploy the VI Extension, you must first submit your subscription for approval and be approved. Please use this application link. this form.
Deployment typically takes 45-60 minutes end-to-end (
azd up). During this time, some resources in the Azure Portal may appear as "Failed" or "Unavailable"—this is expected as certain services take longer to initialize. Please allow up to an hour for everything to become fully operational. If issues persist, refer to the troubleshooting guide.
SOLUTION OVERVIEW | Getting Started | RESOURCE CLEAN-UP | BUSINESS SCENARIO | SUPPORTING DOCUMENTATION | TROUBLESHOOTING
Note: With any AI solutions you create using these templates, you are responsible for assessing all associated risks and for complying with all applicable laws and safety standards. Learn more in the transparency documents for Agent Service and Agent Framework.
The solution leverages Azure Arc Service to extend Azure Video Indexer capabilities to edge environments running on Kubernetes clusters. By deploying the Video Indexer Arc extension on an Arc-enabled AKS cluster with GPU support (NVIDIA), the solution enables real-time video processing close to the data source — reducing latency and ensuring data sovereignty. AI agents built on Azure OpenAI orchestrate the video analysis pipeline, automating tasks such as content moderation, safety monitoring, and business intelligence extraction from video feeds. The agentic architecture allows multiple specialized agents to collaborate on complex video analysis tasks, with each agent handling a specific domain such as speech analysis, visual recognition, or compliance checking.
This solution optionally creates a Microsoft Foundry project and Foundry Tools (enabled by default; set CREATE_FOUNDRY_PROJECT=false to skip). More details about the resources can be found in the resources documentation.
![]() |
|---|
Learn more about the key features this solution enables
- Edge-Deployed Video Analysis — Run Azure Video Indexer on Arc-enabled Kubernetes clusters at the edge, bringing AI-powered video analysis closer to data sources for low-latency processing.
- Real-Time & Batch Video Processing — Support for both live video pipeline ingestion (RTSP streams) and batch processing of recorded video files.
- Multimodal AI Insights — Extract rich insights from video content including speech transcription, OCR, face detection, object tracking, scene detection, and action recognition.
- AI Agent–Based Automation — Leverage intelligent agents powered by Azure OpenAI to automate complex video analysis workflows, enabling automated decision-making and action triggering.
- Cloud-Managed Edge Deployments — Manage and monitor edge deployments centrally through Azure Arc while processing video data locally.
- Data Sovereignty & Compliance — Process sensitive video data on-premises or at the edge, ensuring data residency and compliance requirements are met.
- Extensible Agentic Architecture — Build custom agent teams for domain-specific video analysis scenarios such as retail analytics, manufacturing quality control, or safety monitoring.
Follow the quick deploy steps on the deployment guide to deploy this solution to your own Azure subscription.
Note: This solution accelerator requires Azure Developer CLI (azd) version 1.18.0 or higher. Please ensure you have the latest version installed before proceeding with deployment. Download azd here.
Click here to launch the deployment guide
⚠️ Important: Check Azure OpenAI Quota Availability
To ensure sufficient quota is available in your subscription, please follow quota check instructions guide before you deploy the solution.
This template creates everything you need to get started with Microsoft Foundry (when CREATE_FOUNDRY_PROJECT=true, the default):
| Resource | Description |
|---|---|
| Azure AKS | Provides the Kubernetes cluster infrastructure with GPU support for running edge video processing workloads |
| Video Indexer Account | Azure Video Indexer account for AI-powered video analysis, extracting insights such as speech transcription, OCR, face detection, and object tracking |
| Video Indexer Enabled by Arc | Extends Azure Video Indexer capabilities to edge environments via Azure Arc, enabling real-time video processing on Arc-enabled Kubernetes clusters |
| Azure AI Project | Provides a collaborative workspace for AI development with access to models, data, and compute resources |
| Azure OpenAI Service | Powers the AI model. Supports only gpt-5.2 |
| Storage Account | Provides blob storage for application data and file uploads |
| Application Insights | Optional - Provides application performance monitoring, logging, and telemetry for debugging and optimization |
| Log Analytics Workspace | Optional - Collects and analyzes telemetry data for monitoring and troubleshooting |
Note: You should validate model availability in your region before deployment. The default model deployed with this solution is gpt-5.2, but you can customize the model and deployment configurations as needed. For more information on available models and their capabilities, please refer to the Azure OpenAI Service documentation: Foundry Models
To prevent incurring unnecessary charges, it's important to clean up your Azure resources after completing your work with the application.
-
When to Clean Up:
- After you have finished testing or demonstrating the application.
- If the application is no longer needed or you have transitioned to a different project or environment.
- When you have completed development and are ready to decommission the application.
-
Deleting Resources: To delete all associated resources and shut down the application, execute the following command:
azd down
Please note that this process may take up to 20 minutes to complete.
- Azure Subscription — An active Azure subscription. Create one for free.
- VI Arc Extension Approval — Submit your subscription for approval via this form before deploying.
- GPU Quota — Sufficient GPU quota for
Standard_NC4as_T4_v3VMs in your target region (minimum 4 vCPUs for the NCasv3 family). - Azure CLI — Version 2.x or higher. Install Azure CLI.
- kubectl — Kubernetes command-line tool. Install kubectl.
- Helm 3 — Kubernetes package manager. Install Helm.
- Azure Developer CLI (azd) — Version 1.18.0 or higher. Install azd.
Pricing varies per region and usage, so it isn't possible to predict exact costs for your usage. The majority of the Azure resources used in this infrastructure are on usage-based pricing tiers.
You can try the Azure pricing calculator for the resources:
- Microsoft Foundry: Free tier. Pricing
- Azure Storage Account: Standard tier, LRS. Pricing is based on storage and operations. Pricing
- Foundry Tools: S0 tier, supports only gpt-5.2. Pricing is based on token count. Pricing
⚠️ Important: To avoid unnecessary costs, remember to take down your app if it's no longer in use, either by deleting the resource group in the Portal or runningazd down.
![]() |
|---|
Learn more about what value this solution provides
- Real-Time Decision Making — Enable immediate, automated responses to events captured in video feeds, reducing the time from observation to action from hours or days to seconds.
- Reduced Operational Costs — Automate video monitoring and analysis tasks that previously required dedicated staff for manual review, significantly lowering labor costs.
- Edge Data Sovereignty — Process sensitive video data locally at the edge, ensuring compliance with data residency regulations and reducing bandwidth costs from streaming video to the cloud.
- Scalable Intelligence — Deploy consistent AI-powered video analysis across multiple locations (stores, factories, facilities) using a centralized cloud management model via Azure Arc.
- Actionable Insights — Transform raw video feeds into structured, actionable business intelligence through multimodal AI analysis and agent-driven automation.
Learn more about what use cases this solution provides
| Use Case | Persona | Challenges | Summary/Approach |
|---|---|---|---|
| Retail Analytics | Store Manager | Manual monitoring of customer behavior across multiple camera feeds is time-consuming and error-prone | AI agents analyze video feeds in real time for foot traffic patterns, shelf interaction, customer demographics, and queue lengths to optimize store layout and staffing |
| Manufacturing Quality Control | Quality Engineer | Visual inspection of production lines requires constant human attention and is subject to fatigue-related errors | Automated defect detection and quality monitoring via edge-deployed video analysis with AI agents that flag anomalies and trigger alerts in real time |
| Safety & Compliance Monitoring | Safety Officer | Reviewing security footage after incidents is reactive rather than preventive | Real-time safety monitoring with AI agents that detect PPE violations, unauthorized access, hazardous conditions, and automatically generate compliance reports |
| Smart City & Traffic Management | Urban Planner | Collecting and analyzing traffic data across a city requires expensive infrastructure and manual data processing | Edge-deployed video analysis at intersections provides real-time traffic flow data, incident detection, and pedestrian safety monitoring powered by AI agents |
This template also uses Managed Identity for local development and deployment.
To ensure continued best practices in your own repository, we recommend that anyone creating solutions based on our templates ensure that the GitHub secret scanning setting is enabled.
You may want to consider additional security measures, such as:
- Enabling Microsoft Defender for Cloud to secure your Azure resources.
- Protecting the Azure Container Apps instance with a firewall and/or Virtual Network.
Important Security Notice
This template, the application code and configuration it contains, has been built to showcase Microsoft Azure specific services and tools. We strongly advise our customers not to make this code part of their production environments without implementing or enabling additional security features.
For a more comprehensive list of best practices and security recommendations for Intelligent Applications, visit our official documentation.
- Azure Video Indexer enabled by Arc — Ignite 2025 Preview Blog
- Azure Video Indexer Documentation
- Azure Arc Documentation
- AKS Cluster Setup for Video Indexer Arc Extension
- Azure Video Indexer Samples on GitHub
Have questions, find a bug, or want to request a feature? Submit a new issue on this repo and we'll connect.
Please refer to Transparency FAQ for responsible AI transparency details of this solution accelerator.
This release is an artificial intelligence (AI) system that generates text based on user input. The text generated by this system may include ungrounded content, meaning that it is not verified by any reliable source or based on any factual data. The data included in this release is synthetic, meaning that it is artificially created by the system and may contain factual errors or inconsistencies. Users of this release are responsible for determining the accuracy, validity, and suitability of any content generated by the system for their intended purposes. Users should not rely on the system output as a source of truth or as a substitute for human judgment or expertise.
This release only supports English language input and output. Users should not attempt to use the system with any other language or format. The system output may not be compatible with any translation tools or services, and may lose its meaning or coherence if translated.
This release does not reflect the opinions, views, or values of Microsoft Corporation or any of its affiliates, subsidiaries, or partners. The system output is solely based on the system's own logic and algorithms, and does not represent any endorsement, recommendation, or advice from Microsoft or any other entity. Microsoft disclaims any liability or responsibility for any damages, losses, or harms arising from the use of this release or its output by any user or third party.
This release does not provide any financial advice, legal advice and is not designed to replace the role of qualified client advisors in appropriately advising clients. Users should not use the system output for any financial decisions, legal guidance or transactions, and should consult with a professional financial advisor and or legal advisor as appropriate before taking any action based on the system output. Microsoft is not a financial institution or a fiduciary, and does not offer any financial products or services through this release or its output.
This release is intended as a proof of concept only, and is not a finished or polished product. It is not intended for commercial use or distribution, and is subject to change or discontinuation without notice. Any planned deployment of this release or its output should include comprehensive testing and evaluation to ensure it is fit for purpose and meets the user's requirements and expectations. Microsoft does not guarantee the quality, performance, reliability, or availability of this release or its output, and does not provide any warranty or support for it.
This Software requires the use of third-party components which are governed by separate proprietary or open-source licenses as identified below, and you must comply with the terms of each applicable license in order to use the Software. You acknowledge and agree that this license does not grant you a license or other right to use any such third-party proprietary or open-source components.
To the extent that the Software includes components or code used in or derived from Microsoft products or services, including without limitation Microsoft Azure Services (collectively, "Microsoft Products and Services"), you must also comply with the Product Terms applicable to such Microsoft Products and Services. You acknowledge and agree that the license governing the Software does not grant you a license or other right to use Microsoft Products and Services. Nothing in the license or this ReadMe file will serve to supersede, amend, terminate or modify any terms in the Product Terms for any Microsoft Products and Services.
You must also comply with all domestic and international export laws and regulations that apply to the Software, which include restrictions on destinations, end users, and end use. For further information on export restrictions, visit https://aka.ms/exporting.
You acknowledge that the Software and Microsoft Products and Services (1) are not designed, intended or made available as a medical device(s), and (2) are not designed or intended to be a substitute for professional medical advice, diagnosis, treatment, or judgment and should not be used to replace or as a substitute for professional medical advice, diagnosis, treatment, or judgment. Customer is solely responsible for displaying and/or obtaining appropriate consents, warnings, disclaimers, and acknowledgements to end users of Customer's implementation of the Online Services.
You acknowledge the Software is not subject to SOC 1 and SOC 2 compliance audits. No Microsoft technology, nor any of its component technologies, including the Software, is intended or made available as a substitute for the professional advice, opinion, or judgment of a certified financial services professional. Do not use the Software to replace, substitute, or provide professional financial advice or judgment.
BY ACCESSING OR USING THE SOFTWARE, YOU ACKNOWLEDGE THAT THE SOFTWARE IS NOT DESIGNED OR INTENDED TO SUPPORT ANY USE IN WHICH A SERVICE INTERRUPTION, DEFECT, ERROR, OR OTHER FAILURE OF THE SOFTWARE COULD RESULT IN THE DEATH OR SERIOUS BODILY INJURY OF ANY PERSON OR IN PHYSICAL OR ENVIRONMENTAL DAMAGE (COLLECTIVELY, "HIGH-RISK USE"), AND THAT YOU WILL ENSURE THAT, IN THE EVENT OF ANY INTERRUPTION, DEFECT, ERROR, OR OTHER FAILURE OF THE SOFTWARE, THE SAFETY OF PEOPLE, PROPERTY, AND THE ENVIRONMENT ARE NOT REDUCED BELOW A LEVEL THAT IS REASONABLY, APPROPRIATE, AND LEGAL, WHETHER IN GENERAL OR IN A SPECIFIC INDUSTRY. BY ACCESSING THE SOFTWARE, YOU FURTHER ACKNOWLEDGE THAT YOUR HIGH-RISK USE OF THE SOFTWARE IS AT YOUR OWN RISK.

