2.1 Kicking Off Azure AI Week!

October 15, 2024 · 8 min read

Senior AI Advocate

Senior Developer Advocate

Welcome to Day 1️⃣ of Azure AI Week on the #30 Days Of IA series!

Want to build a custom copilot with more control over every step of the end-to-end workflow from prompt to production? Join us for a week of posts as we walk through two signature application samples that teach you how to deliver generative AI solutions using Prompty for ideation, Azure AI Studio for development, and Azure Container Apps for deployment.

Ready? Let's get started!

What We'll Cover Today

What are we building? - Let's talk about the application scenarios.
How are we building it? - Let's discuss the app architecture.
What tasks do I need to do? - Let's review the end-to-end developer workflow.
The Week Ahead - Let's preview the week's content.
Call To Action - Take these 3 actions to skill up on Azure AI!

Card Banner

When we begin any application development journey, we start with design before proceeding to development and delivery of the solution. For the design phase, we focus on three questions:

App Scenario: What use cases and interaction flows should we support?
App Architecture: What resources do we need to build the application?
App Lifecycle: What is our end-to-end workflow from prompt to production?

We can then think about development and delivery requirements from three perspectives:

Design Patterns: How do we orchestrate our workflow for efficiency?
Developer Experience: What tools do we use to streamline the process?
Responsible AI: What principles & practices do we need for trustworthy AI?

This week, we'll explore these questions with two distinct AI applications that will demonstrate the power and versatility of the Azure AI platform for building intelligent applications.

1. What are we building?

Our first application is Contoso Chat, a custom copilot that uses the Retrieval Augmented Generation pattern (RAG) to ground responses in your data. The second application is Contoso Creative Writer, a content publishing assistant that uses the Multi-Agent Conversation pattern to coordinate and execute multiple tasks autonomously, on behalf of the user. Let's look at each.

1.1 Contoso Chat

The Contoso Chat app scenario involves a retail company ("Contoso Outdoors") that sells hiking and camping gear to outdoor enthusiasts via their website. The company wants to build and integrate a custom copilot that can help customers ask questions about their product catalog, and receive recommendations based on their previous purchases. This requires the copilot to ground responses in the retailer's product catalog and customer orders data for quality responses.

Contoso Chat

1.2 Contoso Creative Writer

The Contoso Creative Writer app scenario involves a content authoring assistant that can help the company's marketing team quickly assemble and publish content for their product blog. The company needs these articles to be grounded in the product data, but expand on key themes to create compelling narratives for their readers that would motivate more relevant purchases. This requires agentic workflows that can execute tasks concurrently (research topic, find products) and coordinate them autonomously (write content, review for publication) to streamline content creation workflows at scale.

Contoso Creative Writer

2. How are we building it?

Building generative AI applications requires developers to choose relevant models, deploy them for use, and orchestrate their interactions with other services to deliver a complete solution that meets quality and safety requirements.

The Azure AI Studio platform provides a unified experience for developers to complete this workflow end-to-end, using a web portal, CLI, or SDK. The Azure AI Studio architecture involves the three resources shown below - a hub for administration, a project for development and services that provide model endpoints for invocation. Learn more here.

Azure AI

The AI Application Architecture builds on this foundation, implementing the required design patterns that define the application's behavior. This typically involves orchestrating interactions between the Azure AI platform resources and third party services, to deliver end-to-end user experience as shown in the examples below. Let's take a look.

2.1 Contoso Chat

The figure shows the AI Application Architecture for the Contoso Chat sample.

Contoso Chat Architecture

The application is deployed as a hosted endpoint in Azure Container Apps, using Azure Managed Identity for keyless authentication. The user request ("prompt") is received at this endpoint, and processed in three steps:

The prompt query is expanded to extract relevant product query terms
The expanded query is sent to Azure AI Search to retrieve relevant product documents
The original query is also sent to Azure Cosmos DB to retrieve relevant customer profile data

The results from the last two steps provide grounding data that shapes the "enhanced prompt" we then dispatch to the Azure OpenAI chat model. The generated response is then returned to the user. This workflow requires orchestration of invocations to both the AI models (embeddings, chat) and the data retrieval services (search index, customer database) for each request.

2.2 Contoso Creative Writer

The figure shows the AI Application Architecture for the Contoso Creative Writer sample.

Contoso Creative Writer Architecture

Similar to Contoso Chat, the application is deployed as a hosted endpoint in Azure Container Apps, using Azure Managed Identity for keyless authentication - with App Insights for monitoring app performance. The difference is in the orchestration required for processing the user request ("prompt") in this application.

The prompt query is expanded to extract relevant topic query terms and relevant products retrieved using Bing Search and Azure AI Search.
The expanded query is sent to a "Writer Agent" (chat model) which uses the provided query and grounding context to generate a draft article based on the designed prompt template.
The draft article is then sent to an "Editor Agent" (chat model) which assesses the article for acceptance based on the designed prompt template.

An approved article is then published as a blog post. The user interface allows you to view the progression of these tasks visually, to get an intuitive sense for multi-agent coordination.

3. What tasks do I need to do?

We know the application architecture to implement, and we know the infrastructure requirements for building on. But we still need to understand the end-to-end developer workflow that will help us go from initial prototype to production-ready solution.

This is where the GenAIOps lifecycle comes in. Because generative AI applications rely on natural language processing and pre-trained models, they need additional steps to ensure that responses meet quality and safety standards for the diverse real-world prompts they will encounter. The application lifecycle now looks something like this, with three main stages:

Ideation - go from prompt to prototype, and validate with a test input.
Evaluation - go from prototype to production-ready by validating with larger test input set.
Operationalization - deploy for real-world use, and monitor for cost and performance.

Gen AI Ops

From a practical development perspective, we can map these many steps in the workflow shown below, highlighting the key task to be performed and the tool best-suited for that task. We'll look at these in more detail in the next few posts this week.

Developer Workflow

But, to deliver trustworthy AI solutions, we also need to consider security, privacy, and safety requirements for the application. In the last row above, we highlight key features or practices that enable a Responsible AI approach to development. These range from practical considerations (like using curated models from trusted providers) to more complex features (like evaluation, safety systems, and managed identity) that add layers of protection to the application across the development lifecycle.

4. The Week Ahead!

We'll explore these in more detail in the next few posts, as we build and deploy the Contoso Chat and Contoso Creative Writer applications using the Azure AI platform. Here's what to expect:

Day 1: Kickoff Post - What are we building?
Day 2: Provision With AZD - Setup for development.
Day 3: Ideate with Prompty - Prompt Engineering & Data Grounding
Day 4: Evaluate with AI - Custom Evaluators & AI-Assisted Scoring
Day 5: Deploy with ACA - Operationalization & Monitoring

We'll start our journey tomorrow by provisioning Azure resources (Infrastructure as code) and setting up our development environment (Configuration as code) in a manner that enables consistent and reproducible usage across teams.

5. Call To Action

info

Want to get hands-on experience building intelligent apps on Azure?

Take these actions today, to jumpstart your skilling journey:

Register for Microsoft AI Tour - join an instructor-led workshop session.
Register for Microsoft Ignite - look for related lab & breakout sessions on Azure AI.
Browse the AI Templates Collection - explore samples for new frameworks and scenarios.

What We'll Cover Today​

1. What are we building?​

1.1 Contoso Chat​

1.2 Contoso Creative Writer​

2. How are we building it?​

2.1 Contoso Chat​

2.2 Contoso Creative Writer​

3. What tasks do I need to do?​

4. The Week Ahead!​

5. Call To Action​