Codice
Durata
Prezzo
Lingua
Extract insights from visual data on Azure (AI-3008)
This 1‑day course focuses on building intelligent applications that can see, interpret, and reason over images and documents using different multimodal models and agent-based tools. Learners explore how visual and document inputs can be combined with language models to enable structured extraction, analysis, and decision-making workflows. The course emphasizes practical patterns for extracting information, orchestrating tools, and grounding model responses in visual data.
Modalità di erogazione
In aula o Live Virtual Classroom
Attestato di partecipazione
Al termine del corso verrà rilasciato l’attestato di frequenza
Develop a vision-enabled generative AI application
A picture says a thousand words, and multimodal generative AI models can interpret images to respond to visual prompts. Learn how to build vision-enabled chat apps.
- Introduction
- Use a vision-capable model in the Microsoft Foundry portal
- Develop a vision-based chat app
- Exercise – Develop a vision-enabled chat app
- Module assessment
- Summary
Generate images with AI
In Microsoft Foundry, you can use image generation models to create original images based on natural language prompts.
- Introduction
- What are image-generation models?
- Explore image-generation models in Microsoft Foundry portal
- Create a client application that uses an image generation model
- Exercise – Generate images with AI
- Module assessment
- Summary
Generate videos with Microsoft Foundry
Learn how to generate videos from text prompts with Sora 2 in Microsoft Foundry.
- Introduction
- Deploy a video generating model
- Generate video from a prompt
- Generate video in Python
- Exercise – Generate video with Sora 2 in Microsoft Foundry
- Module assessment
- Summary
Analyze images with Content Understanding
Learn how to analyze images with Azure Content Understanding.
- Introduction
- What is Content Understanding?
- Analyze images with Content Understanding
- Exercise – Analyze images with Content Understanding
- Module assessment
- Summary
Create a multimodal analysis solution with Azure Content Understanding
Use Azure Content Understanding for multimodal content analysis and information extraction.
- Introduction
- What is Azure Content Understanding?
- Create a Content Understanding analyzer
- Use the Content Understanding API
- Exercise – Extract information from multimodal content
- Module assessment
- Summary
Create an Azure Content Understanding client application
Use the Azure Content Understanding API for multimodal content analysis and information extraction.
- Introduction
- Prepare to use the AI Content Understanding API
- Create a Content Understanding analyzer
- Analyze content
- Exercise – Develop a Content Understanding client application
- Module assessment
- Summary
Extract data with Azure Document Intelligence
Azure Document Intelligence uses OCR and deep learning models to extract text, key-value pairs, tables, and structured data from forms and documents. Learn how to use prebuilt and custom models to automate document processing.
- Introduction
- What is Azure Document Intelligence?
- Use the Document Intelligence Studio
- Use prebuilt models
- Train and use custom models
- Exercise – Analyze documents with Document Intelligence
- Module assessment
- Summary
Create a knowledge mining solution with Azure AI Search
Unlock the hidden insights in your data with Azure AI Search. In this module, you’ll learn how to implement a knowledge mining solution that extracts and enriches data, making it searchable and ready for deeper analysis.
- Introduction
- What is Azure AI Search?
- Extract data with an indexer
- Enrich extracted data with AI skills
- Search an index
- Persist extracted information in a knowledge store
- Exercise – Create a knowledge mining solution
- Module assessment
- Summary
This course is designed for developers, AI engineers, and technical professionals who want to build applications that work with images and documents using multimodal, agent-driven approaches. It’s best suited for learners with basic programming experience and a general understanding of cloud or AI concepts.
Before starting this learning path, you should already have:
- Familiarity with Azure and Microsoft Foundry.
- Programming experience.
This 1‑day course focuses on building intelligent applications that can see, interpret, and reason over images and documents using different multimodal models and agent-based tools. Learners explore how visual and document inputs can be combined with language models to enable structured extraction, analysis, and decision-making workflows. The course emphasizes practical patterns for extracting information, orchestrating tools, and grounding model responses in visual data.