Microsoft’s Copilot in Windows 11 has quickly evolved beyond simple text-based assistance. With the latest updates, it now includes a feature called Copilot Vision, allowing the AI to understand and interpret images directly on your screen.
Whether you want to describe what’s shown in a photo, extract text from an image, or get contextual help about visual elements on your desktop, Copilot Vision brings powerful visual intelligence right into Windows.
In this guide, we’ll explain what Copilot Vision is, how to enable it, and how to use it effectively on Windows 11.
How to Use Copilot Vision on Windows 11
You’ll need a Windows 11 PC running the latest 23H2 or 25H2 update (Copilot Vision is available starting in newer builds). Ensure your system is connected to the internet and signed in with a Microsoft account.
Tip: If you don’t see the Copilot icon on your taskbar, update Windows via Settings → Windows Update, and install all optional feature updates.
Step 1: What Is Copilot Vision in Windows 11?
Copilot Vision is an AI-powered enhancement to Microsoft Copilot that lets it analyze visual content displayed on your screen. Unlike the standard Copilot, which focuses on text and commands, the Vision feature can understand images, screenshots, diagrams, and even interface elements.
You can use it to:
- Describe or summarize an image.
- Extract text from a screenshot.
- Identify on-screen objects or apps.
- Get help about something visible on your desktop (for example, “What does this setting do?”).
Essentially, it turns your Windows assistant into a visual AI companion that can “see” what you see.
Step 2: Enable Copilot Vision in Windows 11
If you’re running a recent version of Windows 11, Copilot Vision may already be active. But if it’s not, you can manually enable it.
- Open Settings → Personalization → Taskbar.
- Toggle on Copilot (preview) to show the Copilot icon on your taskbar.
- Click the Copilot icon or press Windows + C to open it.
- If prompted, sign in with your Microsoft account.
Once Copilot is open, you’ll see a camera icon or image upload option — that’s where Copilot Vision begins.
Step 3: Use Copilot Vision with On-Screen Content
With Copilot Vision enabled, you can ask Copilot to analyze visual content directly from your desktop or an app window.
- Open Copilot by pressing Windows + C.
- If you see something on your screen you want to analyze, take a screenshot using Windows + Shift + S (Snipping Tool).
- Paste or drag the screenshot directly into Copilot’s chat window.
- Ask a question like:
- “What’s in this image?”
- “Can you summarize the text from this screenshot?”
- “Explain this chart.”
Copilot Vision will process the image and respond with a detailed answer, often including extracted text, context, or a description of visual elements.
Step 4: Use Copilot Vision to Analyze Files or Photos
You can also upload an image or file directly into Copilot to get information about it.
- Open Copilot.
- Click the Attach (paperclip) or Image icon.
- Select an image file (PNG, JPG, or PDF with images).
- Once it uploads, type your request — for example:
- “Describe this picture.”
- “Translate the text in this photo.”
- “What object is this?”
Copilot Vision uses Microsoft’s multimodal AI technology to interpret and respond intelligently to the visual data.
Step 5: Get Visual Help with On-Screen Elements
Copilot Vision isn’t just for images — it can assist with the Windows interface itself. For example:
- If a settings window or error message appears, open Copilot and type:
“What does this window mean?” or “How can I fix this error?” - Copilot will analyze the visible content or screenshot and provide context-sensitive guidance.
This is particularly useful for troubleshooting or navigating unfamiliar settings in Windows 11.
Step 6: Use Copilot Vision with the Snipping Tool
Microsoft has integrated Copilot Vision with the Snipping Tool, making it easy to capture and analyze screenshots instantly.
- Press Windows + Shift + S to open the Snipping Tool.
- Select the area you want to capture.
- When the screenshot appears in the notification area, click Open in Snipping Tool.
- Click the Copilot or Ask Copilot button inside the Snipping Tool window.
- Copilot Vision will analyze your screenshot automatically and show results in the Copilot panel.
You can then ask questions like “What does this mean?” or “Summarize this data.”
Step 7: Use Copilot Vision to Extract Text (OCR)
One of Copilot Vision’s most useful capabilities is text extraction (OCR).
- Capture or upload an image containing text (e.g., a scanned document or screenshot).
- In Copilot, type:
- “Extract the text from this image.”
- “Copy the words in this screenshot.”
- Copilot Vision will extract and display all readable text, ready to copy or edit.
This is perfect for grabbing text from PDFs, screenshots, or non-editable web content.
Step 8: Use Copilot Vision for Image Descriptions and Accessibility
Copilot Vision also improves accessibility for visually impaired users by generating natural-language descriptions of on-screen visuals.
For instance, if you drag a photo into Copilot, it can describe objects, scenes, or even emotions in the picture.
You can ask questions like:
- “What’s happening in this image?”
- “Is this text readable?”
- “Describe the person’s expression.”
This feature combines image recognition and natural language understanding, offering context-rich results that make digital content more accessible.
Step 9: Troubleshoot Copilot Vision Not Working
If Copilot Vision isn’t responding to images or the image upload option is missing:
- Ensure your PC is running the latest Windows 11 build (23H2 or later).
- Go to Settings → Windows Update → Check for updates.
- Restart your PC after installing updates.
- Open Copilot again using Windows + C.
- If the Vision feature is still missing, ensure Region and Language are set to the United States (some regions may not have Vision available yet).
If you use work or school accounts, your IT administrator might have restricted Copilot features.
Step 10: Make the Most of Copilot Vision
Here are a few creative ways to use Copilot Vision in Windows 11:
- Analyze infographics and charts for reports.
- Extract text from screenshots of PDFs or locked documents.
- Identify visual bugs or icons in app interfaces.
- Get translations of images containing foreign text.
- Explain complex error messages using screenshots.
Once you start using it, you’ll realize Copilot Vision is like having a built-in visual assistant that understands both what you see and what you ask.
Wrapping Up
Copilot Vision marks a major step forward in how AI integrates with Windows 11. It brings visual understanding directly to your desktop — letting you analyze screenshots, extract text, describe images, and get help based on what’s on your screen.
Whether you’re a content creator, student, or power user, Copilot Vision can save time and make everyday tasks smarter. As Microsoft continues to expand Copilot’s visual intelligence, Windows 11 users gain one of the most advanced desktop AI experiences yet.