Loading...
Three white icons on a blue-to-purple gradient background: the first icon shows an image/photo; the second icon depicts a computer monitor with vertical bars; the third icon displays three connected circles with user silhouettes.
Microsoft Research Blog

MMCTAgent: Enabling multimodal reasoning over large video and image collections 

November 12, 2025 | Akshay Nambi, Kavyansh Chourasia, and Tanuja Ganu

MMCTAgent enables dynamic multimodal reasoning with iterative planning and reflection. Built on Microsoft’s AutoGen framework, it integrates language, vision, and temporal understanding for complex tasks like long video and image analysis.

a field of green plants with two people walking in between the rows
Stories

Advancing AI to meet needs of the global majority 

November 12, 2025

AI tools can perform poorly in non-Western languages and lack critical cultural context for many populations. Project Gecko uses small language models to bring vital expertise to farmers in underserved areas using local languages and multi-modal content.

In the news | AI Ireland

E204 ‘Multilingual Innovation and AI’ with Microsoft’s Kalika Bali 

July 3, 2025

Today’s guest is Kalika Bali, Senior Principal Researcher at Microsoft Research India. In the episode, Kalika talks about her intriguing journey and impactful work, as well as peeling back the layers of how accidental moments can shape careers and how…

Lady in blue sari, holding a smartphone, looking at the camera. Seated outside in front of a couple of trees.
Stories

How ASHABot empowers rural India’s frontline health workers 

June 24, 2025

From childbirth to chronic disease, India’s rural healthcare workers face it all. Now, they have a tool that listens—and answers—in the languages they use every day.

In the news | CNBC

How Microsoft, Physics Wallah are bringing AI-driven learning to Indian classrooms 

June 4, 2025

As part of its broader $3 billion AI and cloud investment in India, Microsoft is deepening its focus on education through a collaboration with edtech company Physics Wallah, aimed at improving learning outcomes using AI-powered tools and personalised academic support.

Research Focus: May 07, 2025
Microsoft Research Blog

Research Focus: Week of May 7, 2025 

May 7, 2025

In this issue: New research on compound AI systems and causal verification of the Confidential Consortium Framework; release of Phi-4-reasoning; enriching tabular data with semantic structure, and more.

Research Focus: April 23, 2025
Microsoft Research Blog

Research Focus: Week of April 21, 2025 

April 23, 2025

In this issue: our CHI 2025 & ICLR 2025 contributions, plus research on causal reasoning & LLMs; countering LLM jailbreak attacks; and how people use AI vs. AI-alone. Also, SVP of Microsoft Health Jim Weinstein talks rural healthcare innovation.

Research Forum Episode 5 | Pantazis Deligiannis and Aseem Rastogi
Articles

LLMs for safe low-level programming 

February 25, 2025

Aseem Rastogi and Pantazis Deligiannis talk about two technical results from ICSE 2025 on using large language models (LLMs) for safe low-level programming. The results demonstrate LLMs inferring machine-checkable memory safety invariants in legacy C code and how LLMs assist…

Physics Wallah blog | education icons
Microsoft Research Blog

Microsoft Research and Physics Wallah team up to enhance AI-based tutoring 

February 12, 2025 | Chris Stetkiewicz

Limited resources, geography, and economic factors present barriers to quality education for many students in India. Learn how Microsoft Research is collaborating with Physics Wallah to make AI-based tutoring more accurate, reliable, and affordable.

  • Previous
  • 1
  • 2
  • 3
  • …
  • 25
  • Next