Loading...
Research Focus: April 23, 2025
Microsoft Research Blog

Research Focus: Week of April 21, 2025 

April 23, 2025

In this issue: our CHI 2025 & ICLR 2025 contributions, plus research on causal reasoning & LLMs; countering LLM jailbreak attacks; and how people use AI vs. AI-alone. Also, SVP of Microsoft Health Jim Weinstein talks rural healthcare innovation.

A stylized illustration of a green line-drawn hand holding a transparent prism with colorful bands of light being refracted through it against a black background.
Microsoft Research Blog

BiomedParse: A foundation model for smarter, all-in-one biomedical image analysis 

November 18, 2024 | Hoifung Poon, Theodore Zhao, Yu (Aiden) Gu, Mu Wei, and Sheng Wang

BiomedParse reimagines medical image analysis, integrating advanced AI to capture complex insights across imaging types—a step forward for diagnostics and precision medicine.

Research Focus Week of February 19, 2024
Microsoft Research Blog

Research Focus: Week of February 19, 2024 

February 21, 2024

In this issue: CaaSPER: vertical autoscaling algorithm dynamically maintains optimal CPU utilization; Improved scene landmark detection for camera localization runs faster, uses less storage; ESUS simplifies usability questionnaires for technical products and services.

RF NeurIPS Edition December 11, 2023
Microsoft Research Blog

NeurIPS 2023 highlights breadth of Microsoft’s machine learning innovation 

December 11, 2023

We’re proud to have 100+ accepted papers At NeurIPS 2023, plus 18 workshops. Several submissions were chosen as oral presentations and spotlight posters, reflecting groundbreaking concepts, methods, or applications. Here’s an overview of those submissions.

Responsible AI blog - hero graphic with connected circles with icons depicting closed captions, calendar, image, and document inside of the circles
Microsoft Research Blog

Frontiers of multimodal learning: A responsible AI approach 

September 6, 2023

New evaluation methods and a commitment to continual improvement are musts if we’re to build multimodal AI systems that advance human goals. Learn about cutting-edge research into the responsible development and use of multimodal AI at Microsoft.

Microsoft Research Focus 03: Week of November 7th, 2022
Microsoft Research Blog

Research Focus: Week of November 7, 2022 

November 8, 2022

Welcome to Research Focus, a new series of blog posts that highlights notable publications, events, code/datasets, new hires and other milestones from across the research community at Microsoft. Barun Patra, Saksham Singhal, Shaohan Huang, Zewen Chi, Li Dong, Furu Wei,…

Diagram showing GODEL’s architecture. The environment of the dialog system consists of both structured and unstructured content, which it uses to retrieve information. This source content, which we term “grounding,” is updated and repeatedly used by GODEL to produce a new response after each user input.
Microsoft Research Blog

GODEL: Combining goal-oriented dialog with real-world conversations 

June 23, 2022 | Baolin Peng, Michel Galley, Lars Liden, Chris Brockett, Zhou Yu, and Jianfeng Gao

They make restaurant recommendations, help us pay bills, and remind us of appointments. Many people have come to rely on virtual assistants and chatbots to perform a wide range of routine tasks. But what if a single dialog agent, the…

animation showing hyperparameters
Microsoft Research Blog

µTransfer: A technique for hyperparameter tuning of enormous neural networks 

March 8, 2022 | Edward Hu, Greg Yang, and Jianfeng Gao

Great scientific achievements cannot be made by trial and error alone. Every launch in the space program is underpinned by centuries of fundamental research in aerodynamics, propulsion, and celestial bodies. In the same way, when it comes to building large-scale…

Diagram: The proposed SOLOIST model architecture and training objectives. Dialog history, Belief state, DB state, and Response make up the pipeline. Task 1, belief state prediction, corresponds with belief state. Task 2 and Task 3, grounded response generation and contrastive objective correspond with response. A user is shown thinking a goal, which points from the dialog history to the user (response), and then back to dialog history (input). Belief state points down to an image of a computer server (Belief state query) and then back to DB State (DB state results). The server points to readouts labeled “entity.”
Microsoft Research Blog

SOLOIST: Pairing transfer learning and machine teaching to advance task bots at scale 

June 16, 2021 | Baolin Peng, Chunyuan Li, Jinchao Li, Lars Liden, and Jianfeng Gao

The increasing use of personal assistants and messaging applications has spurred interest in building task-oriented dialog systems (or task bots) that can communicate with users through natural language to accomplish a wide range of tasks, such as restaurant booking, weather…

  • Previous
  • 1
  • 2
  • 3
  • Next