Microsoft Research Blog

Agent Lightning: Adding reinforcement learning to AI agents without code rewrites

December 11, 2025

By decoupling how agents work from how they’re trained, Agent Lightning turns each step an agent takes into data for reinforcement learning. This makes it easy for developers to improve agent performance with almost zero code changes.

Recent Posts

Breaking the networking wall in AI infrastructure

September 9, 2025 | Paolo Costa

Datacenter memory and network limits are restraining AI system performance. MOSAIC uses microLEDs and a wide-and-slow optical architecture to deliver faster, longer, more reliable, and more energy-efficient connections that could transform AI cluster designs.
Research Focus: Week of May 7, 2025

May 7, 2025

In this issue: New research on compound AI systems and causal verification of the Confidential Consortium Framework; release of Phi-4-reasoning; enriching tabular data with semantic structure, and more.
Research Focus: Week of March 24, 2025

March 26, 2025

In this issue, we examine a new conversation segmentation method that delivers more coherent and personalized agent conversations, and we review efforts to improve MLLMs’ understanding of geologic maps. Check out the latest research and other updates.
Metasurface: Unlocking the future of wireless sensing and communication

March 19, 2025 | Lili Qiu and Hao Pan

Metasurfaces are engineered 2D materials that manipulate electromagnetic and mechanical waves, offering advances in wireless tech. They can power indoor GPS, extend 5G/6G coverage, and enable wireless sensing and imaging.
Advances to low-bit quantization enable LLMs on edge devices

February 5, 2025 | Shijie Cao, Lingxiao Ma, and Ting Cao

Advances in low-bit quantization techniques enable efficient operation of LLMs on resource-constrained edge devices. Discover how innovations like T-MAC, Ladder, and LUT Tensor Core improve computational efficiency and enhance hardware compatibility.
Research Focus: Week of January 27, 2025

January 31, 2025

In this issue: A new approach to multimodal pretraining for remote sensing; Managed-retention memory for the AI era; Improving detection of macular telangiectasia type 2; Generalizing symbolic automata.
Research Focus: Week of December 2, 2024

December 4, 2024

Can a new SOS-RMT protocol enable more efficient CL-MPC?; A fair-by-design, cloud-based algorithmic trading platform; LLM2CLIP unlocks richer visual representation; New technique enhances Low-Rank Adaptation’s expressiveness, generalization capabilities.
Research Focus: Week of November 11, 2024

November 13, 2024

Holistic motion-capture calibration technique without calibration, manual intervention or custom hardware; Research on AI agents for autonomous clouds; Automating proof-oriented program construction; One-to-many testing for natural language code generation.
Preventing side-channels in the cloud

November 12, 2024 | Stavros Volos and Boris Köpf

Sophisticated side-channel attacks present new security challenges for cloud providers. Learn how Microsoft is exploring defenses against emerging attacks with principled microarchitectural isolation.
Microsoft at SOSP 2024: Innovations in systems research

November 4, 2024

Building resilient systems, scaling deep learning computation, and reproducing failures in production are just some of the ways Microsoft researchers are advancing the state of the art in computer systems research at SOSP 2024.
Research Focus: Week of October 28, 2024

November 1, 2024

New Research | FLASH: Workflow automation agent for diagnosing recurring incidents; METAREFLECTION: Learning instructions for language agents using past reflections; Boosting LLM training efficiency through faster communication between GPUs; and more.

