In the news | Microsoft Garage Blog
Art is one of the few languages which transcends barriers of country, culture, and time. Most people view art subjectively through a lens shaped by their experiences and environment. Finding commonalities among pieces from different eras and mediums calls for…
| Chunyuan Li, Lei Zhang, and Jianfeng Gao
Humans perceive the world through many channels, such as images viewed by the eyes or voices heard by the ears. Though any individual channel might be incomplete or noisy, humans can naturally align and fuse the information collected from multiple…
In the news | Microsoft Asia News Center
Hollywood loves making movies about computers going crazy, robots running riot, and technology taking over. Science fiction blockbusters with apocalyptic twists often top the box office. But have they ever made you wonder about something more serious? After all, advances…
| Takuya Yoshioka, Dimitrios Dimitriadis, Andreas Stolcke, and William Hinthorn
Recent advances in machine learning and signal processing, as well as the availability of massive computing power, have resulted in dramatic and steady improvement in speech recognition accuracy. Voice interfaces to digital devices have become more and more common. Lectures…
Episode 76, May 15, 2019 When was the last time you had a meaningful conversation with your computer… and felt like it truly understood you? Well, if Dr. Xuedong Huang, a Microsoft Technical Fellow and head of Microsoft’s Speech and…
In the news | ZDNet
Microsoft Research's 'Project Denmark' technology allows users to use the microphones in phones and laptops to create a virtual array that can handle conversation transcription and more.
| Xuedong Huang
Deep learning algorithms, supported by the availability of powerful Azure computing infrastructure and massive training data, constitutes the most significant driving force in our AI evolution journey. In the past three years, Microsoft reached several historical AI milestones being the…
In the news | SlashGear
Microsoft has figured out real-time conversation transcription, revealing a new Azure-integrated conical reference design speaker along with a way to turn every phone and laptop in a meeting into an ad-hoc voice recognition array.
A team of researchers from the Natural Language Processing (NLP) Group at Microsoft Research Asia (MSRA) and the Speech Dialog Research Group at Microsoft Redmond are currently leading in the Conversational Question Answering (CoQA) Challenge organized by Stanford University. In…