Our team in Microsoft Azure AI Platform is at the forefront of developing multimodal AI technologies that combine language, vision, and other sensory inputs to power Microsoft AI products. We are seeking a Senior Researcher…
We present a method for prediction of a person’s hairstyle from a single image. Despite growing use cases in user digitization and enrollment for virtual experiences, available methods are limited, particularly in the range of…
We tackle the problem of highly accurate, holistic performance capture for the face, body and hands simultaneously. Motion-capture technologies used in film and game production typically focus only on face, body or hand capture independently, involve…
OmniParser is a comprehensive method for parsing user interface screenshots into structured and easy-to-understand elements, which significantly enhances the ability of GPT-4V to generate actions that can be accurately grounded in the corresponding regions of…
Microsoft Audience Network (MSAN), part of Microsoft AI (Artificial Intelligence), is seeking a Senior Applied Scientist-Ads. As the Senior Applied Scientist, you will specialize in creating and enhancing machine learning technologies in areas such…
The Interactive Multimodal AI Systems group focuses on creating interactive systems and experiences that blend the richness and complexity of people and their real, physical world with advanced technology. We seek to leverage multimodal generative AI…