illustrations of two gears turning
July 20, 2020 - July 23, 2020

Frontiers in Machine Learning 2020

9:00 AM–12:30 PM Pacific

Location: Virtual

Thursday, July 23, 2020

Theme: Machine Learning Systems

Time (PDT) Session Title Speaker / Talk Title
9:00 AM–10:30 AM Learning from Limited Labeled Data: Challenges and Opportunities for NLP
[Video]

Session Lead: Ahmed Hassan Awadallah, Microsoft

Session Abstract: Modern machine learning applications have enjoyed a great boost utilizing neural networks models, allowing them to achieve state-of-the-art results on a wide range of tasks. Such models, however, require large amounts of annotated data for training. In many real-world scenarios, such data is of limited availability making it difficult to translate these gains into real-world impact. Collecting large amounts of annotated data is often difficult or even infeasible due to the time and expense of labelling data and the private and personal nature of some of these datasets. This session will discuss several approaches to address the labelled data scarcity. In particular, the session will discuss work on: (1) transfer learning techniques that can transfer knowledge between different domains or languages to reduce the need for annotated data; (2) weakly-supervised learning where distant or heuristic supervision is derived from the data itself or other available metadata; (3) and techniques which learn from user interactions or other reward signals directly with techniques such as reinforcement learning. The discussion will be grounded on real-world applications where we aspire to bring AI experiences quickly and efficiently to everyone in more tasks, markets, languages, and domains.

Ahmed Hassan Awadallah, Microsoft
Bringing AI Experiences to Everyone

Marti Hearst, University of California, Berkeley
Summarization without the Summaries

Graham Neubig, Carnegie Mellon University
Lessons from the Long Tail: Methods for NLP in the Next 1,000 Languages

Alex Ratner, University of Washington
ML Development with Weak Supervision: Notes from the Field

Q&A panel with all 4 speakers

10:30 AM–11:00 AM BREAK
11:00 AM–12:40 PM Climate Impact of Machine Learning
[Video]

Session Lead: Philip Rosenfield, Microsoft

Session Abstract: Microsoft has made an ambitious commitment to remove its carbon footprint in response to the overwhelming urgency of addressing climate change. Meanwhile, recent advances in machine learning (ML) models, such as transformer-based NLP, have produced substantial gains in accuracy at the cost of exceptionally large compute resources and, correspondingly, carbon emissions from energy consumption. Understanding and mitigating the climate impact of ML has emerged at the frontier of ML research, spanning multiple areas including hardware design, computational efficiency, and incentives for carbon efficiency.

The goal of this session is to identify priority areas to drive research agendas that are best-suited to efforts in academia, in industry, or in collaboration. We aim to inspire research advances and action, within both academia and industry, to improve the sustainability of machine learning hardware, software and frameworks.

Nicolo Fusi, Microsoft
Opening Remarks

Emma Strubell, Carnegie Mellon University
Learning to Live with BERT

Vivienne Sze, Massachusetts Institute of Technology
Reducing the Carbon Emissions of ML Computing – Challenges and Opportunities

Diana Marculescu, University of Texas at Austin
When Climate Meets Machine Learning: The Case for Hardware-ML Model Co-design

Q&A panel with all 4 speakers

12:40 PM–12:45 PM Closing Remarks Sandy Blyth, Managing Director
Microsoft Research Outreach

Vani Mandava, Director
Microsoft Research Outreach