CoWrangler: Recommender System for Data-Wrangling Scripts

SIGMOD

Organized by Microsoft

We present CoWrangler, a real-time data-wrangling recommender system that recommends the next-best data-wrangling operations, along with corresponding human-readable and efficient code snippets, to expedite data exploration and wrangling. A key feature of CoWrangler is that it explains each generated suggestion in the form of data insights, allowing the user to place confidence in the system. Under the hood, CoWrangler generates candidate suggestions using program synthesis techniques and ranks them based on a notion of data quality improvement. We demonstrate how CoWrangler provides a human-in-the-loop data-wrangling experience and helps users make informed data pre-processing decisions while saving time and effort.
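The generate-then-rank idea described above can be illustrated with a minimal Python sketch. It is not CoWrangler's implementation: the candidate operations are hand-written rather than synthesized, and the `quality` score (the share of cells that are present, non-empty, and free of surrounding whitespace) is a hypothetical stand-in for the system's notion of data quality. All function and variable names here are assumptions made for illustration.

```python
from collections import Counter
from typing import Callable, Dict, List, Optional, Tuple

# A table is modeled as column name -> list of string cells (None = missing).
Table = Dict[str, List[Optional[str]]]
Operation = Callable[[Table], Table]


def quality(table: Table) -> float:
    """Hypothetical quality score: share of cells that are present,
    non-empty, and carry no leading or trailing whitespace."""
    cells = [v for col in table.values() for v in col]
    if not cells:
        return 1.0
    clean = sum(1 for v in cells if v is not None and v != "" and v == v.strip())
    return clean / len(cells)


def strip_whitespace(table: Table) -> Table:
    return {name: [None if v is None else v.strip() for v in col]
            for name, col in table.items()}


def fill_missing_with_mode(table: Table) -> Table:
    out: Table = {}
    for name, col in table.items():
        present = [v for v in col if v is not None]
        mode = Counter(present).most_common(1)[0][0] if present else None
        out[name] = [mode if v is None else v for v in col]
    return out


def lowercase(table: Table) -> Table:
    return {name: [None if v is None else v.lower() for v in col]
            for name, col in table.items()}


def rank_suggestions(
    table: Table, candidates: Dict[str, Operation]
) -> List[Tuple[str, float, str]]:
    """Score each candidate by its quality gain and return the helpful
    ones, best first, each with a short explanation for the user."""
    before = quality(table)
    ranked = []
    for name, op in candidates.items():
        after = quality(op(table))
        gain = after - before
        if gain > 0:  # keep only operations that improve the score
            insight = (f"{name} raises the clean-cell share "
                       f"from {before:.0%} to {after:.0%}")
            ranked.append((name, gain, insight))
    return sorted(ranked, key=lambda s: s[1], reverse=True)


# Hand-written candidates; a real system would generate these automatically.
CANDIDATES: Dict[str, Operation] = {
    "strip_whitespace": strip_whitespace,
    "fill_missing_with_mode": fill_missing_with_mode,
    "lowercase": lowercase,
}

table: Table = {
    "city": [" Paris", "Rome ", None, "Rome"],
    "country": ["FR", "IT", "IT", None],
}

suggestions = rank_suggestions(table, CANDIDATES)
for name, gain, insight in suggestions:
    print(f"{name}: gain={gain:.3f} ({insight})")
```

In this toy table, stripping whitespace ranks first because it repairs two cells, while lowercasing is dropped because it leaves the score unchanged. The explanation string plays the role of the data insight shown to the user, who remains free to accept or reject each suggestion.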