LIDA: Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models

ACL 2023 Demonstrations |

Organized by The 61st Annual Meeting of the Association for Computational Linguistics

Systems that support users in the automatic creation of visualizations must address several subtasks – understand the semantics of data, enumerate relevant visualization goals and generate visualization specifications. In this work, we pose visualization generation as a multi-stage generation problem and argue that well-orchestrated pipelines based on large language models (LLMs) and image generation models (IGMs) are suitable to addressing these tasks. We present LIDA, a novel tool for generating grammar-agnostic visualizations and infographics. LIDA comprises of 4 modules – A SUMMARIZER that converts data into a rich but compact natural language summary, a GOAL EXPLORER that enumerates visualization goals given the data, a VISGENERATOR that generates, refines, executes and filters visualization code and an INFOGRAPHER module that yields data-faithful stylized graphics using IGMs. LIDA provides a python api, and a hybrid user interface (direct manipulation and multilingual natural language) for interactive chart, infographics and data story generation.

 

LIDA - Automatic Generation of Grammar-Agnostic Visualizations and InfographicsSystem architecture for LIDA

 

 

Example infographics generated by LIDA

Example infographics generated with LIDA

Publication Downloads

LIDA

September 7, 2023

LIDA is a library for generating grammar-agnostic (will work with any programming language and visualization libraries e.g. matplotlib, seaborn, altair, d3 etc) visualizations and infographics. LIDA comprises of 4 modules - A SUMMARIZER that converts data into a rich but compact natural language summary, a GOAL EXPLORER that enumerates visualization goals given the data, a VISGENERATOR that generates, refines, executes and filters visualization code and an INFOGRAPHER module that yields data-faithful stylized graphics using IGMs. LIDA provides a python api, and a hybrid user interface (direct manipulation and multilingual natural language) for interactive chart, infographics and data story generation. LIDA works with multiple large language model providers (OpenAI, PaLM, Cohere, Huggingface).