LIDA: Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models
Organized by The 61st Annual Meeting of the Association for Computational Linguistics
Systems that support users in the automatic creation of visualizations must address several subtasks – understand the semantics of data, enumerate relevant visualization goals and generate visualization specifications. In this work, we pose visualization generation as a multi-stage generation problem and argue that well-orchestrated pipelines based on large language models (LLMs) and image generation models (IGMs) are suitable to addressing these tasks. We present LIDA, a novel tool for generating grammar-agnostic visualizations and infographics. LIDA comprises of 4 modules – A SUMMARIZER that converts data into a rich but compact natural language summary, a GOAL EXPLORER that enumerates visualization goals given the data, a VISGENERATOR that generates, refines, executes and filters visualization code and an INFOGRAPHER module that yields data-faithful stylized graphics using IGMs. LIDA provides a python api, and a hybrid user interface (direct manipulation and multilingual natural language) for interactive chart, infographics and data story generation.
System architecture for LIDA
Example infographics generated with LIDA
Publication Downloads
LIDA
September 7, 2023
LIDA is a library for generating grammar-agnostic (will work with any programming language and visualization libraries e.g. matplotlib, seaborn, altair, d3 etc) visualizations and infographics. LIDA comprises of 4 modules - A SUMMARIZER that converts data into a rich but compact natural language summary, a GOAL EXPLORER that enumerates visualization goals given the data, a VISGENERATOR that generates, refines, executes and filters visualization code and an INFOGRAPHER module that yields data-faithful stylized graphics using IGMs. LIDA provides a python api, and a hybrid user interface (direct manipulation and multilingual natural language) for interactive chart, infographics and data story generation. LIDA works with multiple large language model providers (OpenAI, PaLM, Cohere, Huggingface).