A Quick review of LLM for Data Visualization

Updated on 8/2/2024

Hey there! Today, let's talk about something really cool: how big AI models can help us make awesome data visualizations. Data visualization is basically turning boring data into colorful charts and graphs that are easy to understand. But making these charts can be tricky and usually needs a lot of skill. Luckily, with the latest AI tech, we can make this process much easier. Let’s dive into how these big AI models, called Large Language Models (LLMs), like GPT-3.5, are changing the game.

The Journey from Words to Visuals

Transforming natural language into visualizations has come a long way. At first, people used rule-based systems. These systems, like Articulate and DataTone, took natural language (like you asking, "Show me a bar chart of sales") and turned it into charts using set rules. But these systems weren't great at understanding complicated queries.

Then came neural network-based methods. These use deep learning to handle more complex questions. Models like ADVISor started to combine deep learning with rules, making them better at understanding what we wanted to see in our charts.

The Rise of Big AI Models

Enter the big players: LLMs like GPT-3.5. These models are really good at understanding natural language and generating code. This makes them perfect for creating visualization specifications. Recently, researchers tested GPT-3.5 to see how well it could make visualizations using a tool called Vega-Lite, which is great for creating interactive graphics.

What We Found Out

Zero-Shot vs. Few-Shot Prompts:
- When GPT-3.5 was given no examples (zero-shot) versus a few examples (few-shot) of how to make the charts, it did much better with a few examples. It got things right about 50% of the time with examples compared to 43% without. So, giving the model some examples really helps!
Different Charts and Tasks:
- GPT-3.5 was better at making certain types of charts, like scatter plots and pie charts, but struggled with bar charts. It also found it harder to make charts for more complex tasks.
Common Mistakes:
- The AI sometimes made mistakes like creating invalid JSON files (a format for data), using Vega-Lite properties incorrectly, or not fully understanding the task or data. Fixing these issues is key to making the AI better at creating charts.

Cool Stuff Other People Have Done

There’s been a lot of work done in this field. Here are some highlights:

Rule-Based Systems:
- Early systems like Articulate and DataTone used set rules to turn natural language into charts. These were simple but couldn’t handle complex queries well.
Neural Networks:
- Then came deep learning models like ADVISor and ncNet. These used large datasets to learn how to make charts from natural language, doing a much better job than rule-based systems.
LLM-Based Systems:
- Recently, people have started using big AI models like Chat2VIS and LIDA. These models can generate code for visualizations, though they mostly focus on Python code rather than tools like Vega-Lite. VizGPT (opens in a new tab) is one such model that focuses on Vega-Lite, making it easier to create interactive visualizations.

Challenges and Future Fun

Even though GPT-3.5 is pretty good, there are still some things we need to work on:

Fixing Grammar Mistakes:
- The AI often made grammar errors in Vega-Lite. We need to help it understand the rules better and catch mistakes.
Understanding Tasks Better:
- Sometimes, the AI misunderstood what we were asking for. Making it better at understanding questions is a big goal.
Improving Datasets:
- The datasets we use to train and test these models, like nvBench, sometimes have errors. Making these datasets better will help us get more accurate results from the AI.

Looking Ahead

There’s a lot of potential for these big AI models in making data visualization easier. Here are some things to look forward to:

Better Prompts:
- By giving the AI better examples, we can help it understand what we want more easily.
Interactive AI:
- Imagine being able to chat with the AI to refine your charts step by step. This could make the process much smoother and more accurate.
Higher Quality Datasets:
- Improving the datasets we use will give us more reliable AI models, making the visualizations even better.

Wrapping Up

In short, big AI models like GPT-3.5 are making it easier to turn words into beautiful charts. While there’s still work to be done to iron out some issues, the future looks bright. These advancements mean we can all spend less time fiddling with chart tools and more time understanding our data. So, get ready to see some amazing charts with the help of AI!