Agentic Visual Reporting

AI Research Visualization Opensource Web Full-stack Analytics

August 2025 - Present

Winner of the IEEE VIS 2025 VISxGenAI Challenge. An agentic system for data analysis that pairs LLM agents with deterministic visualization modules. It produces interactive reports for readers and executable notebooks for analysts, so results can be inspected, rerun, and adapted.

Developed in collaboration with

University of Vienna

Carnegie Mellon University

Mohamed bin Zayed University of Artificial Intelligence

Austrian Institute of Technology

Most data analysis workflows force a difficult trade-off between speed and reliability. Manual analysis produces trustworthy results but doesn’t scale, while automated tools are fast but often yield black-box outputs that are difficult to verify or adapt. This project explores a different approach to resolve this tension.

The system uses a pipeline of eleven specialized AI agents, but its core design principle is a hybrid architecture. It delegates creative and interpretive work—like planning insights and generating narratives—to AI, while assigning tasks that demand precision and consistency to deterministic components. This separation of concerns is fundamental to producing results that are both insightful and verifiable.

The result is a set of two complementary outputs that serve different needs. For readers, it generates interactive web reports for exploring data beyond the initial analysis. For analysts, it produces executable Marimo notebooks, providing full transparency to verify or adapt the process. This dual-output model is a deliberate step towards a more effective human-AI collaboration, where the AI augments human analysis rather than trying to replace it.

The system is built with observability in mind. Every model call and data transformation is tracked using tools like Langfuse . It also uses in-browser computation for interactive exploration and object storage like Cloudflare R2 for report artifacts.

Invited for a live presentation at the IEEE VISxGenAI 2025 Workshop Challenge, the project prototypes a workflow where AI drafts and deterministic modules enforce constraints. The goal is to augment rather than replace human analysis by keeping outputs inspectable and rerunnable.

Stack

While the problem is more important than the tools, the tech stack tells a story about the project's architecture and trade-offs. Here's what this project is built on:

Platforms & Runtimes

Python

Runs the 11-agent orchestration, data processing, report generation pipeline, and provenance notebooks.

Pyodide

Provides the browser-side Python runtime for dynamically generated Marimo notebooks.

Clingo

Solves logic programs for selecting visualizations under formal design constraints.

Node.js

Runs documentation tooling and build steps for report artifacts and site generation.

TypeScript

Implements the Observable Notebook builder service and the VitePress config for the docs site.

JavaScript

Implements the generated Observable notebooks and their interactive report UI.

Frontend & Visualization

Observable Notebook

Hosts interactive HTML reports that readers can explore and analysts can edit.

Mosaic

Enables coordinated views and cross-filtering in generated reports.

Draco

Synthesizes visualization specifications from constraints for chart generation.

Vega-Lite

Renders charts from Draco-derived specifications for the final report visuals.

Vue.js

Implements interactive components within the documentation site and gallery.

VitePress

Builds the documentation site and gallery as a static site.

AI & Machine Learning

DSPy

Orchestrates 11 specialized agents via modular prompts and tracing.

OpenAI API

Provides models for dataset description, insight planning, and narrative generation.

Anthropic API

Provides models for coding-heavy agent steps alongside other providers.

Gemini

Provides models used for mapping codes to human-readable labels.

Google Vertex AI

Provides an additional managed inference backend used selectively for agent runs.

OpenRouter API

Provides alternative models behind a uniform API for stage-specific routing.

Data Engineering

DuckDB

Runs SQL queries both on the server and in the browser (via WASM) for interactive exploration.

Polars

Applies transformations to the raw input dataset based on agent-generated metadata.

NumPy

Computes dataset statistics used during insight discovery.

Parquet

Stores dataset artifacts so they can be queried and processed outside the agent pipeline.

Pandas

Reads remote datasets in Pyodide-backed notebooks when browser-side engines cannot fetch them directly.

Apache Arrow

Moves data between engines without copies (e.g., DuckDB ↔ Polars).

Backend & APIs

Pydantic

Validates and serializes agent inputs/outputs and supports schema-driven prompting.

OpenTelemetry

Collects agent traces in a standard format for observability and debugging.

Vega-Altair

Renders Vega-Lite visualizations from Draco specifications via a Python API.

External Services

Langfuse

Tracks traces, token usage, and latency across agents for observability and QA.

Infisical

Manages API keys used by agents for submissions to the evaluation server.

Umami

Tracks usage analytics for the documentation site.

Cloud & DevOps

Cloudflare R2

Stores report assets and artifacts via S3-compatible object storage.

Coolify

Runs self-hosted Langfuse, Infisical, Umami, and the Observable Notebook Builder service.

Docker

Packages the Observable Notebook Builder service for deployment on Coolify.

GitHub Actions

Builds the VitePress documentation site and deploys it to GitHub Pages.

Development Tooling

Installs and resolves Python dependencies for local development and reproducible runs.

Ruff

Lints and formats Python sources in the agent pipeline.

Marimo

Notebook environment for running end-to-end pipelines during development and debugging.

pnpm

Manages Node.js dependencies for docs and report build tooling.

Vite

Builds and previews the documentation site and related frontend assets.

Biome

Formats and lints JavaScript/TypeScript code for docs and build tooling.