Editable AI-Generated Research Figures: Introducing AutoFigure-Edit from Westlake University

The article presents AutoFigure-Edit, an open‑source AI system that turns long‑form scientific text into fully editable SVG figures, solves the uneditable‑image problem of existing AIGC tools, and demonstrates superior performance on the FigureBench benchmark and real‑user studies.

Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Editable AI-Generated Research Figures: Introducing AutoFigure-Edit from Westlake University

Problem

Researchers often generate scientific diagrams as static PNG images, which cannot be edited without re‑generating the whole figure.

AutoFigure‑Edit

AutoFigure‑Edit extends the AutoFigure framework to produce fully editable vector graphics directly from paper text.

Five‑stage pipeline

Style‑conditioned image generation : a raster image is generated from the long‑form method description, optionally conditioned on a user‑provided style reference.

Segmentation & structural indexing : Meta’s SAM3 model segments visual components (icons, modules, arrows) and builds a structural skeleton.

Asset extraction : each component is extracted with a transparent background (RGBA).

SVG template generation & refinement : a layout template in SVG is created and refined using the extracted assets.

Asset injection : assets are injected into the template, yielding a layer‑wise editable SVG where every element can be moved, recolored, or relabeled.

Key innovations

Pixel‑to‑vector conversion enables direct in‑browser editing of diagrams.

Reference‑image‑guided style transfer learns color palette, fonts, and icon style from a single example.

Built‑in interactive editor allows immediate layout, text, and icon adjustments on the generated SVG.

“Reasoned Rendering” decouples logical layout from aesthetic rendering, supporting a critique‑and‑refine loop and OCR‑based replacement of blurry raster text with clean vector text.

SAM3‑driven automatic segmentation produces transparent assets for each visual component.

Experimental evaluation

Evaluation on the FigureBench benchmark shows that AutoFigure‑Edit outperforms prior methods (GPT‑Image, SVG‑Code, Diagram Agent) across accuracy, completeness, and adaptability.

Win‑rate with style guidance: 83 % vs 76 % without.

Human evaluation (217 participants, 262 generated figures):

Semantic correctness 4.04/5

Information completeness 4.11/5

Visual quality 3.95/5

Style consistency 4.09/5

48 % of figures deemed ready for paper submission without modification.

Automatic metrics (FigureBench):

Accuracy 8.83, completeness 8.26, adaptability 8.37 (significantly higher than baselines).

Without reference style: overall score 8.29, visual 8.32, expressiveness 8.66.

SVG conversion correctness 3.60/5; 36 % of users gave a perfect score, and low scores (<2) occurred in fewer than 12 % of cases.

Open‑source resources

Code and models are released under an open‑source license at https://github.com/ResearAI/AutoFigure-Edit. The FigureBench dataset is available on HuggingFace.

Impact

By delivering editable SVG diagrams, AutoFigure‑Edit removes the “final‑step” bottleneck in scientific publishing, enables consistent visual style across papers, and supports fully autonomous AI‑driven research workflows.

AISVGStyle TransferAutoFigureScientific Visualization
Machine Learning Algorithms & Natural Language Processing
Written by

Machine Learning Algorithms & Natural Language Processing

Focused on frontier AI technologies, empowering AI researchers' progress.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.