Evaluating High-Resolution Vessel Mask-to-Fundus Translation under Non-Monotonic GAN Dynamics

Jiaqiang Yang

doi:10.2991/978-94-6239-648-7_98

<Previous Article In Volume

Next Article In Volume>

Evaluating High-Resolution Vessel Mask-to-Fundus Translation under Non-Monotonic GAN Dynamics

Authors

Jiaqiang Yang¹^{, *}

¹College of Liberal Arts and Sciences, University of Connecticut, Storrs, CT, 06269, USA

^*Corresponding author. Email: tbg24003@uconn.edu

Corresponding Author

Jiaqiang Yang

Available Online 24 April 2026.

DOI: 10.2991/978-94-6239-648-7_98 How to use a DOI?
Keywords: Retinal fundus synthesis; Vessel mask; Conditional GAN; Pix2Pix; Evaluation metrics
Abstract: Clinically viable retinal fundus synthesis from vessel masks requires photorealistic appearance while maintaining anatomical agreement with the input structure. The task is treated as a structure-sensitive, high-resolution (512 × 512) paired translation problem, with a Pix2Pix-style model as a baseline. Evaluation uses distribution metrics (FID, KID) alongside paired measures (LPIPS, MS-SSIM) to separate set-level realism from target-aligned fidelity. Because adversarial training can vary substantially across epochs, several late-stage checkpoints are compared. The checkpoint with the best FID/KID often differs from the one with the best LPIPS/MS-SSIM, so checkpoint choice depends on whether the goal is realism for augmentation or stricter per-sample correspondence. Qualitative inspection is supported by an auxiliary vessel segmenter that visualizes the input mask, generated fundus image, and re-segmented vessels in a single layout. All experiments follow a fixed protocol on the combined DRIVE + CHASE DB1 training set (48 image–mask pairs) as an in-sample reference. Later checkpoints (e.g., epoch 190) show fewer texture/color artifacts and more consistent vessel structure than earlier ones (e.g., epoch 70), indicating that training stage can strongly affect the realism–fidelity balance under limited paired supervision.
Copyright: © 2026 The Author(s)
Open Access: Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

<Previous Article In Volume

Next Article In Volume>

Volume Title: Proceedings of the International Workshop on Advances in Deep Learning for Image Analysis and Computer Vision (IWADIC 2025)
Series: Advances in Computer Science Research
Publication Date: 24 April 2026
ISBN: 978-94-6239-648-7
ISSN: 2352-538X
DOI: 10.2991/978-94-6239-648-7_98 How to use a DOI?
Open Access: Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

ris enw bib

TY  - CONF
AU  - Jiaqiang Yang
PY  - 2026
DA  - 2026/04/24
TI  - Evaluating High-Resolution Vessel Mask-to-Fundus Translation under Non-Monotonic GAN Dynamics
BT  - Proceedings of the International Workshop on Advances in Deep Learning for Image Analysis and Computer Vision (IWADIC 2025)
PB  - Atlantis Press
SP  - 913
EP  - 922
SN  - 2352-538X
UR  - https://doi.org/10.2991/978-94-6239-648-7_98
DO  - 10.2991/978-94-6239-648-7_98
ID  - Yang2026
ER  -

download .riscopy to clipboard