Multi-agent collaborative pathways for Chinese traditional architectural image generation

Artificial Intelligence Generated Content (AIGC) technology demonstrates significant potential in the fields of cultural heritage digitalization and cultural tourism design. However, when confronted with specific subjects such as Chinese traditional architecture, which embodies rich cultural connota...

Full description

Saved in:
Bibliographic Details
Published inScientific reports Vol. 15; no. 1; pp. 34596 - 15
Main Authors Lu, Yi, Yuan, Wenmiao, Wang, Mengyao, Wang, Pohsun, Wu, Sifan, Wu, Jiacheng, Xing, Wenzhuo, Xie, Wanxu, Yu, Fuze
Format Journal Article
LanguageEnglish
Published London Nature Publishing Group UK 03.10.2025
Nature Publishing Group
Nature Portfolio
Subjects
Online AccessGet full text
ISSN2045-2322
2045-2322
DOI10.1038/s41598-025-18130-7

Cover

More Information
Summary:Artificial Intelligence Generated Content (AIGC) technology demonstrates significant potential in the fields of cultural heritage digitalization and cultural tourism design. However, when confronted with specific subjects such as Chinese traditional architecture, which embodies rich cultural connotations and complex visual elements, existing technologies still exhibit limitations in understanding vague user requirements, ensuring cultural accuracy, and generating diverse and high-fidelity images. To address these challenges, this study proposes a text-to-image generation framework oriented towards Chinese traditional architecture, based on multi-agent collaboration. This framework integrates multiple intelligent agents responsible for user intent understanding, creative prompt generation, traditional architectural image generation, aesthetic and cultural relevance assessment, and collaborative workflow scheduling. By constructing a Chinese Traditional Architecture Cultural Knowledge Base (exemplified by the Beijing Central Axis) and designing a collaborative workflow among the agents, the framework can efficiently and accurately transform users’ colloquial and vague descriptions into Chinese traditional architectural images with profound cultural connotations and high visual fidelity. Experimental results demonstrate that this collaborative multi-agent framework significantly outperforms baseline models in terms of handling vague inputs, ensuring the cultural accuracy of generated images, enhancing user intent matching, and creative diversity. This research not only provides new theoretical perspectives and practical pathways for AI technology in the digital preservation, creative transformation, and intelligent design of cultural heritage, but also offers an effective tool for empowering innovative digital cultural tourism experiences, holding significance for technological exploration and cultural inheritance.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:2045-2322
2045-2322
DOI:10.1038/s41598-025-18130-7