Proceedings of the 2025 5th International Conference on Culture, Design and Social Development (CDSD 2025)

A Semantic–Visual Reasoning Framework for Guangdong Temple Fairs

Authors
Yang Zhao1, *, Haorui Yu2, Lingan Chen3, Qing Zhao4
1School of Art and Design, Guangzhou Institute of Science and Technology, Guangzhou, 510540, China
2Duncan of Jordanstone College of Art & Design (DJCAD), University of Dundee, Dundee, DD1 4HT, United Kingdom
3Fashion School, Guangdong Polytechnic, Foshan, 528041, China
4College of Economics and Management, Wuhu Vocational Technical University, Wuhu, 241003, China
*Corresponding author. Email: zhaoyang971010@gmail.com
Corresponding Author
Yang Zhao
Available Online 26 February 2026.
DOI
10.2991/978-2-38476-541-6_72How to use a DOI?
Keywords
Digital Cultural Heritage; Guangdong Temple Fairs; Knowledge Graph; Vision-Language Models; Semantic Reasoning
Abstract

Digital technologies have significantly advanced the visual archiving of intangible cultural heritage (ICH). However, existing methods often struggle to interpret the complex social structures embedded within these activities. This limitation is particularly evident in the digitization of cultural heritage in Guangdong. While state-of-the-art visual language models (VLMs) can identify superficial visual elements—such as clothing, crowds, and architectural forms—they fail to recognize the underlying ritual rules, clan organizations, and spatial hierarchies. To bridge this cognitive gap, this paper proposes a Semantic-Visual Reasoning Framework (SVRF). This framework combines domain-specific knowledge graphs (KGs) with VLM-based visual extraction to ensure semantic consistency in scene understanding. Focusing on the intangible cultural heritage of Guangdong temple fairs, we constructed an ontology that explicitly models key sociological elements, including participants, sacred spaces, and ritual sequences. This ontology serves as a semantic constraint layer, guiding visual analysis. Through a qualitative case study of the Foshan Ancestral Temple Fair, we illustrate how SVRF reduces misinterpretations in high-context cultural scenes by integrating visual evidence with social logic. This research provides a novel computational approach for interpreting the multi-layered social dynamics of regional heritage activities, facilitating a shift beyond static archiving towards semantic interpretation.

Copyright
© 2026 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

Volume Title
Proceedings of the 2025 5th International Conference on Culture, Design and Social Development (CDSD 2025)
Series
Advances in Social Science, Education and Humanities Research
Publication Date
26 February 2026
ISBN
978-2-38476-541-6
ISSN
2352-5398
DOI
10.2991/978-2-38476-541-6_72How to use a DOI?
Copyright
© 2026 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

TY  - CONF
AU  - Yang Zhao
AU  - Haorui Yu
AU  - Lingan Chen
AU  - Qing Zhao
PY  - 2026
DA  - 2026/02/26
TI  - A Semantic–Visual Reasoning Framework for Guangdong Temple Fairs
BT  - Proceedings of the 2025 5th International Conference on Culture, Design and Social Development (CDSD 2025)
PB  - Atlantis Press
SP  - 657
EP  - 663
SN  - 2352-5398
UR  - https://doi.org/10.2991/978-2-38476-541-6_72
DO  - 10.2991/978-2-38476-541-6_72
ID  - Zhao2026
ER  -