Proceedings of the 2025 2nd International Conference on Mechanics, Electronics Engineering and Automation (ICMEEA 2025)

GPU Accelerated Image Processing Technology: Architectural Features, Parallel Optimization and Deep Learning Applications

Authors
Shaowei Chen1, *
1School of Physics and Information Engineering, Fuzhou University, Fuzhou, 350100, China
*Corresponding author. Email: 172109076@fzu.edu.cn
Corresponding Author
Shaowei Chen
Available Online 31 August 2025.
DOI
10.2991/978-94-6463-821-9_40How to use a DOI?
Keywords
GPU; Image Processing; CUDA; CNN
Abstract

This paper systematically discusses the technical basis and optimization methods of GPU-accelerated image processing, focusing on the analysis of GPU architectural features and CUDA and OpenCL parallel computing platforms. It also covers GPU optimization methods in image defogging and enhancement, frequency domain transform (FFT, DCT), feature detection (SIFT) and deep learning driven image processing. The experimental data shows that the parallel optimization strategy based on GPU, in which the computational efficiency of the defogging algorithm is improved by 10 times, the performance of FFT reaches 1000GFlops, and the acceleration ratio of SIFT is up to 121.99 times. Further combining with deep learning scenarios, GPU-based CNN training optimization improves the GEMM operation throughput by 1.97 times. Furthermore, it constructs the OUR-GAN framework to achieve 16k image generation while reducing memory usage to 12.5GB. The study proves that GPU-based software and hardware co-optimization can significantly improve the efficiency of image processing and provide technical support for real-time applications and large-scale computation.

Copyright
© 2025 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

Volume Title
Proceedings of the 2025 2nd International Conference on Mechanics, Electronics Engineering and Automation (ICMEEA 2025)
Series
Advances in Engineering Research
Publication Date
31 August 2025
ISBN
978-94-6463-821-9
ISSN
2352-5401
DOI
10.2991/978-94-6463-821-9_40How to use a DOI?
Copyright
© 2025 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

TY  - CONF
AU  - Shaowei Chen
PY  - 2025
DA  - 2025/08/31
TI  - GPU Accelerated Image Processing Technology: Architectural Features, Parallel Optimization and Deep Learning Applications
BT  - Proceedings of the 2025 2nd International Conference on Mechanics, Electronics Engineering and Automation (ICMEEA 2025)
PB  - Atlantis Press
SP  - 386
EP  - 393
SN  - 2352-5401
UR  - https://doi.org/10.2991/978-94-6463-821-9_40
DO  - 10.2991/978-94-6463-821-9_40
ID  - Chen2025
ER  -