Deep Sound Search Leveraging Pre-trained CNNs and Faiss for Animal Vocalization Analysis

D. Sathiya; P. Palanisamy; E. Nandhini; V. Jayamurugan; K. Kirthikcharan; N. Manoj

doi:10.2991/978-94-6463-718-2_36

<Previous Article In Volume

Next Article In Volume>

Deep Sound Search Leveraging Pre-trained CNNs and Faiss for Animal Vocalization Analysis

Authors

D. Sathiya¹^{, *}, P. Palanisamy², E. Nandhini², V. Jayamurugan³, K. Kirthikcharan³, N. Manoj³

¹Associate Professor, Computer Science and Engineering, KSR College of Engineering, Tiruchengode, Namakkal, Tamil Nadu, India

²Assistant Professor, Computer Science and Engineering, KSR College of Engineering, Tiruchengode, Namakkal, Tamil Nadu, India

³Student, Computer Science and Engineering, KSR College of Engineering, Tiruchengode, Namakkal, Tamil Nadu, India

^*Corresponding author. Email: sathiyad@ksrce.ac.in

Corresponding Author

D. Sathiya

Available Online 23 May 2025.

DOI: 10.2991/978-94-6463-718-2_36 How to use a DOI?
Keywords: deep learning; animal vocalization analysis; dual audio recordings; ensemble methods; autoencoders; wildlife monitoring
Abstract: Deep learning techniques such as pre-trained Convolutional Neural Networks (CNNs) and Faiss were applied to animal vocalization analysis in this work. Comprehensive bioacoustic data in recent years have shown the versatility and power of CNN based models, which have outperformed in classifying a myriad of animal sounds. The proposed architecture with the self-supervised transformers such as animal2vec improves the raw audio input severity by ensuring the model is designed to handle rare-event nature by including species that are less vocalized in the database. Use of dual audio recording systems provides reliable source attribution, and employing ensemble methods, where multiple classification models are trained, can further improve accuracy across a range of environments. Using pre-trained models accelerates the adaptation to new species, requiring less time to train new species, which makes these techniques more broadly applicable. Moreover, autoencoder-based vocal analysis helps suppress noise and capture minute patterns, enabling deep learning models to be more effective in challenging environments. The deep learning methods highlight scalability, efficiency, and versatility of the approach which can also be adopted as a rich tool in wildlife monitoring and species identification in diverse ecosystems.
Copyright: © 2025 The Author(s)
Open Access: Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

<Previous Article In Volume

Next Article In Volume>

Volume Title: Proceedings of the International Conference on Sustainability Innovation in Computing and Engineering (ICSICE 2024)
Series: Advances in Computer Science Research
Publication Date: 23 May 2025
ISBN: 978-94-6463-718-2
ISSN: 2352-538X
DOI: 10.2991/978-94-6463-718-2_36 How to use a DOI?
Open Access: Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

ris enw bib

TY  - CONF
AU  - D. Sathiya
AU  - P. Palanisamy
AU  - E. Nandhini
AU  - V. Jayamurugan
AU  - K. Kirthikcharan
AU  - N. Manoj
PY  - 2025
DA  - 2025/05/23
TI  - Deep Sound Search Leveraging Pre-trained CNNs and Faiss for Animal Vocalization Analysis
BT  - Proceedings of the International Conference on Sustainability Innovation in Computing and Engineering (ICSICE 2024)
PB  - Atlantis Press
SP  - 410
EP  - 423
SN  - 2352-538X
UR  - https://doi.org/10.2991/978-94-6463-718-2_36
DO  - 10.2991/978-94-6463-718-2_36
ID  - Sathiya2025
ER  -

download .riscopy to clipboard