NLP-Powered Disease Identification and Medical Coding Automation
- DOI
- 10.2991/978-94-6463-858-5_245How to use a DOI?
- Keywords
- ICD-10 Codes; NLP models; Automata clinical coding; OCR; Deep learning; TF-IDF vectorizer; Gemini AI
- Abstract
The proliferation of unstructured medical documents necessitates efficient methods for extracting and analyzing patient symptoms to enhance clinical decision-making. Now-a-days, in medical field clinical coding and symptoms identification are doing manually or they are manual process. This paper presents a comprehensive approach that integrates Optical Character Recognition (OCR), Natural language Processing (NLP), and machine learning techniques to automate the extraction of symptoms from various document formats and predict corresponding diseases along with their ICD-10 codes. The system leverages TensorFlow for disease prediction, a TF-IDF vectorizer for symptom representation and incorporates Google’s Gemini AI for advanced text analysis. The proposed methodology demonstrates significant potential in streamlining medical document processing and improving diagnostic accuracy.
- Copyright
- © 2025 The Author(s)
- Open Access
- Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
Cite this article
TY - CONF AU - Ram Prasad Reddy Sadi AU - Budi Ramya AU - Balivada Bhuvan Deepankar AU - V. Neelamraju AU - Sai Diwakara Subrahmanyam AU - Badam Ramesh PY - 2025 DA - 2025/11/04 TI - NLP-Powered Disease Identification and Medical Coding Automation BT - Proceedings of International Conference on Computer Science and Communication Engineering (ICCSCE 2025) PB - Atlantis Press SP - 2911 EP - 2928 SN - 2352-538X UR - https://doi.org/10.2991/978-94-6463-858-5_245 DO - 10.2991/978-94-6463-858-5_245 ID - Sadi2025 ER -