Proceedings of the International Conference on Recent Advances in Intelligent and Sustainable Technologies (RAIST 2026)

International Conference on Recent Advances in Intelligent and Sustainable Technologies (RAIST 2026)

📍Surat, India🗓️ 19-21 February 2026

Learnova – ML Powered Smart Learning System

Authors
Pragati Thawkar1, *, Mendu Vaishnavi1, Mittapalli Aneesha1, Saurav Dabhade1, Shrikant Salve1
1Department of Computer Science and Engineering, Indian Institute of Information Technology, Pune, India
*Corresponding author. Email: cathawkar04@gmail.com
Corresponding Author
Pragati Thawkar
Available Online 18 June 2026.
DOI
10.2991/978-94-6239-707-1_9How to use a DOI?
Keywords
Machine Learning; Natural Language Processing (NLP); OCR; Topic Modelling; Keyword Extraction; TextRank; Educational Data Mining; PYQ Analysis; Intelligent Learning Systems
Abstract

Students often struggle to prioritize concepts during exam preparation due to the lack of structured insights from Previous Years’ Question Papers (PYQs), which remain one of the most valuable yet underutilized academic resources. We have proposed Machine Learning (ML)- powered smart PYQ analyser system designed to help college students to identify high-weightage and frequently asked topics by analysing Previous Years’ Question Papers. Learnova (name given to our proposed solution) automates this entire process by extracting questions from scanned or digital PYQ documents using a pre-trained CNN-based Optical Character Recognition (OCR) model. The extracted text is processed through advanced Natural Language Processing (NLP) pipelines, where keyword extraction, topic identification and frequency computation are performed using techniques such as RAKE and TextRank. Additionally, a Sentence-BERT (SBERT) model is trained to generate semantic embeddings, enabling accurate semantic clustering of conceptually similar topics. After clustering, frequency mapping is applied to quantify topic recurrence across multiple question papers. Finally, the system ranks and displays the top ten high-weightage topics that are most likely to reappear in future examinations. By converting unstructured question papers into data-driven insights, Learnova enables efficient revision, reduces manual analysis time from several hours to just 40 seconds and enhances exam preparedness. Comparative analysis shows that the proposed system achieves 86% accuracy, significantly outperforming traditional manual methods and basic keyword-search systems in both precision and processing speed.

Copyright
© 2026 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

Volume Title
Proceedings of the International Conference on Recent Advances in Intelligent and Sustainable Technologies (RAIST 2026)
Series
Atlantis Highlights in Intelligent Systems
Publication Date
18 June 2026
ISBN
978-94-6239-707-1
ISSN
2589-4919
DOI
10.2991/978-94-6239-707-1_9How to use a DOI?
Copyright
© 2026 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

TY  - CONF
AU  - Pragati Thawkar
AU  - Mendu Vaishnavi
AU  - Mittapalli Aneesha
AU  - Saurav Dabhade
AU  - Shrikant Salve
PY  - 2026
DA  - 2026/06/18
TI  - Learnova – ML Powered Smart Learning System
BT  - Proceedings of the International Conference on Recent Advances in Intelligent and Sustainable Technologies (RAIST 2026)
PB  - Atlantis Press
SP  - 100
EP  - 109
SN  - 2589-4919
UR  - https://doi.org/10.2991/978-94-6239-707-1_9
DO  - 10.2991/978-94-6239-707-1_9
ID  - Thawkar2026
ER  -