Binary-Class AI-Generated Content Detection Through Comprehensive Feature Engineering
- DOI
- 10.2991/978-94-6239-664-7_74How to use a DOI?
- Keywords
- AI content detection; feature engineering; machine learning; deep learning; transformers; text classification; content authenticity
- Abstract
The proliferation of AI-generated content necessitates robust detection systems across academia, journalism, and digital media. This study presents a comprehensive feature engineering framework combining 2,537 linguistic, stylometric, and semantic features for automated discrimination of AI-generated versus human-written text across four content categories. We systematically evaluate nine classification models spanning traditional machine learning, deep neural networks, and transformer architectures on a balanced dataset of 1,367 samples. Our feature engineering approach, refined to 800 optimal discriminators through chisquared selection, demonstrates that carefully engineered features with classical classifiers can surpass complex neural architectures while maintaining computational efficiency. This work provides empirical evidence for feature-centric model design and practical deployment insights for production environments.
- Copyright
- © 2026 The Author(s)
- Open Access
- Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
Cite this article
TY - CONF AU - Afifa Hoque Tisha AU - Ayesha Banu AU - Fatema-Tuj-Johora AU - Riad Hossain PY - 2026 DA - 2026/06/08 TI - Binary-Class AI-Generated Content Detection Through Comprehensive Feature Engineering BT - Proceedings of the International Conference on Intelligent Data Analysis and Applications (IDAA 2025) PB - Atlantis Press SP - 1079 EP - 1095 SN - 1951-6851 UR - https://doi.org/10.2991/978-94-6239-664-7_74 DO - 10.2991/978-94-6239-664-7_74 ID - Tisha2026 ER -