The Linguistic Identity of Short Message Service (SMS) Sobis
A Corpus Linguistic Analysis Using Antconc
- DOI
- 10.2991/978-2-38476-394-8_18How to use a DOI?
- Keywords
- Sobis; corpus linguistics; AntConc; collocation
- Abstract
This study identifies the linguistic identity of fraudulent Short Message Service (SMS), known locally in South Sulawesi as “Sobis,” using a corpus linguistic approach with AntConc. Unlike the common machine learning-based approaches to fraud detection, this study offers a linguistic analysis to gain deeper insight into the language patterns used. From 100 complaint SMS, the study identified 2,546 tokens and 882 unique words. Using N-Gram and KWIC analysis, 73 N-Gram types with 338 tokens were identified, indicating recurring communication patterns by Sobis. Six Sobis groups were identified based on linguistic markers used in their messages. These groups show differences in the use of words and phrases, such as markers of urgency, grammatical errors, and references to company names or contact numbers. These findings offer significant insights into understanding text-based fraud communication patterns, providing new perspectives for analyzing fraudulent messages in Indonesia.
- Copyright
- © 2025 The Author(s)
- Open Access
- Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
Cite this article
TY - CONF AU - Ikhwan M. Said AU - Kartika PY - 2025 DA - 2025/05/19 TI - The Linguistic Identity of Short Message Service (SMS) Sobis BT - Proceedings of The 5th International Conference on Linguistics and Cultural Studies 5 (ICLC-5 2024) PB - Atlantis Press SP - 144 EP - 151 SN - 2352-5398 UR - https://doi.org/10.2991/978-2-38476-394-8_18 DO - 10.2991/978-2-38476-394-8_18 ID - Said2025 ER -