Proceedings of the Global Conference on Sustainable Energy Systems, Smart Electronics and Intelligent Computing (GCSESEIC 2025)

Big Data Event Streaming with Apache Kafka for Improved Data Flows in IoT using Optimized Kafka-Based Data Streaming Workflow

Authors
N. Javed1, *, R. Yogesh Rajumar2
1Department of Computer Science and Engineering, Bharath Institute of Higher Education and Research, Chennai, India
2Department of Information and Technology, Bharath Institute of Higher Education and Research, Chennai, India
*Corresponding author. Email: javednjaved@gmail.com
Corresponding Author
N. Javed
Available Online 24 April 2026.
DOI
10.2991/978-94-6239-654-8_24How to use a DOI?
Keywords
Apache Kafka; Data Streaming; Real-Time Analytics; Distributed Architecture; Message Queuing; Scalable Data Pipelines; Event Stream Throughput; Low-Latency Processing; Big Data Infrastructure
Abstract

The distributed architecture and message queuing features of Apache Kafka significantly improve the reliability and efficiency of batch and real-time data processing. This research aims to create a scalable and dependable data streaming setup by optimizing Kafka deployments, data splitting, and Kafka Connect integration. The study focuses on enhancing data processing, input, and distribution across applications and systems. The project seeks to achieve substantial improvements in data processing speed, real-time analytics, and scalable data pipelines by fine-tuning Kafka settings and using its features. The analysis of Kafka Event Stream Throughput Over Time and Latency Distribution Across Brokers demonstrates the system's performance and efficiency. The results show a latency distribution of 6-15 milliseconds and a throughput of 750-1340 events per hour. Additionally, the Consumer Lag Over Time analysis indicates consistent performance, with values ranging from 70 to 140. This study highlights the effectiveness of Apache Kafka in creating a reliable and efficient data streaming infrastructure, which advances big data processing. The findings provide valuable insights for businesses aiming to make the most of real-time analytics and improve their data pipelines.

Copyright
© 2026 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

Volume Title
Proceedings of the Global Conference on Sustainable Energy Systems, Smart Electronics and Intelligent Computing (GCSESEIC 2025)
Series
Advances in Engineering Research
Publication Date
24 April 2026
ISBN
978-94-6239-654-8
ISSN
2352-5401
DOI
10.2991/978-94-6239-654-8_24How to use a DOI?
Copyright
© 2026 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

TY  - CONF
AU  - N. Javed
AU  - R. Yogesh Rajumar
PY  - 2026
DA  - 2026/04/24
TI  - Big Data Event Streaming with Apache Kafka for Improved Data Flows in IoT using Optimized Kafka-Based Data Streaming Workflow
BT  - Proceedings of the Global Conference on Sustainable Energy Systems, Smart Electronics and Intelligent Computing (GCSESEIC 2025)
PB  - Atlantis Press
SP  - 277
EP  - 287
SN  - 2352-5401
UR  - https://doi.org/10.2991/978-94-6239-654-8_24
DO  - 10.2991/978-94-6239-654-8_24
ID  - Javed2026
ER  -