z-logo
open-access-imgOpen Access
A Scalable Framework for Big Data Analytics in Psychological Research: Leveraging Distributed Systems and Cluster Management
Author(s) -
Nur Banu Ogur,
Celal Ceken,
Yavuz Selim Ogur,
Esra Yazici
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3617120
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Anxiety and depression are prevalent psychological disorders that can occur throughout life, with a notably higher prevalence among women during the perinatal period, encompassing pregnancy and the postpartum phase. The early detection and monitoring of these conditions are crucial for timely intervention and improved patient outcomes. Although healthcare analytics has progressed considerably, the extraction of actionable insights from large-scale patient data remains computationally intensive, especially under instant processing constraints. Furthermore, conventional healthcare infrastructures frequently lack the scalability, computational efficiency, and architectural flexibility required to integrate machine learning models into clinical workflows effectively. To address these challenges, the proposed distributed computing framework employs Apache Kafka for instant data streaming, Apache Spark for efficient in-memory machine learning–based analytics, and Kubernetes for orchestrating scalable, fault-tolerant deployment. This architectural configuration facilitates continuous data ingestion, accelerates analytical processing, and ensures system resilience, thereby enabling the timely identification of psychological conditions such as anxiety and depression during the perinatal period. Unlike schematic or batch-only prior work, we provide a production-ready, streaming-first clinical deployment and an empirical scaling analysis linking executors to end-to-end diagnostic latency and resource efficiency. Performance evaluations demonstrate the efficiency and scalability of the proposed system, highlighting its potential for real-world applications in healthcare analytics.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom