Machine Learning-Based Big Data Analytics for Personalized Health Insurance Services
Keywords:
Big Data Analytics, Personalized Health Insurance, Machine Learning, Predictive Risk Modeling, InsurTechAbstract
The conventional health insurance sector faces challenges in accurately evaluating risk and pricing policies, as it primarily relies on aggregated demographic data and general medical histories. This often leads to inefficient premium structures and limited preventive care opportunities, highlighting the need for more individualized risk assessment. This study aims to examine how Big Data Analytics (BDA) and machine learning can enhance the design of personalized health insurance products by integrating real-time, individual-level data beyond traditional actuarial methods. The method involves analyzing a large dataset of 50,000 policyholders over three years, including anonymized electronic health records (EHRs), wearable device data (activity and vital signs), social determinants of health (SDOH), and claims history. Predictive modeling was conducted using XGBoost and Random Forest to estimate individual-level claim frequency and severity. The results show that the BDA-driven approach outperforms traditional actuarial models, achieving an AUC of 0.89. Key predictors of high-cost claims include sleep quality and heart rate variability. These insights enable the creation of hyper-segmented insurance products with dynamic premiums and behavior-based incentives. In conclusion, integrating BDA into health insurance underwriting improves pricing accuracy, reduces adverse selection, and enhances profitability. It also supports the development of personalized insurance products that encourage proactive health management, representing a significant advancement in risk management within the InsurTech industry.
Downloads
References
[1] J. Yu and Y. Zhang, “Challenges and opportunities of deep learning-based process fault detection and diagnosis: a review,” Neural Computing and Applications, vol. 35, no. 1, pp. 211–252, 2023.
[2] R. Rosati, L. Romeo, G. Cecchini, F. Tonetto, P. Viti, A. Mancini, and E. Frontoni, “From knowledge-based to big data analytic model: a novel iot and machine learning based decision support system for predictive maintenance in industry 4.0,” Journal of Intelligent Manufacturing, vol. 34, no. 1, pp. 107–121, 2023.
[3] E. Susetyono, D. S. Priyarsono, A. Sukmawati, and P. Nurhayati, “Improving risk management maturity in ultra micro soe holding companies,” Aptisi Transactions on Technopreneurship (ATT), vol. 8, no. 1, pp. 310–324, 2026.
[4] R. G. Munthe, M. Susan, and B. M. Sulungbudi, “The role of internal marketing in building organizational commitment and reducing turnover intention affecting the improved performance of life insurance agents in indonesia,” Aptisi Transactions on Technopreneurship (ATT), vol. 6, no. 1, pp. 56–71, 2024.
[5] N. Sharma, M. Saharia, and G. Ramana, “High resolution landslide susceptibility mapping using ensem-ble machine learning and geospatial big data,” Catena, vol. 235, p. 107653, 2024.
[6] U. Rahardja, “Risk assessment, risk identification, and control in the process of steel smelting using the hiradc method,” APTISI Transactions on Management, vol. 7, no. 3, pp. 261–272, 2023.
[7] I. Khong, N. A. Yusuf, A. Nuriman, and A. B. Yadila, “Exploring the impact of data quality on decision-making processes in information intensive organizations,” APTISI Transactions on Management, vol. 7, no. 3, pp. 253–260, 2023.
[8] A. S. Rafika, A. Faturahman, B. N. Henry, F. D. Yulian, and M. Hassan, “Ai-driven big data solutions for personalized healthcare: Analyzing patient data to improve treatment outcomes,” Journal of Computer Science and Technology Application, vol. 2, no. 1, pp. 36–45, 2025.
[9] D. Jonas, H. D. Purnomo, A. Iriani, I. Sembiring, D. P. Kristiadi, and Z. Nanle, “Iot-based community smart health service model: Empowering entrepreneurs in health innovation,” Aptisi Transactions on Technopreneurship (ATT), vol. 7, no. 1, pp. 61–71, 2025.
[10] G. Kaur and A. Sharma, “A deep learning-based model using hybrid feature extraction approach for consumer sentiment analysis,” Journal of big data, vol. 10, no. 1, p. 5, 2023.
[11] R. D. Destiani and A. N. Mufiidah, “Era baru ekonomi digital: Studi komprehensif tentang teknologi dan pasar,” ADI Bisnis Digital Interdisiplin Jurnal, vol. 5, no. 1, pp. 47–50, 2024.
[12] I. Hidayat, M. Z. Ali, and A. Arshad, “Machine learning-based intrusion detection system: an experimen-tal comparison,” Journal of Computational and Cognitive Engineering, vol. 2, no. 2, pp. 88–97, 2023.
[13] W. Usino, D. A. R. Kusumawardhani, T. Ramadhan, A. Pratiangga, and O. Qurotulain, “Big data analyt-ics: Transforming business intelligence and decision making,” Journal of Computer Science and Technol-ogy Application, vol. 1, no. 2, pp. 154–163, 2024.
[14] Z. Liu, L. Fang, D. Jiang, and R. Qu, “A machine-learning-based fault diagnosis method with adaptive secondary sampling for multiphase drive systems,” IEEE transactions on power electronics, vol. 37, no. 8, pp. 8767–8772, 2022.
[15] Z. H. Jaffari, H. Jeong, J. Shin, J. Kwak, C. Son, Y.-G. Lee, S. Kim, K. Chon, and K. H. Cho, “Machine-learning-based prediction and optimization of emerging contaminants’ adsorption capacity on biochar materials,” Chemical Engineering Journal, vol. 466, p. 143073, 2023.
[16] M. Choiri, E. S. Pramudito, F. Sutisna, and R. S. Sean, “Business artificial intelligence for enhancing sustainable decision intelligence,” IAIC Transactions on Sustainable Digital Innovation (ITSDI), vol. 7, no. 1, pp. 106–116, 2025.
[17] E. Ileberi, Y. Sun, and Z. Wang, “A machine learning based credit card fraud detection using the ga algorithm for feature selection,” Journal of Big Data, vol. 9, no. 1, p. 24, 2022.
[18] S. Septiani, P. Seviawani et al., “Penggunaan big data untuk personalisasi layanan dalam bisnis e-commerce,” ADI Bisnis Digital Interdisiplin Jurnal, vol. 5, no. 1, pp. 51–57, 2024.
[19] J. D. Gates, Y. Yulianti, and G. A. Pangilinan, “Big data analytics for predictive insights in healthcare,” International Transactions on Artificial Intelligence, vol. 3, no. 1, pp. 54–63, 2024.
[20] M. H. R. Chakim, R. T. Utami, T. W. Sitanggang, A. Tanjung, A. Rizky, and E. A. Beldiq, “Innova-tion behavior research: Global trends and emerging themes in entrepreneurial business practices,” Aptisi Transactions on Technopreneurship (ATT), vol. 6, no. 3, pp. 574–585, 2024.
[21] C. Zhang, H. Dong, Y. Geng, H. Liang, and X. Liu, “Machine learning based prediction for china’s municipal solid waste under the shared socioeconomic pathways,” Journal of environmental management, vol. 312, p. 114918, 2022.
[22] V. Agarwal, M. Lohani, and A. S. Bist, “A novel deep learning technique for medical image analysis using improved optimizer,” Health Informatics Journal, vol. 30, no. 2, p. 14604582241255584, 2024.
[23] J. Chen, S. Chen, R. Fu, D. Li, H. Jiang, C. Wang, Y. Peng, K. Jia, and B. J. Hicks, “Remote sensing big data for water environment monitoring: Current status, challenges, and future prospects,” Earth’s Future, vol. 10, no. 2, p. e2021EF002289, 2022.
[24] U. Rahardja, S.-C. Chen, Y.-C. Lin, T.-C. Tsai, Q. Aini, A. Khan, F. P. Oganda, E. R. Dewi, Y.-C. Cho, and C.-H. Hsu, “Evaluating the mediating mechanism of perceived trust and risk toward cryptocurrency: An empirical research,” SAGE Open, vol. 13, no. 4, p. 21582440231217854, 2023.
[25] B. Sharma, L. Sharma, C. Lal, and S. Roy, “Explainable artificial intelligence for intrusion detection in iot networks: A deep learning based approach,” Expert Systems with Applications, vol. 238, p. 121751, 2024.
[26] L. Kask, N. Bloom, and R. Porta, “Health informatics: Utilization of information technology in health care and patient management,” International Journal of Cyber and IT Service Management, vol. 4, no. 1, pp. 53–58, 2024.
[27] C. Lukita, N. Lutfiani, A. R. S. Panjaitan, U. Rahardja, M. L. Huzaifah et al., “Harnessing the power of random forest in predicting startup partnership success,” in 2023 Eighth International Conference on Informatics and Computing (ICIC). IEEE, 2023, pp. 1–6.
[28] M. Migunani, A. Setiawan, and I. Sembiring, “Optimizing automated machine learning for ensemble performance and overfitting mitigation,” Aptisi Transactions on Technopreneurship (ATT), vol. 7, no. 3, pp. 808–822, 2025.
[29] C. Lukita, A. W. A. Rahman, I. N. Hikam, and U. Rahardja, “Integrating strategic management with sdg 10 for sustainable development and equity,” Aptisi Transactions on Technopreneurship (ATT), vol. 7, no. 2, pp. 638–649, 2025.
[30] U. Rusilowati, U. Narimawati, Y. R. Wijayanti, U. Rahardja, and O. A. Al-Kamari, “Optimizing human resource planning through advanced management information systems: A technological approach,” Aptisi Transactions on Technopreneurship (ATT), vol. 6, no. 1, pp. 72–83, 2024.
[31] P. P. Mondal, A. Galodha, V. K. Verma, V. Singh, P. L. Show, M. K. Awasthi, B. Lall, S. Anees, K. Poll-mann, and R. Jain, “Review on machine learning-based bioprocess optimization, monitoring, and control systems,” Bioresource technology, vol. 370, p. 128523, 2023.
[32] U. Rahardja, Q. Aini, A. S. Bist, S. Maulana, and S. Millah, “Examining the interplay of technology readiness and behavioural intentions in health detection safe entry station,” JDM (Jurnal Dinamika Man-ajemen), vol. 15, no. 1, pp. 125–143, 2024.
[33] M. Hatta, W. N. Wahid, F. Yusuf, F. Hidayat, N. A. Santoso, and Q. Aini, “Enhancing predictive models in system development using machine learning algorithms,” International Journal of Cyber and IT Service Management, vol. 4, no. 2, pp. 80–87, 2024.
[34] M. A. Talukder, M. M. Islam, M. A. Uddin, K. F. Hasan, S. Sharmin, S. A. Alyami, and M. A. Moni, “Machine learning-based network intrusion detection for big and imbalanced data using oversampling, stacking feature embedding and feature extraction,” Journal of big data, vol. 11, no. 1, p. 33, 2024.
[35] A. Sutarman, D. Juliastuti, I. Yati, L. P. Pasha et al., “Enhancing security and privacy in blockchain systems for tax administration,” Blockchain Frontier Technology, vol. 4, no. 2, pp. 145–155, 2025.
[36] I. N. Pratiwi, D. D. O. Prabawati, E. D. Wahyuni, N. Nursalam, I. Y. Widyawati, and N. A. Yahaya, “En-trepreneurship in social media literacy and intentions for diabetes prevention among adolescent students,” Aptisi Transactions on Technopreneurship (ATT), vol. 8, no. 1, pp. 85–98, 2026.
[37] M. B. Karo, B. P. Miller, and O. A. Al-Kamari, “Leveraging data utilization and predictive analytics: Driv-ing innovation and enhancing decision making through ethical governance,” International Transactions on Education Technology (ITEE), vol. 2, no. 2, pp. 152–162, 2024.
[38] W. Usino, M. M. Sari, F. P. Oganda, O. P. M. Daeli, and E. Smith, “Artificial intelligence integration for sustainable business model innovation insights from global startups,” Sundara Advanced Research on Artificial Intelligence, vol. 1, no. 2, pp. 82–89, 2025.
[39] S. Tian, Y. Zhong, Z. Zheng, A. Ma, X. Tan, and L. Zhang, “Large-scale deep learning based binary and semantic change detection in ultra high resolution remote sensing imagery: From benchmark datasets to urban application,” ISPRS Journal of Photogrammetry and Remote Sensing, vol. 193, pp. 164–186, 2022.
[40] S. Punia and S. Shankar, “Predictive analytics for demand forecasting: A deep learning-based decision support system,” Knowledge-Based Systems, vol. 258, p. 109956, 2022.
[41] R. D. Hadiwidjaja, A. I. Suroso, H. Siregar, and I. Sailah, “Performance paradigm: Entrepreneurial good university governance mediating leadership style in state universities,” Aptisi Transactions on Techno-preneurship (ATT), vol. 6, no. 3, pp. 492–508, 2024.
[42] U. Rahardja, P. A. Sunarya, Q. Aini, S. Millah, and S. Maulana, “Technopreneurship in healthcare: Evalu-ating user satisfaction and trust in ai-driven safe entry stations,” Aptisi Transactions on Technopreneurship (ATT), vol. 6, no. 3, pp. 404–417, 2024.
[43] I. Sembiring, B. K. Aji, and T. I. Bayu, “Consortium blockchain framework for secure digital medical record innovation,” Aptisi Transactions on Technopreneurship (ATT), vol. 8, no. 1, pp. 138–151, 2026.
[44] R. E. Santoso, A. G. Prawiyogi, U. Rahardja, F. P. Oganda, and N. Khofifah, “Penggunaan dan manfaat big data dalam konten digital,” ADI Bisnis Digital Interdisiplin Jurnal, vol. 3, no. 2, pp. 157–160, 2022.
[45] U. Rahardja, Q. Aini, D. Manongga, I. Sembiring, and I. D. Girinzio, “Implementation of tensor flow in air quality monitoring based on artificial intelligence,” International Journal of Artificial Intelligence Research, vol. 6, no. 1, 2023.
[46] Q. Aini, I. Sembiring, A. Setiawan, I. Setiawan, and U. Rahardja, “Perceived accuracy and user behavior: Exploring the impact of ai-based air quality detection application (aiku),” Indonesian Journal of Applied Research (IJAR), vol. 4, no. 3, pp. 209–224, 2023.
[47] R. Salam, Q. Aini, B. A. A. Laksminingrum, B. N. Henry, U. Rahardja, and A. A. Putri, “Consumer adoption of artificial intelligence in air quality monitoring: A comprehensive utaut2 analysis,” in 2023 Eighth International Conference on Informatics and Computing (ICIC). IEEE, 2023, pp. 1–6.
[48] A. W. Kusuma, Y. Jumaryadi, A. Fitriani et al., “Examining the joint effects of air quality, socioeconomic factors on indonesian health,” Aptisi Transactions on Technopreneurship (ATT), vol. 5, no. 2sp, pp. 186–195, 2023.
[49] T. S. Goh, D. Jonas, B. Tjahjono, V. Agarwal, and M. Abbas, “Impact of ai on air quality monitoring systems: A structural equation modeling approach using utaut,” Sundara Advanced Research on Artificial Intelligence, vol. 1, no. 1, pp. 9–19, 2025.
[50] M. R. Takakobi and K. D. Hartomo, “Analisis metode klasifikasi nasabah potensial dalam membuka deposito jangka panjang melalui telemarketing menggunakan metode gradient boosting classifier,” Jurnal Impresi Indonesia, vol. 4, no. 5, pp. 1799–1809, 2025.
[51] T. Hidayat, D. Manongga, Y. Nataliani, S. Wijono, S. Y. Prasetyo, E. Maria, U. Raharja, I. Sembiring et al., “Performance prediction using cross validation (gridsearchcv) for stunting prevalence,” in 2024 IEEE International Conference on Artificial Intelligence and Mechatronics Systems (AIMS). IEEE, 2024, pp. 1–6.
[52] F. Syafariani, M. S. Lola, S. S. S. Abd Mutalib, W. N. F. W. Nasir, A. A. K. A. Hamid, and N. H. Zainuddin, “Leveraging a hybrid machine learning model for enhanced cyberbullying detection,” Aptisi Transactions on Technopreneurship (ATT), vol. 7, no. 2, pp. 371–386, 2025.
[53] D. Robert, F. P. Oganda, A. Sutarman, W. Hidayat, and A. Fitriani, “Machine learning techniques for predicting the success of ai-enabled startups in the digital economy,” CORISINTA, vol. 1, no. 1, pp. 61–69, 2024.
[54] D. Bennet, S. A. Anjani, O. P. Daeli, D. Martono, and C. S. Bangun, “Predictive analysis of startup ecosystems: Integration of technology acceptance models with random forest techniques,” CORISINTA, vol. 1, no. 1, pp. 70–79, 2024.
[55] M. Hardini, R. A. Sunarjo, M. Asfi, M. H. R. Chakim, and Y. P. A. Sanjaya, “Predicting air quality index using ensemble machine learning,” ADI Journal on Recent Innovation, vol. 5, no. 1Sp, pp. 78–86, 2023.
[56] D. R. Saputra, H. Nugroho, D. Julianingsih, and Z. Queen, “Understanding air pollution through ma-chine learning: Predictive analytics for urban management,” IAIC Transactions on Sustainable Digital Innovation (ITSDI), vol. 6, no. 1, pp. 75–85, 2024.
[57] R. Royani, S. D. Maulina, S. Sugiyono, R. W. Anugrah, and B. Callula, “Recent developments in health-care through machine learning and artificial intelligence,” IAIC Transactions on Sustainable Digital In-novation (ITSDI), vol. 6, no. 1, pp. 86–94, 2024.
[58] H. Zalukhu, K. W. D. Prastiyanto, I. Ramadhan, N. R. Ramadhan et al., “Penggunaan machine learning dalam startup dengan pemanfaatan smart pls,” Jurnal MENTARI: Manajemen, Pendidikan Dan Teknologi Informasi, vol. 2, no. 2, pp. 111–122, 2024.
[59] R. Aprianto, R. Haris, A. Williams, H. Agustian, and N. Aptwell, “Social influence on ai-driven air quality monitoring adoption: Smartpls analysis,” Sundara Advanced Research on Artificial Intelligence, vol. 1, no. 1, pp. 28–36, 2025.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Sipho Dlamini, Endah Kurniyaningrum, Kustiyono Kustiyono, Untung Rahardja, Qurotul Aini, Sahal Mahfudz (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.




