Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.11851/12488
Title: A Machine Learning Approach for Predicting Familial and Sporadic Disease Cases Based on Clinical Symptoms: Introduction of a New Dataset
Authors: Sharafi, P.
Arslan, H.
Evans, S.E.
Varan, A.
Ayter, Ş.
Keywords: Ailesel Vakalar
Familial Cases
Machine Learning
Makine Öğrenmesi
Neurofibromatosis Type 1
Nörofibromatozis Tip 1
Sporadic Cases
Sporadik Vakalar
Publisher: Refik Saydam National Public Health Agency (RSNPHA)
Abstract: Objective: Neurofibromatosis type 1 (NF1) is a common yet complex neurogenetic disorder characterized by a highly variable clinical presentation, influenced by both genetic and environmental factors. While its genetic basis is well understood, the variability in symptoms among patients presents significant challenges for diagnosis and management. This study focuses on examining the differences in clinical features between sporadic and familial NF1 cases. Additionally, it evaluates the potential of machine learning techniques to predict sporadic NF1 cases based on clinical symptoms, offering insights into how computational approaches can complement traditional diagnostic methods. Methods: A retrospective analysis was conducted on the medical records of 241 NF1 patients, including 121 sporadic and 120 familial cases. The frequency of various clinical features, such as Lisch nodules, pseudoarthrosis, and hypertension, was compared between the groups. analysis of variance (ANOVA) was used to identify the most important features distinguishing sporadic cases from familial ones. Furthermore, multiple machine learning algorithms, including k-nearest neighbors, artificial neural networks, support vector machines, decision trees, and XGBoost, were employed to predict sporadic cases based on the identified features. Results: Among the machine learning models tested, the XGBoost algorithm demonstrated the highest predictive accuracy at 62.86%, indicating moderate reliability in identifying sporadic cases. Despite this limitation, the analysis revealed significant differences in clinical manifestations between the two groups. These differences suggest that shared genetic modifiers may play a critical role in shaping the observed genotype-phenotype relationship in NF1. Conclusion: This study represents the first detailed comparison of a broad spectrum of clinical symptoms between sporadic and familial NF1 cases. While machine learning models showed only moderate success in prediction, the findings provide valuable insights into the phenotypic variability of NF1 and underscore the importance of larger, more diverse datasets for improving predictive accuracy. These results hold significant potential for guiding personalized diagnostic and therapeutic strategies for NF1 patients. © (2025), (Refik Saydam National Public Health Agency (RSNPHA)). All rights reserved.
URI: https://doi.org/10.5505/TurkHijyen.2025.06337
https://hdl.handle.net/20.500.11851/12488
ISSN: 0377-9777
Appears in Collections:Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection

Show full item record



CORE Recommender

Google ScholarTM

Check




Altmetric


Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.