Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.11851/12488
Title: | A Machine Learning Approach for Predicting Familial and Sporadic Disease Cases Based on Clinical Symptoms: Introduction of a New Dataset | Authors: | Sharafi, P. Arslan, H. Evans, S.E. Varan, A. Ayter, Ş. |
Keywords: | Ailesel Vakalar Familial Cases Machine Learning Makine Öğrenmesi Neurofibromatosis Type 1 Nörofibromatozis Tip 1 Sporadic Cases Sporadik Vakalar |
Publisher: | Refik Saydam National Public Health Agency (RSNPHA) | Abstract: | Objective: Neurofibromatosis type 1 (NF1) is a common yet complex neurogenetic disorder characterized by a highly variable clinical presentation, influenced by both genetic and environmental factors. While its genetic basis is well understood, the variability in symptoms among patients presents significant challenges for diagnosis and management. This study focuses on examining the differences in clinical features between sporadic and familial NF1 cases. Additionally, it evaluates the potential of machine learning techniques to predict sporadic NF1 cases based on clinical symptoms, offering insights into how computational approaches can complement traditional diagnostic methods. Methods: A retrospective analysis was conducted on the medical records of 241 NF1 patients, including 121 sporadic and 120 familial cases. The frequency of various clinical features, such as Lisch nodules, pseudoarthrosis, and hypertension, was compared between the groups. analysis of variance (ANOVA) was used to identify the most important features distinguishing sporadic cases from familial ones. Furthermore, multiple machine learning algorithms, including k-nearest neighbors, artificial neural networks, support vector machines, decision trees, and XGBoost, were employed to predict sporadic cases based on the identified features. Results: Among the machine learning models tested, the XGBoost algorithm demonstrated the highest predictive accuracy at 62.86%, indicating moderate reliability in identifying sporadic cases. Despite this limitation, the analysis revealed significant differences in clinical manifestations between the two groups. These differences suggest that shared genetic modifiers may play a critical role in shaping the observed genotype-phenotype relationship in NF1. Conclusion: This study represents the first detailed comparison of a broad spectrum of clinical symptoms between sporadic and familial NF1 cases. While machine learning models showed only moderate success in prediction, the findings provide valuable insights into the phenotypic variability of NF1 and underscore the importance of larger, more diverse datasets for improving predictive accuracy. These results hold significant potential for guiding personalized diagnostic and therapeutic strategies for NF1 patients. © (2025), (Refik Saydam National Public Health Agency (RSNPHA)). All rights reserved. | URI: | https://doi.org/10.5505/TurkHijyen.2025.06337 https://hdl.handle.net/20.500.11851/12488 |
ISSN: | 0377-9777 |
Appears in Collections: | Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection |
Show full item record
CORE Recommender
Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.