Resident Department of Neurosurgery, University of Cincinnati College of Medicine Cincinnati, OH, US
Introduction: With excitement in the medical community around artificial intelligence, machine learning (ML) techniques have been applied to correlate clinical and radiographic variables with intracranial aneurysm (IA) rupture status. Many of these studies focus on regression-based models, which can lack performance in complex nonlinear relationships. In this study, we applied various ML techniques, including Random Forest (RF), XGBoost (XGB), Support Vector Machines (SVM), and Multi-Layer Perceptron (MLP), to predict IA rupture status.
Methods: The dataset consisted of 178 IAs each with 58 clinical and radiographic features for analysis. We removed features with high correlation (>0.8) with respect to the target variable to reduce redundancy. We applied grid search to fine-tune the hyperparameters for each model and increased the class weights to address the imbalance in the data. To minimize the effects of variability inherent in model performance, each model was evaluated across five iterations of 5-fold cross-validation. Overall performance metrics (accuracy, precision, recall, and F1-score) were extracted. The Wilcoxon signed-rank test was used to compare the AUC scores between models.
Results: The average IA size was 8.5 mm. The most common locations were ICA (42), AComm (41), MCA (32), and PComm (25). The AUC for the RF (0.85) and XGB (0.76) models were significantly higher than those for the SVM (0.69) and MLP (0.65) models (p < 0.05). There was no statistical difference in accuracy between RF and XBG models (p = 0.144). Fractal dimension ranked as the most important feature for model performance across all models. 3-Dimensional (3D) shape features made up 10 of the 15 most important features driving model performance.
Conclusion : Among the models, RF achieved the highest accuracy (85%) with balanced precision and recall. Across models, 3D geometric features drove model performance highlighting the importance of these features in predicting rupture status.