Paper Summary
Share...

Direct link:

Predicting General Surgery Resident Competence: Evaluating Accuracy and Fairness of Statistical Learning Methods

Thu, April 9, 4:15 to 5:45pm PDT (4:15 to 5:45pm PDT), JW Marriott Los Angeles L.A. LIVE, Floor: 4th Floor, Diamond 3

Abstract

This study aimed to develop and evaluate machine learning models to predict intraoperative competence among General Surgery trainees. We analyzed ratings from 3,793 trainees using data from the Society for Improving Medical Professional Learning OR. Seven classification models were trained and evaluated in terms of accuracy and fairness. The k-nearest neighbors (KNN) demonstrated the highest sensitivity (0.747) and F1 score (0.741) while Gradient Boosting Machines (GBM) achieved the highest accuracy (0.717). Support Vector Machines (SVM) had the highest specificity (0.744). Fairness analysis showed minimal gender disparities, with KNN showing the most balanced performance. Results were consistent in a subset of laparoscopic cholecystectomy procedures. These findings support the potential of machine learning models to enhance surgical education through data-driven performance assessment.

Authors