Search
On-Site Program Calendar
Browse By Day
Browse By Time
Browse By Person
Browse By Room
Browse By Unit
Browse By Session Type
Search Tips
Change Preferences / Time Zone
Sign In
Bluesky
Threads
X (Twitter)
YouTube
Grounded in theoretical and empirical literature, this study utilized supervised machine learning to address four key questions on predicting STEM career interest at the end of high school. Using the Outreach Programs and Science Career Intentions dataset (N = 15,847), we identified 75 predictors of STEM career interest across various domains. ML models were trained using lasso regression, decision tree, random forest, extreme gradient boosting, and support vector machines. Key findings include: (1) XGBoost achieved the highest prediction accuracy of 80.76%, (2) Fifteen key predictors were identified, (3) Nonlinear patterns and interactions, such as gender and course likeness, were highlighted, and (4) ML revealed more variables and complex effects than traditional logistic regression. It advances theoretical understanding and complements traditional methods.