TEAM
Crisis to Thesis - Stress Analysis in PhD Students using ML

Background:
Graduate education, particularly at the PhD level, is known for its rigor, long duration, and high expectations, often leading to significant stress among students. As current or former graduate students, I'm sure we can all personally relate to the challenges associated with pursuing a PhD. The journey is filled with uncertainties—highly competitive job markets, low stipends that barely cover living expenses, and the constant struggle of self-doubt and imposter syndrome. Many PhD students feel the weight of peer pressure, comparing their progress, publications, and achievements to others. Additionally, despite years of specialized training, PhD graduates often face restricted job opportunities outside academia, as they are deemed “overqualified” for many positions but struggle to secure tenure-track roles. Understanding these stressors is crucial for creating a healthier academic environment. This project aims to analyze stress levels among PhD students using various demographic, academic, and lifestyle factors.
Objective:
The primary objective of this project is to identify and predict stress levels in PhD students based on multiple influencing factors. By employing data-driven techniques, we aim to uncover patterns and correlations that can provide insights into the primary stressors affecting doctoral students.
Data Collection & Sources:
- Survey Data: Design and distribute a survey targeting PhD students across various universities to collect first-hand responses.
- Existing Datasets: If surveying fails, explore publicly available datasets related to graduate student well-being, mental health, and stress.
- University Reports & Research Papers: Institutional reports and prior research on PhD student stress may provide valuable secondary data.
Potential Factors for Analysis:
- Demographics: Age, gender identity, sexual orientation, ethnicity, nationality (domestic/international), marital status, children
- Academic Factors: Year in program, university and program ranking, number of publications, advisor's age/gender, work load (teaching/research duties), financial status (stipend, funding security)
- Lifestyle Factors: Sleeping hours, social interactions, exercise frequency, access to mental health resources
- Environmental Factors: State or country of study, cost of living, institutional support for graduate students
- Career Uncertainty: Uncertainty about career future also causes stress, adding to the mental burden of PhD students.
Methodology:
- Data Preprocessing: Cleaning and handling missing data, normalizing numeric variables, and encoding categorical variables.
- Exploratory Data Analysis (EDA): Understanding distributions, correlations, and potential trends in stress levels.
Modeling Approaches:
- Linear Models: Multiple linear regression, logistic regression (for stress categories)
- Machine Learning Models: Decision trees, random forests, support vector machines (SVM)
- Neural Networks: Multi-layer perceptron (MLP) for stress prediction
Expected Outcomes:
- Identification of the most significant factors contributing to stress in PhD students.
- Predictive model to estimate stress levels based on individual profiles.
- Insights that can be shared with academic institutions to improve student support systems.
Impact & Relevance:
Understanding and addressing PhD student stress is crucial for fostering a healthier academic environment. The findings from this project can be used to advocate for improved institutional policies, better mental health resources, and structured support systems for graduate students. By highlighting the key stressors, universities can take data-driven actions to implement counseling services, financial aid initiatives, peer mentorship programs, and workload management strategies tailored to PhD students' needs. Our insights can help build a culture of mental well-being and create long-term improvements in how graduate programs support their students. This project is not just about analyzing stress but about driving change for a healthier and more sustainable academic experience.







