top of page
Ohio State 2023 Report

Spring 2023 Cohort

108 grad students, postdocs, and faculty affiliated with OSU registered out of 550 participants total.

All registrants had access to the following programs and services

  • Career Exploration Seminars

    • Invitations to Industry (Weekly)

    • Invitations to Entrepreneurship (Monthly)
       

  • Data Science Boot Camp

    • 1 month intensive with mentored projects (4 classes per week)
       

  • Mini-courses

    • Python Prep

    • Data Visualization​

    • Software Engineering for Data Scientists

    • Quantum Computing

    • Mixed Methods in Business Intelligence

    • UX Research

  • Interview Prep (Monthly)

    • Foundations: Application Materials, Informational Interviews, Behavioral Interviews​

    • Technical: Whiteboarding, Paired Coding, Case Studies
       

  • 1-on-1 Career Coaching​ (On-Demand)

    • Resume Review​

    • Mock Interviews

    • Navigating/Negotiating Offers
       

  • Employer & Alumni Career Connections

    • Internal and LinkedIn Job Board

    • Introductions to our Hiring Partners​

    • Direct Introductions to our Alumni Members

May 2023 Data Science Boot Camp

Out of 34 teams, 3 teams with participants affiliated with OSU landed in the Top 5 Finals of our Spring 2023 Data Science Boot Camp with one of the teams winning First Place overall:

  • Team 28, "Correcting Racial Bias in Measurement of Blood Oxygen Saturation", FIRST PLACE

  • Team 37, "Protein Function Prediction", TOP 5

  • Team 39, "AirBnB", TOP 5

SPRING 2023

TEAM

Correcting Racial Bias in Measurement of Blood Oxygen Saturation

clear.png

Data Science Boot Camp

Rohan Myers, Saad Khalid, woojeong kim, Brooks Miner, Jaychandran Padayasi

Fingertip pulse oximeters are the current standard for estimating blood oxygen saturation without a blood draw, both at home and in healthcare settings. However, pulse oximeters overestimate oxygen saturation, often resulting in ‘hidden hypoxemia’: a patient has hypoxemia (dangerously low oxygen saturation), but the oximeter returns a healthy oxygen value. Unfortunately, oximeter overestimation of oxygen saturation is exacerbated for patients with darker skin tones due to light-based oximeter technology. This results in Black patients experiencing hidden hypoxemia at twice the rate of white patients. By combining pulse oximeter readings (SpO2) with additional patient data, we develop improved methods for estimating arterial blood oxygen saturation (SaO2) and identifying Hidden Hypoxemia. The predictions of our models are more accurate than pulse-oximeter readings alone, and remove the systematic racial inequity inherent in the current medical practice of using oximeter readings alone.

Screen Shot 2022-06-03 at 11.31.35 AM.png
github URL

SPRING 2023

TEAM

Protein Function Prediction

clear.png

Data Science Boot Camp

Eamon Byrne, Dustin Nguyen, Ness Mayker, Salma Abdelbaky

Predict the biological function of proteins based upon their amino acid sequence and other publicly available data.

This project would double as a team submission to the Fifth Critical Assessment of Functional Annotation (CAFA 5) competition - similar to the Critical Assessment of Structure Prediction (CASP 14) competition in which AlphaFold2 gained prominence (in 2021).

Here is the Kaggle competition (submissions are open for the next 3 months): https://www.kaggle.com/competitions/cafa-5-protein-function-prediction/
(1st prize is $15,000... lol)

Screen Shot 2022-06-03 at 11.31.35 AM.png
github URL

SPRING 2023

TEAM

Airbnb

clear.png

Data Science Boot Camp

Zheyu Ni, Muhammad Reza Averly, Ricky Oropeza, Shubhrika Ahuja, Praveen Shahani

How to make money in LA with Airbnb? Being a host has never been easy since they need to provide a quality experience for guests while considering the profit margin. We aim to alleviate their burdens by streamlining the decision-making process. We explore and analyze tens of thousands of listings in LA for insights. Then, we combined structural modeling and machine learning to customize pricing for new listings based on property locations, features, ownership, and substitution between nearby listings to maximize the host’s profit. The structural model helps capture supply & demand dynamics and machine learning helps capture the consumer consideration set (hotspot market). We also use machine learning for price prediction to utilize rich features better. Furthermore, we recommend areas with the best rate of return for potential hosts and suggest possible amenities to become popular in Airbnb.

Screen Shot 2022-06-03 at 11.31.35 AM.png
github URL

Fall 2023 Cohort

109 grad students, postdocs, and faculty affiliated with OSU registered out of 506 participants total.

All registrants had access to the following programs and services

  • Career Exploration Seminars

    • Invitations to Industry (Weekly)

    • Invitations to Entrepreneurship (Monthly)
       

  • Data Science Boot Camp

    • 12 weeks with mentored projects (1 class per week)
       

  • Mini-courses

    • Python Prep

    • Data Visualization​

    • Software Engineering for Data Scientists

    • Quantum Computing

    • Deep Learning

    • UX Research

  • Interview Prep (Monthly)

    • Foundations: Application Materials, Informational Interviews, Behavioral Interviews​

    • Technical: Whiteboarding, Paired Coding, Case Studies
       

  • 1-on-1 Career Coaching​ (On-Demand)

    • Resume Review​

    • Mock Interviews

    • Navigating/Negotiating Offers
       

  • Employer & Alumni Career Connections

    • Internal and LinkedIn Job Board

    • Introductions to our Hiring Partners​

    • Direct Introductions to our Alumni Members

Fall 2023 Data Science Boot Camp

Out of 37 teams, 3 teams with participants affiliated with OSU landed in the Top 5 Finals, while two more were awarded "with distinction":
 

  • Team 7, "Funk", TOP 5

  • Team 20, "AI-generated Image Detection", TOP 5

  • Team 25, "The Silent Emergency - Predicting Preterm Birth", WITH DISTINCTION

  • Team 27, "Somm", TOP 5

  • Team 45, "Biomedical Categorization" WITH DISTINCTION

FALL 2023

TEAM

Predicting car crashes based on weather

clear.png

Data Science Boot Camp

Nicholas Rouse, woojeong kim, Marco Lopez, Ayca Fatma Durmaz, Tasnim Iqbal

We attempt to predict how the number of car crashes vary based on the weather. There is large data set here https://www.kaggle.com/datasets/sobhanmoosavi/us-accidents for us to use.

Screen Shot 2022-06-03 at 11.31.35 AM.png
github URL

FALL 2023

TEAM

Donut vs Bagel (Demo F23)

clear.png

Data Science Boot Camp

This is a placeholder project to demonstrate what an existing team will look like. (Do not join and do not judge)

Classifying images of Bagels vs Donuts

Screen Shot 2022-06-03 at 11.31.35 AM.png
github URL

FALL 2023

TEAM

Team Salmon

clear.png

Data Science Boot Camp

Sean Haight, Andrew Staal, Aycin Iplikci-Arodirik, Sayantan Roy

Salmon populations on the Pacific Coast of North America have been in decline for the last 40 years. Yet salmon have important economic, cultural, and environmental impacts. Indigenous peoples have fished salmon since time immemorial, while commercial and recreational salmon fisheries are worth ~$641 million annually, being a substantial employer in British Columbia. Negative environmental impacts of declining salmon stocks include the disruption of energy and nutrient transfers between the Pacific Ocean and freshwater rivers, and the disruption of the food chain for salmon predators.

Our goal is to study salmon viability, i.e. salmon survival rates, using data science. Here we identify environmental, predatory, and human factors that have the largest impact on salmon populations in the Strait of Georgia. We reduce to a minimal set of features that have the most effect on salmon viability, and build a linear regression model using estimated features.

Screen Shot 2022-06-03 at 11.31.35 AM.png
github URL

John Beacom Testimonial

©2017-2026 by The Erdős Institute.

bottom of page