Data Science Boot Camp

Fall 2024

Sep 5, 2024 to Dec 13, 2024

This program is included with Fall 2024 Career Launch Cohort Enrollment and Erdős Institute Alumni Club Membership at no additional cost.


To access the program content, you must first create an account and member profile and be logged in.



Registration Deadlines

Sep 4, 2024


All Erdős Fall 2024 Career Launch Cohort or Alumni Club members who are not participating in the UX Research or Deep Learning Boot Camps




Launch, Core Program, Boot Camp, Projects, Certificates


The Erdős Institute's signature Data Science Boot Camp has been running since May 2018 thanks to the generous support of our sponsors, members, and partners. Due to its popularity, we now offer the boot camp online three times per year in two formats: a one-month intensive boot camp each May and a semester-long version each Spring and Fall.



Organizers, Instructors, and Advisors


Steven Gubkin, PhD

Lead Instructor

Office Hours:

MTWRF 12pm - 1pm ET, and by appt.


Preferred Contact:


Please feel free to message me on Slack with any questions!


Alec Clott, PhD

Head of Data Science Projects

Office Hours:

By appt. only


Preferred Contact:


Participants are welcome to reach out to me via Slack or email. I normally work standard EST hours (9am-5pm), but I can always find time to meet via Zoom after work too. Let me know how I can help!


The goal of our Data Science Boot Camp is to provide you with the skills and mentorship necessary to produce a portfolio-worthy data science/machine learning project, while also providing valuable career development support and connecting you with potential employers.

Project Examples


Aware NLP Project III

Mohammad Nooranidoost, Baian Liu, Craig Franze, Mustafa Anıl Tokmak, Himanshu Raj, Peter Williams


This project investigates and evaluates different retrieval methodologies for use in Retrieval-Augmented Generation (RAG) systems. In particular, it examines retrieval quality for information downloaded from employee subreddits. We compared advanced retrieval methods that use clustering, multi-vector indexing, and multi-querying against a naive retrieval baseline.

First Steps/Prerequisites

Participants should have a base-level familiarity with Python. Participants should also be familiar with some basic math concepts. Finally, you will also need to have your laptop or desktop computer set up for the course. If you are new to Python, need a quick math refresher, or if you need help setting up your computer, then please follow the link below.

Program Content



Project/Homework Instructions


Project/Team Formation
Project Submission
Projects README



Please check your registration email for program schedule and zoom links.

Project/Homework Deadlines
