Software Engineering for Data Scientists

Asynchronous


Category: Advance, Supplemental, Self-Directed, Mini-Course

Overview

The Software Engineering for Data Scientists course helps data scientists write production-ready code and gain familiarity with the tools used to make models available to their users. The core idea we will explore is making code robust and reusable across a team. This course can also serve as an introduction to ideas used in ML Ops and Data Engineering.

Organizers and Instructors

Kevin Nowland
Lead instructor, ML Ops Engineer

Office Hours: Intermittent Thursday afternoons
Email: kevin@erdosinstitute.org
Preferred Contact: Slack

Please reach out on Slack if you have any questions about the content in this course!

Objectives

After completing this course, you will be able to do the following:
- Understand common tools used to deploy models for real-time inference
- Improve your code's robustness through unit testing
- Improve your code's readability by using linters and type checking
- Use basic command line commands
- Implement a simple continuous integration pipeline using GitHub Actions
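
To give a flavor of the unit testing and type checking objectives, here is a minimal sketch. It is not taken from the course materials: the function `normalize` and the test class `NormalizeTests` are hypothetical names chosen for illustration, and the example uses only Python's standard `unittest` module. The type hints are the kind of annotation a type checker reads.

```python
import unittest


def normalize(values: list[float]) -> list[float]:
    """Scale values linearly so the smallest becomes 0.0 and the largest 1.0.

    Hypothetical example function; not part of the course repository.
    """
    if not values:
        raise ValueError("values must not be empty")
    low, high = min(values), max(values)
    if low == high:
        # All inputs are equal, so there is no range to scale by.
        return [0.0 for _ in values]
    return [(v - low) / (high - low) for v in values]


class NormalizeTests(unittest.TestCase):
    """Each test pins down one behavior, including the edge cases."""

    def test_scales_to_unit_interval(self) -> None:
        self.assertEqual(normalize([2.0, 4.0, 6.0]), [0.0, 0.5, 1.0])

    def test_constant_input_maps_to_zeros(self) -> None:
        self.assertEqual(normalize([3.0, 3.0]), [0.0, 0.0])

    def test_empty_input_raises(self) -> None:
        with self.assertRaises(ValueError):
            normalize([])
```

If this were saved in a file such as `test_normalize.py` (an assumed name), running `python -m unittest` from the same directory would discover and run the three tests.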

Slack Channel: #slack-channel

First Steps/Prerequisites

- Figure out how to access a terminal emulator, e.g., the Terminal program on macOS or Ubuntu
- If using Windows, enable the Windows Subsystem for Linux and access a terminal emulator from there
- Install pyenv and use it to install Python 3.10.x
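
Once Python is installed, one way to confirm that the interpreter on your path is a 3.10.x release is a short script like the sketch below. The helper `matches_required` and the constant `REQUIRED` are hypothetical names introduced for this illustration; only `sys.version_info` comes from the standard library.

```python
import sys

# Major and minor version taken from the prerequisite "Python 3.10.x".
REQUIRED = (3, 10)


def matches_required(
    version: tuple[int, ...], required: tuple[int, int] = REQUIRED
) -> bool:
    """Return True when the major and minor parts of version equal required."""
    return tuple(version[:2]) == required


if __name__ == "__main__":
    # Report the version of the interpreter that is running this script.
    current = tuple(sys.version_info[:3])
    status = "OK" if matches_required(current) else "unexpected"
    print(f"Python {'.'.join(map(str, current))}: {status}")
```

The patch number is ignored on purpose, since the prerequisite allows any 3.10.x release.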

Program Content

https://github.com/TheErdosInstitute/swe-for-ds

Textbook/Notes

Intro to the CLI - part 1

Getting ready to code

We’ll be talking about the different shells that allow you to interact with your computer, navigating the filesystem, and basic ways to manipulate files.

Schedule

Please check your registration email for the program schedule and Zoom links.


THE ERDŐS INSTITUTE

Helping PhDs get and create jobs they love at every stage of their career.
