Software Engineering for Data Scientists
Asynchronous
Category
Launch, Supplemental, Self-Directed, Mini-Course
Overview
The Software Engineering for Data Scientists course helps data scientists write production-ready code and gain familiarity with the tools used to make models available to their users. The core idea we will explore is making code robust and reusable across a team. This course can also serve as an introduction to ideas used in ML Ops and Data Engineering.
#slack-channel
Organizers, Instructors, and Advisors
Steven Gubkin
Head of Training and Assessment
Office Hours:
By appointment only
Preferred Contact:
Slack
Your primary contact for GitHub access.
Kevin Nowland
Lead instructor, ML Ops Engineer
Office Hours:
Intermittent Thursday Afternoons
Preferred Contact:
Slack
Please reach out on Slack if you have any questions about the content in this course!
Objectives
After completing this course, you will be able to do the following:
- Understand common tools used to deploy models for real-time inference
- Improve your code's robustness through unit testing
- Improve your code's readability by using linters and type checking
- Use basic command line commands
- Implement a simple continuous integration pipeline using GitHub Actions
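To give a flavor of the unit testing and type checking objectives, here is a minimal sketch. It is not taken from the course materials: the function `mean_absolute_error` and the test class are hypothetical names chosen for illustration, and the tests use Python's standard `unittest` module.

```python
import unittest


# Hypothetical helper, used only to illustrate type hints and unit tests.
def mean_absolute_error(y_true: list[float], y_pred: list[float]) -> float:
    """Return the average absolute difference between two equal-length lists."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("inputs must not be empty")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


class TestMeanAbsoluteError(unittest.TestCase):
    def test_known_value(self) -> None:
        # |1 - 2| = 1 and |2 - 4| = 2, so the mean is 1.5.
        self.assertAlmostEqual(mean_absolute_error([1.0, 2.0], [2.0, 4.0]), 1.5)

    def test_length_mismatch_raises(self) -> None:
        with self.assertRaises(ValueError):
            mean_absolute_error([1.0], [1.0, 2.0])
```

Saved to a file, the tests can be run with `python -m unittest <file>`. The type hints let a type checker flag calls that pass the wrong kind of argument before the code ever runs.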
First Steps/Prerequisites
- Make sure you can open a terminal emulator, e.g., the Terminal program on macOS or Ubuntu
- If using Windows, enable the Windows Subsystem for Linux and access a terminal emulator
- Install pyenv and use it to install Python 3.10.x
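Once Python is installed, one way to confirm that your terminal picks up a 3.10.x interpreter is a short script like the sketch below. The helper name `is_supported` is hypothetical and not part of the course materials.

```python
import sys


# Hypothetical helper: compares a (major, minor) pair against the required one.
def is_supported(version: tuple[int, int], required: tuple[int, int] = (3, 10)) -> bool:
    """Return True when the major.minor version matches the required one."""
    return tuple(version[:2]) == required


if is_supported(sys.version_info[:2]):
    print("Python 3.10.x detected")
else:
    found = f"{sys.version_info.major}.{sys.version_info.minor}"
    print(f"Found Python {found}; the course asks for 3.10.x")
```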
Program Content
Textbook/Notes
Project/Homework Instructions
Schedule
Please check your registration email for the program schedule and Zoom links.
Project/Homework Deadlines