
Practice and Application of Data Science

DSC 80, Winter 2024 at UC San Diego

Suraj Rampure
he/him

rampure@ucsd.edu

Lecture(s): TuTh 3:30-4:50PM, Pepper Canyon Hall 109


Click the 🎥 button to view the recording of a lecture/discussion.
Click the 📝 button to view lecture notebooks after they've been filled in during lecture.

This is the website of a prior offering of DSC 80. The recordings for all lectures are publicly available and can be found below. To see the latest version of DSC 80, go to dsc80.com.

Week 1 - From BabyPandas to Pandas
🚨 Thursday's lecture was on Zoom, so the recording is only available below and not at podcast.ucsd.edu. (If you're curious, it's because Suraj was at a conference.)

Tue Jan 9

LEC 1 Introduction, Data Science Lifecycle

🎥 | Ch. 1

Wed Jan 10

DISC 1 Environment Setup

🎥

Thu Jan 11

PRE 2 Pre-Lecture Reading

LEC 2 DataFrame Fundamentals

🎥 | Ch. 6, 6.1

Week 3 – Messy Data, Statistical Testing

Mon Jan 22

LAB 2 DataFrames and Grouping

Tue Jan 23

LEC 5 Exploratory Data Analysis and Data Cleaning 📝

🎥 | Ch. 9 and 10

Wed Jan 24

DISC 3 Lab 2 Reflection

🎥 | Notebook

Thu Jan 25

PRE 6 Pre-Lecture Reading

LEC 6 Hypothesis Testing; Aside: Fast Permutation Tests

🎥 | Ch. 2

Sat Jan 27

PROJ 1 Gradebook 💯

Week 4 – Missing Values
🚨 Thursday's lecture recording doesn't have audio after ~20 minutes. Instead, you can watch Winter 2023's podcasts. Watch from the 35-minute mark onwards from this video, all of this video, and then the first 11 minutes in this video.

Mon Jan 29

LAB 3 Merging and Pivoting

Tue Jan 30

LEC 7 Missingness Mechanisms 📝

🎥 | A1, A2

Wed Jan 31

DISC 4 Lab 3 Reflection

🎥 | Notebook

Thu Feb 1

LEC 8 Imputation

🎥 | DSP 6.3-6.5

Week 5 – HTTP, Midterm Exam
🚨 Let Suraj know which topics/old exam questions you want him to take up in Tuesday's lecture at q.dsc80.com.

Mon Feb 5

LAB 4 Hypothesis and Permutation Testing

Tue Feb 6

LEC 9 HTTP Basics, Midterm Review 📝

🎥 | Ch. 14.2-14.4

PROJ 2 Loan Applications 💸 (Checkpoint)

Wed Feb 7

DISC 5 Lab 4 Reflection

🎥 | Notebook

Thu Feb 8

EXAM Midterm Exam (in person, during lecture)

Week 6 – Web Data, Text Data

Mon Feb 12

LAB 5 Missing Values and Imputation

Tue Feb 13

LEC 10 Web Scraping 📝

🎥 | Ch. 14.2-14.4

PROJ 2 Loan Applications 💸

Wed Feb 14

DISC 6 Lab 5 Reflection

🎥 | Notebook

Thu Feb 15

LEC 11 Regular Expressions 📝

🎥 | Ch. 13

Sat Feb 17

SUR Mid-Quarter Survey

Week 7 – Text Data, Linear Regression

Tue Feb 20

LEC 12 Text Features

🎥 | Ch. 13.4

Wed Feb 21

LAB 6 HTTP and HTML (due 5PM, no slip days)

DISC 7 Lab 6 Reflection

🎥

Thu Feb 22

LEC 13 Linear Regression 📝

🎥 | Ch. 15.0-15.6

PROJ 3 Language Models 🗣️ (Checkpoint)

Week 8 – Feature Engineering and Generalization

Mon Feb 26

LAB 7 Regular Expressions and Text Data

Tue Feb 27

LEC 14 Feature Engineering

🎥 | Ch. 15.7-15.9

Wed Feb 28

DISC 8 Lab 7 Reflection

🎥

Thu Feb 29

LEC 15 Standardization, Multicollinearity, and Generalization

🎥 | Ch. 16, 17.6

PROJ 3 Language Models 🗣️

Week 10 – Classifier Evaluation, Conclusion

Mon Mar 11

LAB 9 Pipelines

Tue Mar 12

LEC 18 Classifier Evaluation and Model Fairness

🎥 | Ch. 19.5

Wed Mar 13

DISC 10 Lab 9 Reflection

🎥

Thu Mar 14

LEC 19 Review, Conclusion (blank, filled)

🎥

Sat Mar 16

SUR SETs and End-of-Quarter Survey (due 8AM)

Week 11 – Final Exam and Project 5

Tue Mar 19

EXAM Final Exam (3-6PM, Pepper Canyon Hall 109)

Thu Mar 21

PROJ 4 Data Science Lifecycle (no slip days!)