Skip to main content Link Search Menu Expand Document (external link)

Practice and Application of Data Science

DSC 80, Summer Session 2 2024 at UC San Diego

Brendan Tomoschuk
he/him

brtomoschuk@ucsd.edu

Lecture(s): TuTh 5:00-7:50PM (A00), 11:00AM-12:20PM (B00) in Pepper Canyon Hall 120

Podcasts Welcome Survey Extension Request Form

Week 1 - From BabyPandas to Pandas, Dataframes

Tue Aug 6

LEC 1

Introduction, Data Science Lifecycle

Ch. 1

LEC 2

DataFrame Fundamentals

Ch. 6, 6.1

Wed Aug 7

DISC 1

Environment Setup, Exam Prep

Thu Aug 8

LEC 3

Ch. 6.2

LEC 4

Simpson's Paradox, Joining, Transforming

Ch. 6.3-6.5

Fri Aug 9

LAB 1

Week 2 – Messy Data, Statistical Testing, Missing Values

Mon Aug 12

LAB 2

Tue Aug 13

LEC 5

Exploring and Cleaning Data

Ch. 9 and 10

LEC 6

Hypothesis and Permutation Testing

DSC 10 Review Notebook, Ch. 17

Wed Aug 14

PROJ 1

DISC 2

Thu Aug 15

LEC 7

Missingness Mechanisms

Fast Permutation Tests, A1, A2

LEC 8

DSP 6.3-6.5

Fri Aug 16

LAB 3

Week 3 – HTTP, Web data

Mon Aug 19

LAB 4

Ch. 17

Tue Aug 20

EXAM

Midterm Exam (in class)

LEC 9

HTTP Basics

Ch. 14.2-14.4

Wed Aug 21

PROJ 2

DISC 3

Thu Aug 22

LEC 10

Web Scraping

Ch. 14.2-14.4

LEC 11

Regular Expressions

Ch. 13

Fri Aug 23

LAB 5

Week 4 – Text data, Modeling, Feature Engineering

Mon Aug 26

LAB 6

Tue Aug 27

LEC 12

Text Features

Ch. 13.4

LEC 13

Linear Regression

Ch. 15.0-15.6

Wed Aug 28

PROJ 3

DISC 4

Thu Aug 29

LEC 14

Feature Engineering

Ch. 15.7-15.9

LEC 15

Pipelines, Multicollinearity, and Generalization

Ch. 16, 17.6

Fri Aug 30

LAB 7

Week 5 – Modeling in Practice, Evaluating classifiers

Mon Sep 2

LAB 8

Tue Sep 3

LEC 16

Hyperparameters, Cross-Validation, and Decision Trees

Ch. 16

LEC 17

Grid Search, Random Forests, Classifier Evaluation

Wed Sep 4

DISC 5

Slides

Thu Sep 5

LEC 18

Classifier Evaluation and Model Fairness

Ch. 19.5

LEC 19

Career Advice, Review, Conclusion

Fri Sep 6

LAB 9

Sat Sep 7

EXAM

Final Exam

FINAL PROJ