Skip to main content Link Search Menu Expand Document (external link)

Practice and Application of Data Science

DSC 80, Fall 2023 at UC San Diego

Sam Lau

Sam Lau

he/him

lau@ucsd.edu

Lecture: TuTh 3:30-4:50PM, WLH 2005

Dec 6, 2023: The Final Exam will take place on Mon., Dec 11, from 3-6pm in WLH 2005 (our usual lecture room). If 85% of the class fills out both the Student Evaluations of Teaching and the End-of-Quarter Survey before 11:59pm Dec 8, the entire class will get +1% on their Final Exam grade.

Week 0 – No class

Thu Sep 28

NO LECTURE: Sam out of town

Week 1 – From BabyPandas to Pandas

Tue Oct 3

LEC 1 Introduction, Data Science Lifecycle

πŸŽ₯ | Ch. 1

Thu Oct 5

LEC 2 DataFrame Fundamentals

πŸŽ₯ | Ch. 6, 6.1

Fri Oct 6

DISC 1 Environment Setup

πŸŽ₯

Week 2 – Dataframes

Mon Oct 9

LAB 1 Python, NumPy, and Pandas

Tue Oct 10

LEC 3 Aggregating

πŸŽ₯ | Ch. 6.2

Wed Oct 11

PROJ 1 Project 1 checkpoint

Thu Oct 12

LEC 4 Simpson's Paradox, Joining, Transforming

πŸŽ₯ | Ch. 6.3 - 6.5

Fri Oct 13

DISC 2 Lab 1 Reflection

πŸŽ₯

Week 3 – Messy Data, Statistical Testing

Mon Oct 16

LAB 2 More Pandas

Tue Oct 17

LEC 5 Exploring and Cleaning Data

πŸŽ₯ | Ch. 9 and 10

Wed Oct 18

PROJ 1 Project 1

Thu Oct 19

LEC 6 Hypothesis Testing

πŸŽ₯ | Ch 2, Ch 17.0-17.2

Fri Oct 20

DISC 3 Lab 2 Reflection

πŸŽ₯

Week 4 – Missing Values

Mon Oct 23

LAB 3 DataFrame Manipulation

Tue Oct 24

LEC 7 Missingness Mechanisms

πŸŽ₯ | Fast Permutation Tests, A1, A2

Wed Oct 25

PROJ 2 Project 2 checkpoint

Thu Oct 26

LEC 8 Imputation

πŸŽ₯ | DSP 6.3-6.5

Fri Oct 27

DISC 4 Lab 3 Reflection

πŸŽ₯

Week 5 – HTTP

Mon Oct 30

LAB 4 Hypothesis and Permutation Testing

Tue Oct 31

LEC 9 HTTP Basics, Midterm Review

πŸŽ₯ | Ch 14.2-14.4

Wed Nov 1

PROJ 2 Project 2

Thu Nov 2

EXAM Midterm Exam (in class)

Solutions

Fri Nov 3

DISC 5 Lab 4 Reflection

Week 6 – Web data

Mon Nov 6

LAB 5 Missing Values and Imputation

Tue Nov 7

LEC 10 Web Scraping

πŸŽ₯ | Ch 14.2-14.4

Thu Nov 9

LEC 11 Regular Expressions

πŸŽ₯ | Ch 13

Fri Nov 10

NO DISCUSSION: Veteran’s Day

Week 7 – Text data, Modeling

Mon Nov 13

LAB 6 HTTP and HTML

Tue Nov 14

LEC 12 Text Features

πŸŽ₯ | Ch 13.4

Thu Nov 16

LEC 13 Modeling and Regression

πŸŽ₯ | Ch 15.0-15.6

Fri Nov 17

DISC 6 Lab 6 Reflection

PROJ 3 Project 3 (no checkpoint)

Week 8 – Feature Engineering

Mon Nov 20

LAB 7 Regular Expressions and Text Data

Tue Nov 21

LEC 14 Feature Engineering

πŸŽ₯ | Ch 15.7-15.9

Wed Nov 22

PROJ 4 NO Project 4 checkpoint, HAPPY THANKSGIVING!

Thu Nov 23

NO LECTURE: Thanksgiving break

Fri Nov 24

NO DISCUSSION: Thanksgiving break

Week 9 – Modeling in Practice

Mon Nov 27

LAB 8 Modeling and Feature Engineering

Tue Nov 28

LEC 15 Generalization, Cross-Validation

πŸŽ₯ | Ch 16

Thu Nov 30

LEC 16 More Generalization, Decision Trees

πŸŽ₯ | Ch 16

Fri Dec 1

DISC 7 Lab 8 Reflection

PROJ 4 Project 4

Week 10 – Evaluating Classifiers

Mon Dec 4

LAB 9 Pipelines

Tue Dec 5

LEC 17 Random Forests, Classifier Evaluation

Ch 19.5

Thu Dec 7

LEC 18 Classifier Evaluation, Conclusion, Final Review

Ch 19.5

Fri Dec 8

DISC 8 Lab 9 Reflection

Week 11 – Final Exam and Project 5

Mon Dec 11

EXAM Final Exam (3-6pm, WLH 2005)

Solutions

Wed Dec 13

PROJ 5 Project 5