Practice and Application of Data Science
DSC 80, Winter 2023 at UC San Diego
This is the website of a previous offering of DSC 80. To see the most recent offering, go to dsc80.com, and to see other DSC course websites, go to dsc-courses.github.io.
Week 1 – From BabyPandas to Pandas
- Jan 9
LEC 1 Introduction
- Jan 11
LEC 2 DataFrame Fundamentals
- Jan 13
Week 2 – DataFrames
- Jan 16
N/A (MLK Day)
- Jan 18
LEC 4 Grouping
Lab 1 Python, NumPy, and Pandas (due 1/18 at 4PM, no slip days)
- Jan 19
- Jan 20
Week 3 – Messy Data
- Jan 23
LEC 6 Concatenating and Merging
🎥 • Ch. 5.3-5.4
The audio in the podcast died; refer to the podcast from Spring 2022.
Lab 2 More Pandas (due 1/23)
- Jan 25
LEC 7 Relational Algebra, Data Cleaning
- Jan 26
PROJ 1 Gradebook 💯 (due 1/26)
- Jan 27
Week 4 – Statistical Testing and Missing Values
- Jan 30
LEC 9 Hypothesis Testing
Lab 3 Grouping, Pivoting, and Combining (due 1/30)
- Feb 1
LEC 10 Permutation Testing
- Feb 2
- Feb 3
LEC 11 Permutation Testing, Missingness Mechanisms
🎥 • Ch. 6.1-6.2
Extra notebook on fast permutation tests
Week 5 – Missingness Mechanisms and Imputation
- Feb 6
LEC 12 Identifying Missingness Mechanisms
🎥 • Ch. 6.3-6.5
Extra notebook with more missingness examples
- Feb 8
LEC 13 Imputation
🎥 • Ch. 6.3-6.5
- Feb 9
- Feb 10
LEC 14 HTTP Basics
🎥 • Ch. 7.1-7.2
Week 6 – HTTP and HTML, Midterm Exam
- Feb 13
LEC 15 Requests and Parsing HTML
🎥 • Ch. 7.1-7.2
- Feb 15
Exam Midterm Exam (in-person during lecture)
- Feb 17
LEC 16 More Parsing Examples
Week 7 – Text Data
- Feb 20
N/A (Presidents' Day)
- Feb 22
LEC 17 Regular Expressions
🎥 • Ch. 8.1-8.2
Lab 6 HTTP and HTML (due 2/22 at 4PM, no slip days)
- Feb 23
PROJ 3 Exploratory Data Analysis (due 2/23, no checkpoint)
- Feb 24
LEC 18 Regular Expressions, Bag of Words
🎥 • Ch. 8.1-8.2
Week 8 – Modeling and Feature Engineering
- Feb 27
LEC 19 Bag of Words, TF-IDF
- Mar 1
LEC 20 Modeling and Linear Regression
- Mar 2
- Mar 3
LEC 21 Feature Engineering
Week 9 – sklearn Pipelines, Generalization, and Cross-Validation
- Mar 6
LEC 22 Pipelines, Generalization
- Mar 8
LEC 23 Cross-Validation
- Mar 9
- Mar 10
Week 10 – Classifier Evaluation, Fairness Criteria
There won't be a live lecture on Wednesday (capstone showcase) or Friday (Suraj is at a conference); videos for those lectures will be posted in advance. Come to the capstone showcase on Wednesday!
- Mar 13
LEC 25 Grid Search, Multicollinearity, Examples
Lab 9 Pipelines (due 3/13)
- Mar 15
LEC 26 Classifier Evaluation (no live lecture, video only)
- Mar 17
LEC 27 Fairness Criteria, Conclusion (no live lecture, video only)
- Mar 18
SUR End-of-Quarter Survey + CAPEs (due 3/18 at 8AM)