Principles of Data Science
DSC 10, Fall 2023 at UC San Diego
Janine Tiefenbruck (she/her)
Lecture(s): MWF 9-9:50AM (B), 10-10:50AM (A), Mandeville B-210
The Final Exam is on Saturday from 7-10PM. Read this Ed post for more details, and check your assigned room and seat here. In lecture on Wednesday, we will take up the solutions to the Spring 2023 Final Exam, so you should work on it before then.
If at least 85% of the class fills out both SETs and the End-of-Quarter Survey by Saturday at 8AM, then we will add 1% of extra credit to everyone's overall grade. We appreciate your feedback!
The solutions to the Spring 2023 Final Exam have been posted; video walkthroughs of some problems (taken from Wednesday's lectures) can be found at the top.
Week 0 – Welcome to DSC 10!
- Fri Sep 29
LEC 1 Introduction
Keywords: data science, course structure, policies, syllabus, Little Women demo
SUR Welcome Survey
Week 1 – Python Basics
- Mon Oct 2
LEC 2 Expressions and Data Types
Keywords: Jupyter notebooks, expressions, variables, assignment, functions, int, float
- Wed Oct 4
LEC 3 Strings, Lists, and Arrays
Keywords: string methods, mean, median, lists, arrays, array arithmetic
- Fri Oct 6
LEC 4 Arrays and DataFrames
Keywords: array methods, np.arange, .read_csv, .get, .assign, .sort_values, .iloc, .loc, index
- Sat Oct 7
Week 2 – DataFrames and Visualization
- Mon Oct 9
LEC 5 Querying and Grouping
Keywords: .set_index, Booleans, querying, .shape, &, |, .take, .groupby, aggregation
- Wed Oct 11
LEC 6 Grouping and Data Visualization
Keywords: .groupby, numerical vs. categorical, scatter plot, line plot, bar chart
DIS 2 Arrays and DataFrames
QUIZ 1 Solutions
- Thu Oct 12
Lab 1 Arrays and DataFrames
- Fri Oct 13
LEC 7 Distributions and Histograms
Keywords: distributions, density histograms, binning, total area, overlaid plots
- Sat Oct 14
Week 3 – Functions and Control Flow
- Mon Oct 16
LEC 8 Functions and Applying
Keywords: functions, arguments, print vs. return, .apply, .reset_index
- Wed Oct 18
LEC 9 Grouping on Multiple Columns, Merging
Keywords: .groupby([col_1, col_2, ...]), subgroups, MultiIndex, .merge, number of rows
- Thu Oct 19
- Fri Oct 20
LEC 10 Conditional Statements and Iteration
Keywords: in, not, and, or, if, else, elif, for-loops, np.append, accumulator pattern
- Sat Oct 21
Week 4 – Probability and Simulation
- Mon Oct 23
LEC 11 Probability (annotated: 8AM • 1PM)
Keywords: event, conditional prob., multiplication and addition rules, independence
- Wed Oct 25
LEC 12 Simulation
Keywords: np.random.choice, replacement, np.count_nonzero, coin flipping, Monty Hall
DIS 4 DataFrames, Control Flow, and Probability
QUIZ 2 Solutions
- Thu Oct 26
- Fri Oct 27
LEC 13 Midterm Review (annotated: 8AM • 9AM • 10AM • 1PM)
- Sat Oct 28
Week 5 – Midterm Exam
- Mon Oct 30
EXAM Midterm Exam (in registered lecture section)
- Wed Nov 1
LEC 14 Distributions and Sampling
Keywords: probability vs. empirical distribution, SRS, .sample, parameter, statistic
DIS 5 Midterm Exam Walkthrough
- Fri Nov 3
LEC 15 Bootstrapping and Confidence Intervals
Keywords: inference, bootstrapping, resample, np.percentile, confidence interval
Week 6 – Confidence Intervals and the Normal Distribution
- Mon Nov 6
LEC 16 Confidence Intervals, Center, and Spread
Keywords: interpreting CIs, robust vs. sensitive, center, standard deviation, Chebyshev
PROJ Midterm Project: Taylor Swift (see partner guidelines)
- Wed Nov 8
LEC 17 Standardization and the Normal Distribution
Keywords: Chebyshev, standard units, normal distribution, CDF, inflection points
- Thu Nov 9
- Fri Nov 10
No Lecture (Veterans Day)
- Sat Nov 11
HW 4 Simulation, Sampling, Bootstrapping
Week 7 – Central Limit Theorem
- Mon Nov 13
LEC 18 The Central Limit Theorem
Keywords: interpreting CIs, robust vs. sensitive, center, standard deviation, Chebyshev
- Wed Nov 15
LEC 19 Choosing Sample Sizes, Statistical Models
Keywords: standard deviation of 0s and 1s, np.random.multinomial, Robert Swain jury panel
DIS 7 Standardization and the Normal Distribution
QUIZ 3 In Discussion, Covers Lectures 14-17
- Fri Nov 17
LEC 20 Hypothesis Testing
Keywords: null and alternative hypotheses, test statistic, fair or unfair coin
- Sat Nov 18
Week 8 – Hypothesis and Permutation Testing
- Mon Nov 20
LEC 21 Hypothesis Testing and Total Variation Distance
- Tue Nov 21
- Wed Nov 22
LEC 22 TVD, Hypothesis Testing, and Permutation Testing
- Fri Nov 24
No Lecture (Thanksgiving)
Week 9 – Prediction
- Mon Nov 27
LEC 23 Permutation Testing
Lab 6 Hypothesis Testing
- Wed Nov 29
LEC 24 Correlation
DIS 9 Total Variation Distance and Permutation Testing
QUIZ 4 In Discussion, Covers Lectures 18-22
- Fri Dec 1
LEC 25 Regression and Least Squares
Week 10 – Review
- Mon Dec 4
LEC 26 Residuals and Inference
- Tue Dec 5
PROJ Final Project: Meteorites (see partner guidelines)
- Wed Dec 6
LEC 27 Review of the Spring 2023 Final Exam (annotated: 10AM + 1PM)
DIS 10 Regression
- Thu Dec 7
Lab 7 Regression
- Fri Dec 8
LEC 28 Review, Conclusion (review problems: blank)
- Sat Dec 9
EXAM Final Exam (7-10PM, see location here and details here)
SUR SETs and End-of-Quarter Survey (due 8AM)