Principles of Data Science
DSC 10, Winter 2024 at UC San Diego
Janine Tiefenbruckshe/her
Lecture(s): MWF 9-9:50AM (A), 10-10:50AM (B), 11-11:50AM (C) Mandeville B-202
The Final Exam is this Saturday 3/16 from 7-10PM in Catalyst 0125 near Plant Power. Join us for a collaborative study session on Friday 3/15 from 5-8PM in Solis 104.
If at least 75% of the class fills out both SETs and the internal End-of-Quarter Survey, then the entire class will have 1% of extra credit added to their overall grade. The deadline is Saturday 3/16 at 8AM.
Week 1 β Python Basics
- Mon Jan 8
LEC 1 Introduction βοΈ
Keywords: data science, course structure, policies, syllabus, Little Women demo
DISC Getting Started with Jupyter Notebooks
SUR Welcome Survey
- Wed Jan 10
LEC 2 Expressions and Data Types βοΈ
Keywords: Jupyter notebooks, expressions, variables, assignment, functions, int, float
- Fri Jan 12
LEC 3 Strings, Lists, and Arrays βοΈ
Keywords: string methods, mean, median, lists, arrays, array arithmetic
- Sat Jan 13
Week 2 β DataFrames
- Mon Jan 15
No Lecture (MLK Day)
- Wed Jan 17
LEC 4 Arrays and DataFrames βοΈ
Keywords: array methods, np.arange, .read_csv, .get, .assign, .sort_values, .iloc, .loc, index
- Fri Jan 19
LEC 5 Querying and Grouping βοΈ
Keywords: .set_index, Booleans, querying, .shape, &, |, .take, .groupby, aggregation
- Sat Jan 20
LAB 1 Arrays and DataFrames
Week 3 β Data Visualization and Functions
- Mon Jan 22
LEC 6 Grouping and Data Visualization βοΈ
Keywords: .groupby, numerical vs. categorical, scatter plot, line plot, bar chart
QUIZ 1 Quiz 1 covers Lectures 1-4
- Wed Jan 24
LEC 7 Distributions and Histograms βοΈ
Keywords: distributions, density histograms, binning, total area, overlaid plots
- Thu Jan 25
- Fri Jan 26
LEC 8 Functions and Applying βοΈ
Keywords: functions, arguments, print vs. return, .apply, .reset_index
Week 4 β Control Flow and Probability
- Mon Jan 29
LEC 9 Grouping on Multiple Columns, Merging βοΈ
Keywords: .groupby([col_1, col_2, β¦]), subgroups, MultiIndex, .merge, number of rows
QUIZ 2 Quiz 2 covers Lectures 5-7
- Tue Jan 30
- Wed Jan 31
LEC 10 Conditional Statements and Iteration βοΈ
Keywords: in, not, and, or, if, else, elif, for-loops, np.append, accumulator pattern
- Thu Feb 1
- Fri Feb 2
LEC 11 Probability (blank, 9AM, 10AM, 11AM)
Keywords: event, conditional prob., multiplication and addition rules, independence
- Sat Feb 3
Week 5 β Simulation, Sampling, and Confidence Intervals
- Mon Feb 5
LEC 12 Simulation βοΈ
Keywords: np.random.choice, replacement, np.count_nonzero, coin flipping, Monty Hall
QUIZ 3 Quiz 3 covers Lectures 8-11
- Wed Feb 7
LEC 13 Distributions and Sampling βοΈ
Keywords: probability vs. empirical distribution, SRS, .sample, parameter, statistic
- Thu Feb 8
- Fri Feb 9
LEC 14 Bootstrapping and Confidence Intervals βοΈ
Keywords: inference, bootstrapping, resample, np.percentile, confidence interval
Week 6 β Midterm Exam and the Normal Distribution
- Mon Feb 12
EXAM Midterm Exam
DISC Exam Solutions Review
- Wed Feb 14
LEC 15 Confidence Intervals, Center, and Spread βοΈ
Keywords: interpreting CIs, robust vs. sensitive, center, standard deviation, Chebyshev
- Thu Feb 15
PROJ Midterm Project
- Fri Feb 16
LEC 16 Standardization and the Normal Distribution βοΈ
Keywords: Chebyshev, standard units, normal distribution, CDF, inflection points
- Sat Feb 17
Week 7 β Central Limit Theorem
- Mon Feb 19
No Lecture (Presidentsβ Day)
- Wed Feb 21
LEC 17 The Central Limit Theorem βοΈ
Keywords: distribution of the sample mean, square root law, CLT-based CIs
- Thu Feb 22
- Fri Feb 23
LEC 18 Choosing Sample Sizes, Statistical Models βοΈ
Keywords: standard deviation of 0s and 1s, np.random.multinomial, Robert Swain jury
Week 8 β Hypothesis and Permutation Testing
- Mon Feb 26
LEC 19 Hypothesis Testing βοΈ
Keywords: null and alternative hypotheses, test statistic, fair or unfair coin
QUIZ 4 Quiz 4 covers Lectures 13-17
- Tue Feb 27
- Wed Feb 28
LEC 20 Hypothesis Testing and Total Variation Distance βοΈ
Keywords: fair or unfair coin, p-value, midterm exam scores, Alameda County jury, TVD
- Thu Feb 29
- Fri Mar 1
LEC 21 TVD, Hypothesis Testing, and Permutation Testing βοΈ
Keywords: confidence intervals for hypothesis testing, body temperature, smoking/babies
- Sat Mar 2
LAB 6 Hypothesis Testing
Week 9 β Prediction
- Mon Mar 4
LEC 22 Permutation Testing βοΈ
Keywords: smoking and birth weight, np.random.permutation, shuffling, Deflategate
QUIZ 5 Quiz 5 covers Lectures 18-21 (excluding Permutation Testing)
- Wed Mar 6
LEC 23 Correlation βοΈ
Keywords: association, correlation coefficient (r), predicting heights, regression line (su)
- Thu Mar 7
- Fri Mar 8
LEC 24 Regression and Least Squares βοΈ Recording π₯
Keywords: regression line in original units, outliers, errors, RMSE, best fit, least squares
Week 10 β Review
- Mon Mar 11
LEC 25 Residuals and Inference βοΈ
Keywords: residuals, residual plots, patterns, datasaurus dozen, prediction intervals
QUIZ 6 Quiz 6 covers Lectures 21-24
- Tue Mar 12
PROJ Final Project
- Wed Mar 13
- Thu Mar 14
LAB 7 Regression
- Fri Mar 15
LEC 27 Review, Conclusion βοΈ - Blank - Annotated 9AM, 10AM, 11AM
STUDY Collaborative Study Session (5-8PM in Solis 104)
- Sat Mar 16
EXAM Final Exam (7-10PM)
SUR SETs and End-of-Quarter Survey (due 8AM)