Data 100: Principles and Techniques of Data Science

UC Berkeley, Summer 2024

Ed Datahub Gradescope Lectures Playlist Emergency Accommodations Office Hours Queue

Kevin Miao

Kevin Miao

He/Him/His

kevinmiao@berkeley.edu

Office Hours: Mondays & Fridays from 11:00AM to 12:00PM in Evans 455

Maya Shen

Maya Shen

She/Her/Hers

mayashen@berkeley.edu

Office Hours: Tuesdays & Thursdays 11:00AM to 12:00PM in Evans 455

Schedule

Week 1

June 17
Lecture 1 Course Overview
Note 1
Lab 1 Prerequisite Coding, Plotting, and Permutation (due 6/20)
June 18
Lecture 2 Pandas I
Note 2
Homework 1A Plotting and Permutation Tests (due 6/20)
Homework 1B Prerequisite Math (due 6/20)
Discussion 1 Prerequisites (virtual walkthrough only)
Solutions
June 19
Juneteenth
June 20
Lecture 3 Pandas II
Note 3
Lab 2 Pandas (due 6/23)
June 21
Lecture 4 Pandas III
Note 4
Homework 2 Food Safety I (due 6/24)

Week 2

Week 3

July 1
Lecture 9 Sampling
Note 9
Lab 5 Transformations (due 7/3)
Discussion 4 Visualization and Transformation
Solutions
July 2
Lecture 10 Modeling and SLR
Note 10
Homework 5 Bike Sharing (due 7/4)
July 3
No Discussion
July 4
No Lecture
Homework 6A Sampling (due 7/8)
Homework 6B Modeling (due 7/8)
July 5
No Lecture

Week 4

Week 5

Week 6

July 22
Lecture 18 Estimators, Bias, and Variance
Note 18
Lab 10 Probability (due 7/24)
Discussion 9 Cross-Validation and Regularization
Solutions
July 23
Lecture 19 Parameter Inference and Bootstrapping
Note 19
Homework 8 Probability and Estimators (due 7/25)
July 24
Discussion 10 Random Variables, Bias, and Variance
Solutions
July 25
Lecture 20 SQL
Note 20, Note 21
Lab 11 SQL (due 7/28)
July 26
Lecture 21 Logistic Regression I
Note 22
Homework 9 SQL (due 7/29)

Week 7

July 29
Lecture 22 Logistic Regression II
Note 23
Lab 12 Logistic Regression (due 7/31)
Project B1 Spam and Ham I (due 8/1)
Discussion 11 SQL (Worksheet Notebook)
Solutions
July 30
Lecture 23 Decision Trees
Note 27
July 31
Discussion 12 Logistic Regression
Solutions
August 1
Lecture 24 PCA
Note 24, Note 25
Lab 13 PCA (due 8/4)
Project B2 Spam and Ham II (due 8/5)
August 2
Lecture 25 Clustering
Note 26
Homework 10 PCA and Decision Trees (Extra Credit) (due 8/5)

Week 8

August 5
Lecture 26 Intro to Diffusion Models & Conclusion
Lab 14 Clustering (due 8/7)
Discussion 13 PCA and Clustering
Solutions
August 6
Lecture 27 Industry/Academia Roundtable Discussion: Pioneering Pathways: Data Science, AI and Career
August 7
Discussion 14 Final Review
August 8
Final Exam Final