Data 100: Principles and Techniques of Data Science

UC Berkeley, Spring 2025

Ed Datahub Gradescope Lectures Playlist Additional Accommodations Office Hours Queue

Welcome to Week 5 of Data 100!

Lectures will be webcast at: https://berkeley.zoom.us/j/97347722542.

Schedule

Week 1

Tue Jan 21
Lecture 1 Introduction
Note 1
Lecture Participation 1 Lecture Participation 1
Wed Jan 22
Lab 1 Prerequisite Coding (due Tue Jan 28)
Homework 1 Prerequisite Math, Coding, and Syllabus Quiz (due Thu Jan 30)
Thu Jan 23
Lecture 2 Pandas I
Note 2
Lecture Participation 2 Lecture Participation 2

Week 2

Tue Jan 28
Lecture 3 Pandas II
Note 3
Lecture Participation 3 Lecture Participation 3
Discussion 1 Prerequisites
Mini-Lecture, Solutions, Walkthrough
Thu Jan 30
Lecture 4 Pandas III
Note 4
Lecture Participation 4 Lecture Participation 4
Lab 2A Pandas (due Tue Feb 4)
Exam Prep 1 Pandas
Solutions, Walkthrough
Fri Jan 31
Homework 2A Food Safety (due Thu Feb 6)
Drop Deadline (w/o fee) 11:59 PM

Week 3

Tue Feb 4
Lecture 5 Data Cleaning and EDA
Note 5
Lecture Participation 5 Lecture Participation 5
Discussion 2 Pandas I
Mini-Lecture, Notebook, Groupwork Notebook, Regular Walkthrough, Groupwork Walkthrough, Solutions, Notebook Solutions, Groupwork Notebook Solutions
Thu Feb 6
Lecture 6 Regex
Note 6
Lecture Participation 6 Lecture Participation 6
Lab 2B Data Cleaning and EDA (due Tue Feb 11)
Exam Prep 2 Pandas and EDA
Solutions, Walkthrough
Fri Feb 7
Homework 2B Food Safety II (due Thu Feb 13)
Add Deadline (w/o fee) 11:59 PM

Week 4

Tue Feb 11
Lecture 7 Visualization I
Note 7
Lecture Participation 7 Lecture Participation 7
Discussion 3 Pandas II, EDA
Mini-Lecture, Solutions, Walkthrough
Wed Feb 12
Drop Deadline (w/ fee) 11:59 PM
Thu Feb 13
Lecture 8 Visualization II
Note 8
Lecture Participation 8 Lecture Participation 8
Lab 3 Regex, EDA (due Tue Feb 18)
Exam Prep 3 RegEx
Solutions, Walkthrough
Fri Feb 14
Homework 3 Text Analysis of New York Times Articles (due Thu Feb 20)

Week 5

Tue Feb 18
Lecture 9 Sampling
Note 9
Lecture Participation 9 Lecture Participation 9
Discussion 4 Regex, Visualization, and Transformation
Mini-Lecture, Notebook, Solutions, Notebook Solutions, Walkthrough
Thu Feb 20
Lecture 10 Modeling, SLR
Note 10
Lecture Participation 10 Lecture Participation 10
Lab 4 Transformations (due Tue Feb 25)
Exam Prep 4 Data Visualization
Solutions, Walkthrough
Fri Feb 21
Homework 4 Bike Sharing (due Thu Feb 27)
Add/Drop Deadline for Graduate Students 11:59 PM

Week 6

Tue Feb 25
Lecture 11 Constant Model, Loss, and Transformations
Lecture Participation 11 Lecture Participation 11
Discussion 5 Transformations, Sampling, and SLR
Thu Feb 27
Lecture 12 OLS (Multiple Regression)
Lecture Participation 12 Lecture Participation 12
Lab 5 Modeling, Summary Statistics, Loss Functions, and OLS (due Tue Mar 4)
Fri Feb 28
Homework 5 Modeling and OLS (due Thu Mar 6)

Week 7

Tue Mar 4
Lecture 13 Gradient Descent / Sklearn
Lecture Participation 13 Lecture Participation 13
Discussion 6 Models, OLS
Thu Mar 6
Lecture 14 Feature Engineering
Lecture Participation 14 Lecture Participation 14
NO Lab
Fri Mar 7
NO Homework
Midterm Review

Week 8

Tue Mar 11
No Lecture
Discussion 7 Gradient Descent
Wed Mar 12
Midterm Exam Midterm (8-10 PM)
Thu Mar 13
Lecture 15 Case Study (HCE): CCAO
Lecture Participation 15 Lecture Participation 15
Lab 6 Gradient Descent / Sklearn (due Tue Mar 18)
Fri Mar 14
Project A1 Housing I (due Thu Mar 20)

Week 9

Tue Mar 18
Lecture 16 Cross-Validation and Regularization
Lecture Participation 16 Lecture Participation 16
Discussion 8 Feature Engineering, Housing
Thu Mar 20
Lecture 17 Random Variables
Lecture Participation 17 Lecture Participation 17
Lab 7 Model Selection (due Tue Apr 1)
Fri Mar 21
Project A2 Housing II (due Thu Apr 3)

Week 10

Mon Mar 24
Spring Break
Tue Mar 25
Spring Break
Wed Mar 26
Spring Break
Thu Mar 27
Spring Break
Fri Mar 28
Spring Break

Week 11

Tue Apr 1
Lecture 18 Estimators, Bias, and Variance
Lecture Participation 18 Lecture Participation 18
Discussion 9 Cross-Validation and Regularization
Thu Apr 3
Lecture 19 Parameter Inference & the Bootstrap
Lecture Participation 19 Lecture Participation 19
Lab 8 Probability (due Tue Apr 8)
Fri Apr 4
Homework 6 Probability (due Thu Apr 10)
Grade Option Change 11:59 PM

Week 12

Tue Apr 8
Lecture 20 SQL I
Lecture Participation 20 Lecture Participation 20
Discussion 10 RVs, Bias, and Variance
Thu Apr 10
Lecture 21 SQL II
Lecture Participation 21 Lecture Participation 21
Lab 9 SQL (due Tue Apr 15)
Fri Apr 11
Homework 7 SQL (due Thu Apr 17)

Week 13

Tue Apr 15
Lecture 22 Logistic Regression I
Lecture Participation 22 Lecture Participation 22
Discussion 11 SQL
Thu Apr 17
Lecture 23 Logistic Regression II
Lecture Participation 23 Lecture Participation 23
Lab 10 Logistic Regression (due Tue Apr 22)
Fri Apr 18
Project B1 Spam & Ham I (due Thu Apr 24)

Week 14

Tue Apr 22
Lecture 24 PCA I
Lecture Participation 24 Lecture Participation 24
Discussion 12 Logistic Regression
Thu Apr 24
Lecture 25 PCA II
Lecture Participation 25 Lecture Participation 25
Lab 11 PCA (due Tue Apr 29)
Fri Apr 25
Project B2 Spam & Ham II (due Thu May 1)

Week 15

Tue Apr 29
Lecture 26 Clustering
Lecture Participation 26 Lecture Participation 26
Discussion 13 PCA + Clustering
Thu May 1
Lecture 27 LLMs + Guest + Closing
Lecture Participation 27 Lecture Participation 27
Lab 12 Clustering (due Tue May 6)

Week 16

Mon May 5
RRR
Tue May 6
RRR
Final Review TBD
Wed May 7
RRR
Thu May 8
RRR
Fri May 9
RRR

Week 17

Fri May 16
Final Exam Final (11:30-2:30 PM)