Data 100: Principles and Techniques of Data Science

UC Berkeley, Spring 2025

Ed Datahub Gradescope Additional Accommodations Office Hours Queue

Narges Norouzi

Narges Norouzi

She/Her

norouzi@berkeley.edu

Office Hours: Tuesdays from 11:00AM to 1:00PM in Soda 775

Josh Grossman

Josh Grossman

He/Him

jdgg@berkeley.edu

Office Hours: Mondays from 1:00PM to 3:00PM in Evans 421

Welcome to Week 1 of Data 100!

Lectures will be webcast at: https://berkeley.zoom.us/j/97347722542.

Schedule

Week 1

Tue Jan 21
Lecture 1 Introduction
Note 1
Lecture Participation 1 Lecture Participation 1
Wed Jan 22
Lab 1 Prerequisite Coding (due Tue Jan 28)
Homework 1 Prerequisite Math (due Thu Jan 30)
Thu Jan 23
Lecture 2 Pandas I
Lecture Participation 2 Lecture Participation 2

Week 2

Tue Jan 28
Lecture 3 Pandas II
Lecture Participation 3 Lecture Participation 3
Discussion 1 Prerequisites
Thu Jan 30
Lecture 4 Pandas III
Lecture Participation 4 Lecture Participation 4
Lab 2A Pandas (due Tue Feb 4)
Fri Jan 31
Homework 2A Food Safety (due Thu Feb 6)
Drop Deadline (w/o fee) 11:59 PM

Week 3

Tue Feb 4
Lecture 5 Data Cleaning and EDA
Lecture Participation 5 Lecture Participation 5
Discussion 2 Pandas I
Thu Feb 6
Lecture 6 Regex
Lecture Participation 6 Lecture Participation 6
Lab 2B Data Cleaning and EDA (due Tue Feb 11)
Fri Feb 7
Homework 2B Food Safety II (due Thu Feb 13)
Add Deadline (w/o fee) 11:59 PM

Week 4

Tue Feb 11
Lecture 7 Visualization I
Lecture Participation 7 Lecture Participation 7
Discussion 3 Pandas II, EDA
Wed Feb 12
Drop Deadline (w/ fee) 11:59 PM
Thu Feb 13
Lecture 8 Visualization II
Lecture Participation 8 Lecture Participation 8
Lab 3 Regex, EDA (due Tue Feb 18)
Fri Feb 14
Homework 3 Text Analysis of Bloomberg Articles (due Thu Feb 20)

Week 5

Tue Feb 18
Lecture 9 Sampling
Lecture Participation 9 Lecture Participation 9
Discussion 4 Regex, Visualization, and Transformation
Thu Feb 20
Lecture 9 Modeling, SLR
Lecture Participation 10 Lecture Participation 10
Lab 4 Transformations (due Tue Feb 25)
Fri Feb 21
Homework 4 Bike Sharing (due Thu Feb 27)
Add/Drop Deadline for Graduate Students 11:59 PM

Week 6

Tue Feb 25
Lecture 11 Constant Model, Loss, and Transformations
Lecture Participation 11 Lecture Participation 11
Discussion 5 Probability, Sampling, and Visualization
Thu Feb 27
Lecture 12 OLS (Multiple Regression)
Lecture Participation 12 Lecture Participation 12
Lab 5 Modeling, Summary Statistics, and Loss Functions (due Tue Mar 4)
Fri Feb 28
Homework 5A Modeling (due Thu Mar 6)

Week 7

Tue Mar 4
Lecture 13 Gradient Descent / Sklearn
Lecture Participation 13 Lecture Participation 13
Discussion 6 Models
Thu Mar 6
Lecture 14 Feature Engineering
Lecture Participation 14 Lecture Participation 14
Lab 6 OLS (due Tue Mar 11)
Fri Mar 7
Homework 5B OLS (due Thu Mar 13)

Week 8

Tue Mar 11
No Lecture
Discussion 7 OLS, Gradient Descent
Wed Mar 12
Midterm Exam Midterm (8-10 PM)
Thu Mar 13
Lecture 15 Case Study (HCE): CCAO
Lecture Participation 15 Lecture Participation 15
Lab 7 Gradient Descent / Sklearn (due Tue Mar 18)
Fri Mar 14
Project A1 Housing I (due Thu Mar 20)

Week 9

Tue Mar 18
Lecture 16 Cross-Validation and Regularization
Lecture Participation 16 Lecture Participation 16
Discussion 8 Feature Engineering, Housing
Thu Mar 20
Lecture 17 Random Variables
Lecture Participation 17 Lecture Participation 17
Lab 8 Model Selection (due Tue Apr 1)
Fri Mar 21
Project A2 Housing II (due Thu Apr 3)

Week 10

Mon Mar 24
Spring Break
Tue Mar 25
Spring Break
Wed Mar 26
Spring Break
Thu Mar 27
Spring Break
Fri Mar 28
Spring Break

Week 11

Tue Apr 1
Lecture 18 Estimators, Bias, and Variance
Lecture Participation 18 Lecture Participation 18
Discussion 9 Cross-Validation and Regularization
Thu Apr 3
Lecture 19 Parameter Inference & the Bootstrap
Lecture Participation 19 Lecture Participation 19
Lab 9 Probability (due Tue Apr 8)
Fri Apr 4
Homework 6 Probability (due Thu Apr 10)
Grade Option Change 11:59 PM

Week 12

Tue Apr 8
Lecture 20 SQL I
Lecture Participation 20 Lecture Participation 20
Discussion 10 RVs, Bias, and Variance
Thu Apr 10
Lecture 21 SQL II
Lecture Participation 21 Lecture Participation 21
Lab 10 SQL (due Tue Apr 15)
Fri Apr 11
Homework 7 SQL (due Thu Apr 17)

Week 13

Tue Apr 15
Lecture 22 Logistic Regression I
Lecture Participation 22 Lecture Participation 22
Discussion 11 SQL
Thu Apr 17
Lecture 23 Logistic Regression II
Lecture Participation 23 Lecture Participation 23
Lab 11 Logistic Regression (due Tue Apr 22)
Fri Apr 18
Project B1 Spam & Ham I (due Thu Apr 24)

Week 14

Tue Apr 22
Lecture 24 PCA I
Lecture Participation 24 Lecture Participation 24
Discussion 12 Logistic Regression
Thu Apr 24
Lecture 25 PCA II
Lecture Participation 25 Lecture Participation 25
Lab 12 PCA (due Tue Apr 29)
Fri Apr 25
Project B2 Spam & Ham II (due Thu May 1)

Week 15

Tue Apr 29
Lecture 26 Clustering
Lecture Participation 26 Lecture Participation 26
Discussion 13 PCA + Clustering
Thu May 1
Lecture 27 LLM + Guest + Closing
Lecture Participation 27 Lecture Participation 27
Lab 13 Clustering (due Tue May 6)

Week 16

Mon May 5
RRR
Tue May 6
RRR
Wed May 7
RRR
Thu May 8
RRR
Fri May 9
RRR

Week 17

Fri May 16
Final Exam Final (11:30-2:30 PM)