Course schedule, 2020/2021
The (Rmarkdown) source code for these lectures is available at the github repository,
or by replacing the .slides.html
suffix with .Rmd
in the link below;
the slides are made using reveal.js.
Below:
Fall 2020
- Week 1 (9/29)
-
Overview of data science - description and estimation, uncertainty and simulation, with examples for comparing means and linear regression; smoothing.
- Slides
- topic slides: the t distribution
- topic slides: the central limit theorem and the Normal distribution
- Homework 1 (due 10/8)
- how to git the slide source
- Reading: Quinn & Keough chapters 1-4
- Week 2 (10/6)
-
Analysis of Variance (ANOVA) and experimental design; tidy data; power and false positives
- slides: confidence intervals
- slides: ANOVA
- slides: experimental design
- Homework 2 (due 10/15)
- Reading: Quinn & Keough chapters 5, 7, 8
- Week 3 (10/13)
-
Plotting/visualization, and permutation/bootstrapping
- slides: permutation tests and pdf version
- slides: tidy data and pdf version
- slides: visualizing data and pdf version
- slides: the bootstrap and pdf version
- Homework 3 (due 10/22)
- Reading: Quinn & Keough chapter 9
- Week 4 (10/20)
-
Multivariate ANOVA, regression, least-squares likelihood
- slides: multivariate ANOVA and pdf version
- slides: formulas and pdf version
- Homework 4 (due 10/30)
- Reading: Quinn & Keough chapter 6
- Week 5 (10/27)
-
Model selection; random effects and mixed models - a first look.
- slides: Linear models
- slides: Model comparison
- slides: Random effects
- Homework: peer review
- Reading: Quinn & Keough chapter 13
- Week 6 (11/3)
-
Multiple testing, error rates, and some history.
- slides: Multiple testing
- slides: Statistics and Eugenics
- install Stan
- Homework 6 (due 11/12)
- Week 7 (11/10)
-
Introduction to Bayesian statistics
- slides: Prior distributions and uncertainty
- slides: Probability rules
- slides: the Beta distribution
- slides: Sampling from the posterior with Markov chain Monte Carlo
- Reading: Kruschke, chapters 1, 2, 4, 5, 6, 7
- Homework 7 (due 11/19)
- Week 8 (11/17)
-
Bayesian hierarchical modeling - shrinkage, and sharing power
- slides: Hierarchical models: adding levels of randomness
- slides: Hierarchical models: Baseball data
- Reading: Kruschke, chapters 9, 10
- Homework 8 (due 12/03)
- Week 9 (11/24)
-
Logistic models, and sharing power
- slides: The logistic model
- slides: Sharing power and shrinkage
- slides: The Gamma and Exponential distributions
- Reading: Kruschke, chapters 13, 21
- no new homework this week (catch up on reading?)
- Week 10 (12/1)
-
Robust linear models; Generalized Linear Models (GLMs).
- slides: Fitting linear models, robustly
- slides: Generalized Linear Models
- slides: Summary and wrap-up
- slides: Poisson linear models
- slides: The Cauchy distribution
- slides: The Poisson distribution
- slides: Matrix multiplication
- Homework 9 (due 12/10)
- Reading: Kruschke, chapters 15, 16, 17
Winter 2021
- Week 11 (1/4)
-
Survival analysis and introductiom to
brms
- slides: Survival curves
- slides: Cox’s Proportional Hazards
- slides: the Weibull distribution
- slides: Parametric survival analysis
- slides: Introduction to brms
- Homework 11 (due 1/14)
- Week 12 (1/11)
-
Time series: temporal autocorrelation, autoregressive models; mechanistic models
- slides: Time series
- slides: Missing data and imputation
- slides: Trends, smoothing, autocorrelation, and cycles
- Homework 12 (due 1/21)
- Week 13 (1/18)
-
Categorical data: chi-square for contingency tables, permutation tests.
- slides: The chi-squared test for categorical data
- slides: Permutation testing for categorical data
- slides: Poisson models for categorical data (using brms)
- slides: The chi-squared distribution
- Homework 13 (group homework, due 1/27)
- Reading: Kruschke, chapter 16 (metric data with one or two groups), and chapter 24 (Poisson, contingency tables)
- Week 14 (1/25)
-
Crossvalidation for model comparison; sparsifying priors and variable selection
- slides: Crossvalidation and overfitting
- slides: Overdispersion
- slides: Reparameterization
- slides: R interlude: indexing
- Reading: Kruschke, chapters 17 (one-variable linear models), 18 (multivariate linear models)
- Week 15 (2/1)
-
Many response variables
- slides: The multivariate normal distribution
- slides: Multivariate responses
- Primer on linear algebra
- Rmd file for the primer
- Homework 14 (group homework, due 2/9)
- Week 16 (2/8)
-
Data analysis example
- slides: Hurricane lizards
- Group presentations
- Week 17 (2/15)
-
Factor analysis, dimensionality reduction, and visualization; clustering; PCA, PCoA, MDS, t-SNE, UMAP
- slides: Hurricane lizards, continued
- slides: Dimension reduction and PCA
- Homework 17 (due 2/19)
- Homework 18 (due 2/25)
- Week 18 (2/22)
-
Latent factors, deconvolution for mixtures of expression data; nonnegative matrix factorization
- slides: t-SNE
- slides: On ordination and dimension reduction methods
- slides: Nonnegative matrix factorization
- Homework 19 (due 3/4)
- Week 19 (3/1)
-
Deconvolution continued; introduction to spatial statistics and spatial autocorrelation.
- slides: Spatial autocorrelation
- slides: Spatial mapping
- Homework 20 (due 3/16)
- Week 20 (3/8)
-
Spatial statistics: kernel density estimation and interpolation.
- slides: Spatial density estimation
- slides: Spatial smoothing, and splines
- slides: Looking back: review