# Assignments

## Statistics 240, spring 2023

There will be approximately five problem sets, two coding projects involving open-source code contributions, and a longer term project involving data analysis.

The longer project involves re-analyzing the data from a published paper that used parametric methods inappropriately
or used inappropriate nonparametric methods,
applying appropriate nonparametric methods instead.
Please start looking for a paper that interests you right away.


## Schedule

All assignments are due at 11:59pm Pacific Time unless indicated otherwise.

| Assignment    | Due date |
|--------------|-----------:|
|[Problem set 1](./Hw/ps01-background.ipynb): math background | 1/30 (Monday)|
|[Problem set 2](./Hw/ps02-binary-experiments.ipynb): confidence bounds for the average treatment effect in binary experiments | 2/12 (Sunday) |
| [Problem set 3](./Hw/ps03-permute-rank.ipynb): permutation tests, rank-based tests, and simulating $P$-values | 2/26 (Sunday) |
|[Computational project 1](./Hw/cp1-tests.ipynb): unit tests for `permute` | 3/12 (Sunday)| 
| [Problem set 4](./Hw/ps-04-permute-distance.ipynb): permutation tests using the Kolmogorov statistic and its generalization to arbitrary VC classes | 3/19 (Sunday) |
|[Computational project 2](./Hw/cp2-function.ipynb): new functionality for `permute` | 4/16 (Sunday) |
| [Term project](./Hw/tp01.ipynb) | 5/5 (Friday) | 


1. Weeks 1-3: Review and inference about binary populations
    + If you have never logged in to https://github.berkeley.edu, please do so as soon as possible so I can add you to the class organization.
    + If you do not already have an ORCID, sign up for one at https://orcid.org/. 
    + Review:
        - [mathematical foundations](../math-foundations.ipynb)
        - [mathematical inequalities](../math-inequalities.ipynb)
        - [discrete probability](../prob.ipynb)
    + Binomial and hypergeometric tests and confidence intervals
        - [testing](../tests.ipynb)
        - [duality between tests and confidence sets](../duality.ipynb)
        - [confidence sets, tailoring the rejection region, bootstrap percentile CIs](../confidence-sets.ipynb)
        - [binomial and hypergeometric simulations; comparison with the normal approximation](../binom.ipynb)
    + Randomized, controlled experiments
        - [Fisher's exact test](../fisher-exact.ipynb)
        - [Causal inference, Neyman's potential outcomes model, experiments with binary outcomes](../causal-inference.ipynb)
        - Li, X. and P. Ding, 2016. Exact confidence intervals for the average causal effect on a binary outcome, _Statistics in Medicine_, _35_, 6, 957-960. https://doi.org/10.1002/sim.6764
        - Aronow, P.M., H. Chang, and P. Lopatto, 2022?. Fast computation of exact confidence intervals for randomized experiments with binary outcomes. https://lopat.to/permutation.pdf
    + [Inference about binary populations from stratified samples: Wright's method, Wendell & Schmee's method, and union-intersection tests using greedy optimization](../strat-binary.ipynb)
    + [The SPRT for Bernoulli trials](../sprt.ipynb)
        

1. Weeks 4-6: Permutation tests
    + [introduction to permutation tests](../permute-intro)
    + [PRNGs and simulations](../pseudo-random.ipynb)
    + [simulation and computational permutation tests](../permute-sample.ipynb)
    + [classical permutation tests based on ranks](../permute-classical.ipynb)
    + [tests of randomness and independence](../permute-independence.ipynb)
    + Stark, P.B., and K. Ottoboni, 2018. Random Sampling: Practice Makes Imperfect.
    + Ivanova, A., S. Lederman, P.B. Stark, G. Sullivan, and B. Vaughn, 2022. Randomization tests in clinical trials with multiple imputation for handling missing data, _Journal of Biopharmaceutical Statistics_, _32_, 441-449. https://doi.org/10.1080/10543406.2022.2080695
    + [permutation tests based on the distance between the empirical and the null](../permute-distance.ipynb)
    + [combining tests](../tests-combo.ipynb)
    + [E-values](../E-values.ipynb)
    
1. Weeks 7-8: Supermartingale-based tests
    + Reading:
        - [martingales](../martingales.ipynb)
        - nonnegative supermartingales and Ville's inequality
        - examples: additive, multiplicative, likelihood ratios, prior-posterior ratio, betting
        - the SPRT
    + Inference from stratified samples
        - combining tests using $E$-values: multiplication and averaging
        - stratified inference using union-intersection tests
        - product supermartingales: ALPHA and Sweeter than SUITE 

1. Weeks 9-10: Inference about bounded populations
    + Kaplan, H., 1987. A Method of One-Sided Nonparametric Inference for the Mean of a Nonnegative Population, _The Amer. Statistician_, _41_, 157-158. https://www.tandfonline.com/doi/abs/10.1080/00031305.1987.10475470?journalCode=utas20
    + Stark, P.B., 2023. ALPHA: Audit that Learns from Previously Hand-Audited ballots, _Ann. Appl. Stat._, https://www.e-publications.org/ims/submission/AOAS/user/submissionFile/54812?confirm=3a9dc0d4
    + Vovk, V., and R. Wang, 2021. $E$-values: Calibration, combination, and applications, http://alrw.net/e/02.pdf
    + Waudby-Smith, I. and A. Ramdas, 2022. Estimating means of bounded random variables by betting, https://arxiv.org/abs/2010.09686

1. Weeks 11-13: Betting and $E$-values
    + more nonnegative martingales
    + combining $E$-values
    + multiple testing
    + the False Discovery Rate
    + controlling FWER and FDR using $E$-values

1. Weeks 14-15: Conformal prediction 

