--- title: "Randomization" author: "Fill In Your Name" date: '`r format(Sys.time(), "%d %B %Y")`' bibliography: ../learningdays-book.bib biblio-style: apalike link-citations: yes colorlinks: yes fig_caption: yes header-includes: | \setbeamertemplate{footline}{\begin{beamercolorbox}{section in head/foot} \includegraphics[height=.5cm]{../Images/egap-logo.png} \hfill \insertframenumber/\inserttotalframenumber \end{beamercolorbox}} \usepackage{tikz} \usepackage{tikz-cd} \usepackage{textpos} \usepackage{booktabs,multirow,makecell} output: # powerpoint_presentation: # toc: true beamer_presentation: keep_tex: true toc: true pandoc_args: [ "--toc" ] fig_caption: true revealjs::revealjs_presentation: fig_caption: true theme: default highlight: pygments center: false transition: fade smart: false self_contained: false reveal_plugins: ["notes", "search", "chalkboard"] pandoc_args: [ "--toc" ] reveal_options: slideNumber: true previewLinks: true chalkboard: theme: whiteboard toggleNotesButton: false --- ```{r setup, include=FALSE,echo=FALSE} source('rmd_setup.R') ``` # Randomization ## Randomization of treatment assignment Start with a group units who you can directly observe: like the people who showed up to a lab, villages in a region, respondents willing to take a survey. - Every observation has a known probability of treatment assignment between 0 and 1. - No unit in the experimental sample is assigned to treatment or to control with certainty. - Units can vary in their probability of treatment assignment. - For example, the probability might vary by group: women might have a 75\% probability of being assigned to treatment, while men have 25\%. - Assignment probabilities can even vary across units **as long as you know the probability for each and every unit**, though that would complicate your analysis. ## Random assignment $\neq$ random sampling - Random sampling (from population): selecting subjects into your sample from a population with known probability. You cannot directly observe the whole population so you draw a sample. - Randomization (of treatment): assigning subjects from an existing group of subjects with known probability to experimental conditions. - You need to be able to see outcomes and assign treatment for every subject in the group - This random assignment of treatment can be combined with any kind of sample (random sample, convenience sample, etc.) or even with the whole population (all the hospitals, all the judges). ## Random assignment $\neq$ random sampling - How you recruit your initial group (or experimental pool) matters: - a big initial group means a big experiment and more statistical power to detect small effects. - an initial group which is a random sample from a known population helps you make the argument that your effects might be the same or similar if you did this experiment with another sample from that population. - an experimental pool that is a random sample might also help with the argument that the effect should be similar if you scaled up the intervention to the whole population --- depending on the factors driving a global equilibrium. ## Randomization is closely linked to your research design - See module on [Research Design](/research-design.html) - Remember that you need to define and justify your control condition: - this may be no intervention - or a placebo ## Level of treatments and outcomes - Treatment can be assigned at different levels: individuals, groups, institutions, communities, time periods, or many different levels. - You may be constrained in what level you can _assign_ treatment and _measure_ outcomes. - Example: Treatment at the classroom level, but outcomes at the student level. - Example: Treatment at the district level, but outcomes at the community level. - The level at which treatment is assigned and at which outcomes are measured affect what your study can demonstrate. # Some common types of randomization ## Simple randomization (coin-flipping) - For each unit, flip a coin to see if it will be treated. Then you measure outcomes at the same level as the coin. - The coins don’t have to be fair (50-50), but you have to know the probability of treatment assignment. - You can’t guarantee a specific number of treated units and control units. - Example: If you have 6 units and you flip a fair coin for each, you have about a 3% chance of assigning **all** units to treatment or assigning **all** units to control. ## Example code for simple randomization I ```{r echo=TRUE} # set the random number seed to make this replicable set.seed(12345) # set a sample size N <- 200 # Generate the simple random assignment # (Notice that in an experiment we have a single # trial and thus size=1) # Our object with N people total is called simple.ra simple.ra <- rbinom(n = N,size = 1, prob = .5) # 112 people ended up in the treatment group sum(simple.ra) ``` ## Example code for simple randomization II ```{r echo=TRUE} # you can also use the randomizr package library(randomizr) # for replicability set.seed(23456) # Simple random assignment uses the simple_ra function # Our object with N people total is called treatment treatment <- simple_ra(N = N, # total sample size prob = .5 # probability of receiving treatment ) sum(treatment) ``` ## Complete randomization (drawing from an urn) - A fixed number $m$ out of $N$ units are assigned to treatment. - The probability a unit is assigned to treatment is $m/N$. - This is like having an urn or bowl with $N$ balls, of which $m$ are marked as treatment and $N-m$ are marked as control. Public lotteries use this method. ## Example code for complete randomization I ``` {r echo=TRUE} # set sample size N N <- 200 # set number of treated units m m <- 100 # create a vector of m 1's and N-m 0's complete.ra <- c(rep(1,m), rep(0,N-m)) # And then scramble it randomly using sample() # The default is sampling without replacement set.seed(12345) # for replicability complete.ra <- sample(complete.ra) sum(complete.ra) ``` ## Example code for complete randomization II ``` {r echo=TRUE} # you can also use the randomizr package library(randomizr) # for replicability set.seed(23456) # Complete random assignment: treatment <- complete_ra(N = 200, # total sample size m = 100) # number to assign to treatment sum(treatment) # note what happens if you don't specify m! ``` ## Block (or stratified) randomization I - We create blocks of units and randomize separately within each block. We are doing mini-experiments in each block. - Example: block = district, units = communities. We randomize treatment at the community level **within district** and also measure outcomes at the community level. - Blocks that represent a substantively meaningful subgroup can help you to learn about how effects might differ by subgroup. - By controlling number of subjects per subgroup, you ensure that you have enough subjects in each group. - Especially useful when you have a rare group --- by chance you might get very few of them in treatment or control even under random assignment (or you might have some imbalance). ## Block (or stratified) randomization II - Blocks that are homogeneous on a given outcome increase precision of estimation for that outcome as compared with the experiment without blocks. (We will address this more in the power analysis section). ## Cluster randomization I - A cluster is a **group of units**. In a cluster-randomized study, all units in the cluster are assigned to the same treatment status. - Use cluster randomization if the intervention has to work at the cluster level. - For example, if the intervention is about school playgrounds, then the school is the unit of assignment even if student health may be an outcome measured at level of individual students. - Having fewer clusters hurts your ability to detect treatment effects and make cause misleading $p$-values and confidence intervals (or even estimates). *How much* depends on the intra-cluster correlation (ICC or $\rho$). ## Cluster randomization II - Higher $\rho$ is worse: - When $\rho=0$ then the village doesn't matter for the behavior of the individuals. - When $\rho=1$ then every person in the village would give exactly the same answer. Having another person from this village doesn't give you additional information since his outcome is identical to the person you already had. - For the same number of units, having **more clusters** with fewer units per cluster can help. - Trade off spillover and power. - If you would not like an experiment with 10 units, then you should not be happy with an experiment with 10 clusters of 100 units. The effective sample size of this cluster randomized experiment is between 10 and 10 $\times$ 100 = 1000, but closer to 10 the higher the $\rho$. ## You can combine blocks and clusters - You can have clusters within blocks. - Example: block = district, cluster = communities, units = individuals. You are measuring outcomes at the individual level. - Example: block = province, cluster = district, units = communities. You are measuring outcomes at the community level. - You can't have blocks within clusters. - For block and cluster randomization, you can use `block_ra` and `cluster_ra` in the `randomizr` package in R. - For more complicated designs, you might find `DeclareDesign` helpful. () # Best practices ## Best practices: replicability - EGAP Methods Guide on Randomization () - Set a seed and save your code and random assignment column - Verify - Sometimes increased transparency => replicability ## Best practices: balance - Check overall balance with a D-square test using `xBalance` in the `RItools` package (@hansen:bowers:2008) (large sample randomization inference): ```{r echo=TRUE, eval=FALSE} xBalance(treatment~x1+x2,data=dat,report='chisquare') ``` - See also the `coin` package `independence_test` for permutation based version - Use an F-test for a regression of treatment assignment on LHS and covariates on RHS (large sample approximate to randomization inference): ```{r echo=TRUE, eval=FALSE} anova(lm(treatment~1,data = dat), lm(treatment~x1+x2+x3,data=dat), test = 'F') ``` ## Best practices: balance - Random assignment gives us, in expectation, **overall balance** on the many covariates. It does not guarantee that all covariate to treatment relationships will be zero. In fact, in a small experiment, the magnitudes of imbalance may be high even if the randomization occurred perfectly. - You will see t-tests of covariates one by one. Just by chance, you might get statistically significant differences on a variable. If you check balance on 100 variables, you will reject the null of no relationship in 5 of them even if there truly is no relationship. # Some experimental designs ## Access - Randomly select a treatment group through a lottery or equivalent mechanism, which randomizes **access** to the program. - Useful when you do not have enough resources to treat everyone. - Sometimes, some units (peoples, communities) must have access to a program. - For example: a partner organization doesn’t want to risk a vulnerable community NOT getting a program (want a guarantee that they will be always be treated). - You can exclude those units from the experiment, and do random assignment among the remaining units that have a probability of assignment strictly between (and not including) 0 and 1. ## Delayed access (Phase-in or wait list) - Randomize *timing* of access to the program. - Often you do not have the capacity to implement the treatment in a lot of places at once. - When an intervention can be or must be rolled out in stages, you can randomize the order in which units are treated. - Your control group are the as-yet untreated units. - Be careful: the probability of assignment to treatment will vary over time because units that are assigned to treatment in earlier stages are not eligible to be assigned to treatment in later stages. ## Factorial or crossed-assignment - Factorial design enables testing of more than one treatment. - You can analyze one treatment at a time. - Or combinations thereof. - Example: \begin{table} \begin{tabular}{r|c|c} & $X_1=0$ & $X_1=1$ \\ \hline $X_2=1$ & A & C \\ \hline $X_2=0$ & B & D \\ \hline \end{tabular} \end{table} We might focus on an estimand like $\mathbb{E}[Y(X_1=1, X_2=1)]-\mathbb{E}[Y(X_1=0, X_2=0)]$. ## Encouragement - Randomize **encouragement** to take the treatment, such as an invitation or subsidy to participate in a program. - Useful when you cannot force a subject to participate. - Estimands: - the ATE of the encouragement for your experimental sample. - the ATE of participation (not the encouragement) for the units who would participate when encouraged and wouldn't participate when not encouraged (compliers). - Instrumental variables analysis for the complier ATE, with the assignment as the instrument. Note the exclusion restriction. # Some limits ## Limits to randomized studies - Ethics --- is this sort of manipulation ethical? Sometimes not. - Must be done in real time, ahead of the intervention roll-out. - Reduced flexibility for a partner organization (problem for any prospective evaluation). - Limits to the size of the experimental pool. - Cost. - The power constraint --- you need a lot of units (problem for many statistical approaches). - Violations of the key assumptions (spillovers; violation of the second key assumption for [Causal Inference](causalinference.html)). - External validity (problem for any evaluation and social science in general). ## References