3.9 Summary of important R code

The main components of R code used in this chapter follow with components to modify in lighter and/or ALL CAPS text, remembering that any R packages mentioned need to be installed and loaded for this code to have a chance of working:

  • MODELNAME <- lm(Y~X, data=DATASETNAME)

    • Probably the most frequently used command in R.

    • Here it is used to fit the reference-coded One-Way ANOVA model with Y as the response variable and X as the grouping variable, storing the estimated model object in MODELNAME. Remember that X should be defined as a factor variable.

  • MODELNAME <- lm(Y~X-1, data=DATASETNAME)

    • Fits the cell means version of the One-Way ANOVA model.
  • summary(MODELNAME)

    • Generates model summary information including the estimated model coefficients, SEs, t-tests, and p-values.
  • anova(MODELNAME)

    • Generates the ANOVA table but must only be run on the reference-coded version of the model.

    • Results are incorrect if run on the cell-means model since the reduced model under the null is that the mean of all the observations is 0!

  • pf(FSTATISTIC, df1=NUMDF, df2=DENOMDF, lower.tail=F)

    • Finds the p-value for an observed \(F\)-statistic with NUMDF and DENOMDF degrees of freedom.
  • par(mfrow=c(2,2)); plot(MODELNAME)

    • Generates four diagnostic plots including the Residuals vs Fitted and Normal Q-Q plot.
  • plot(allEffects(MODELNAME))

    • Requires the effects package be loaded.

    • Plots the estimated model component.

  • Tm2 <- glht(MODELNAME, linfct=mcp(X=“Tukey”)); confint(Tm2); plot(Tm2); summary(Tm2); cld(Tm2)

    • Requires the multcomp package to be installed and loaded.

    • Can only be run on the reference-coded version of the model.

    • Generates the text output and plot for Tukey’s HSD as well as the compact letter display information.