---
title: 'Linguistic Data: Quantitative Analysis and Visualisation'
author: "Ilya Schurov, Olga Lyashevskaya, George Moroz, Alla Tambovtseva"
date: "09 February 2018"
output:
  html_document:
    df_print: paged
  html_notebook: default
  pdf_document: default
subtitle: 'ANOVA: analysis of variance'
---

Load the data on Icelandic:

```{r}
phono <- read.csv("http://math-info.hse.ru/f/2018-19/ling-data/icelandic.csv")
```

Look at the groups of consonants:

```{r}
# number of observations in each consonant group
table(phono$cons1)
```

Create a boxplot of vowel duration for each group of consonants:

```{r}
boxplot(phono$vowel.dur ~ phono$cons1)
```

Perform ANOVA:

```{r}
# one-way ANOVA: vowel duration explained by consonant group
res <- aov(phono$vowel.dur ~ phono$cons1)
res
```

A more informative summary:

```{r}
# H0: there is no difference in population means between the groups
summary(res)
```

**Question:** judging by the output above, can we conclude that the average vowel duration differs significantly between the groups of consonants?
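
To answer the question, it can help to pull the F statistic and the p-value out of the ANOVA table instead of reading them off the printed summary. The chunk below is a minimal sketch on simulated toy data: the data frame `toy`, its columns `group` and `dur`, and the helper `anova_p_value` are hypothetical names introduced for illustration, not part of the course materials. The 0.05 threshold is the conventional significance level, assumed here.

```{r}
# Hypothetical helper: extract the p-value of the first term
# from a one-way ANOVA fitted with aov().
anova_p_value <- function(fit) {
  tab <- summary(fit)[[1]]   # the ANOVA table for the fitted model
  tab[["Pr(>F)"]][1]         # first row is the grouping factor
}

# Simulated toy data (not the Icelandic data): three groups,
# the third one with a clearly larger mean.
set.seed(1)
toy <- data.frame(
  group = factor(rep(c("a", "b", "c"), each = 20)),
  dur   = c(rnorm(20, mean = 100, sd = 10),
            rnorm(20, mean = 100, sd = 10),
            rnorm(20, mean = 130, sd = 10))
)

fit <- aov(dur ~ group, data = toy)
p <- anova_p_value(fit)

# Reject H0 (equal population means) at the 0.05 level if p < 0.05
p < 0.05
```

Assuming `res` was fitted as above, `anova_p_value(res)` should return the value shown in the `Pr(>F)` column of `summary(res)`. A small p-value only says that at least one group mean differs; it does not say which groups differ.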