---
title: "Figure S1 Timeseries"
author: "Mads Albertsen"
date: "`r format(Sys.time(), '%d-%m-%Y')`"
output: html_document
---
## Load packages
```{r Load_packages, message=FALSE, warning=FALSE, results='hide'}
library("ampvis")
```
## Load data
```{r load_data}
data(DNAext_1.0)
```
## Subset to the relevant dataset
All samples are subset to 25.000 reads and then only OTUs which are seen at least 10 / 25000 times in a single sample is kept for further ordination analysis.
```{r subset, message=FALSE}
time <- subset_samples(V13, Exp.time == "YES") %>%
rarefy_even_depth(sample.size = 25000, rngseed = 712) %>%
filter_taxa(function(x) max(x) >= 10, TRUE)
```
## Figure S1A: PCA colored by Sampling Date
PCA with square root transformed OTU abundances. The effect of sampling from different tanks is tested using the envfit function in vegan (permutation test).
```{r PCA}
pca <- amp_ordinate(data = time,
plot.color = "Date",
plot.point.size = 3,
plot.theme = "clean",
envfit.factor = "Date",
envfit.show = F,
output = "complete")
```
Plot the data. It looks like there is significant groupings.
```{r pca_plot, fig.align='center', fig.height=3, fig.width=3}
pca$plot +
theme(legend.position = "none")
```
```{r save_S1A, eval=FALSE}
ggsave("plots/S1A.eps", width = 55, height = 55, units = "mm")
```
The model reports a p-value of `r pca$eff.model$factors$pvals`, hence there is an effect of time.
```{r pca_model}
pca$eff.model
```
## Figure S1B: Cluster analysis of beta diversity using Bray-Curtis
The Bray-Curtis dissimilarity index is used as an alternative method to test for significant groupings in the dataset.
```{r beta}
beta <- amp_test_cluster(data = time,
group = "Date",
method = "bray",
plot.color = "Date",
plot.label = "Date",
plot.theme = "clean")
```
Using adonis we also find a significant effect of time as the p-value is `r beta$adonis$aov.tab$"Pr(>F)"[1]`.
```{r beta_adonis}
beta$adonis
```
Clustering the data show a very nice grouping by time. Except the two first time-points two weeks appart all time-points cluster separately.
```{r beta_cluster, fig.align='center', fig.height=3, fig.width=4}
beta$plot_cluster +
theme(legend.position = "none")
```
```{r save_S1B, eval=FALSE}
ggsave("plots/S1B.eps", width = 60, height = 55, units = "mm")
```
## Difference between samples only weeks appart
```{r subset_weeks, message=FALSE}
time_weeks <- subset_samples(time, Date %in% c("2012-10-17", "2012-10-31", "2012-11-14")) %>%
filter_taxa(function(x) max(x) >= 10, TRUE)
```
```{r PCA_weeks}
pca_weeks <- amp_ordinate(data = time_weeks,
plot.color = "Date",
plot.point.size = 3,
plot.theme = "clean",
envfit.factor = "Date",
envfit.show = F,
output = "complete"
)
```
Plot the data. It looks like there is significant groupings even when looking at a time-scale of weeks.
```{r pca_weeks_plot, fig.align='center', fig.height=3, fig.width=4}
pca_weeks$plot
```
The model reports a p-value of `r pca_weeks$eff.model$factors$pvals`, showing there is an significant differnce between samples taken only weeks appart.
```{r rep_pca_model}
pca_weeks$eff.model
```
## Difference between samples only weeks using beta diversity
The Bray-Curtis dissimilarity index is used as an alternative method to test for significant groupings in the dataset.
```{r beta_time_week}
beta_weeks <- amp_test_cluster(data = time_weeks,
group = "Date",
method = "bray",
plot.color = "Date",
plot.label = "Date",
plot.theme = "clean")
```
Using adonis we also find a significant effect of weekly sampling as the p-value is `r beta_weeks$adonis$aov.tab$"Pr(>F)"[1]`.
```{r beta_adonis_week}
beta_weeks$adonis
```
Looking at the clustering it seems like the we can't distinguish between samples from the two first timepoints (2012-10-17 and 2012-10-31). However, they are quite different from the timpoint 2 weeks later.
```{r beta_cluster_week, fig.align='center', fig.height=3, fig.width=4}
beta_weeks$plot_cluster
```
## Variation compared to another WWTP
```{r subset_plant, message=FALSE}
time_plant <- subset_samples(V13, Exp.time == "YES" | Plant == "AAE") %>%
rarefy_even_depth(sample.size = 25000, rngseed = 712) %>%
filter_taxa(function(x) max(x) >= 10, TRUE)
```
The major difference is plants and the second axis explain the differences related to the time-series.
```{r PCA_plant, fig.align='center', fig.height=4, fig.width=5}
amp_ordinate(data = time_plant,
plot.color = "Date",
plot.shape = "Plant",
plot.point.size = 3,
plot.theme = "clean"
)
```
The Bray-Curtis dissimilarity index is used as an alternative method to test for significant groupings in the dataset.
```{r beta_time_plant}
beta_plant <- amp_test_cluster(data = time_plant,
group = "Date",
method = "bray",
plot.color = "Date",
plot.label = c("Date","Plant"),
plot.theme = "clean")
```
```{r beta_time_plant_cluster, fig.height=3, fig.width=4, fig.align='center'}
beta_plant$plot_cluster
```