---
title: "Conditional Autoregressive Models"
output: pdf_document
---

```{r setup, include=FALSE}
knitr::opts_chunk$set(message = FALSE)
library(tidyverse)
library(mnormt)
library(spdep)
set.seed(03202021)
```


## Gaussian Model

Suppose the full conditionals are specified as

\vfill

Then using Brooks' Lemma, the joint distribution is
$$p(y_1, \dots, y_n) \propto \exp \left(-\frac{1}{2}\boldsymbol{y}^T D^{-1} (I - B) \boldsymbol{y} \right),$$
where $B$ is a matrix with entries $b_{ij}$ and D is a diagonal matrix with diagonal elements $D_{ii} = \tau_i^2$.

\vfill

The previous equation suggests a multivariate normal distribution,

\vfill


\vfill


\vfill

\newpage

Now the full conditional distribution can be written as
$$Y_i|y_j, j\neq i \sim N \left(\sum_j w_{ij} y_j / w_{i+}, \tau^2 / w_{i+} \right)$$

\vfill

Similarly the joint distribution is now
$$p(y_1, \dots, y_n) \propto \exp \left(-\frac{1}{2 \tau^2}\boldsymbol{y}^T  (D_w - W) \boldsymbol{y} \right)$$
where $D_w$ is a diagonal matrix with diagonal entries $(D_w)_{ii} = w_{i+}$ 

\vfill

The joint distribution can also be re-written as
$$p(y_1, \dots, y_n) \propto \exp \left(-\frac{1}{2 \tau^2} \sum_{i \neq j} w_{ij} (y_i - y_j)^2\right)$$

\vfill

However, both these formulations results in an improper distribution. This could be solved with a constraint, such as $Y_i = 0$.

\vfill

The result is the joint distribution is improper, despite proper full conditional distributions. 

\vfill

\newpage

## IAR

The IAR cannot be used to model data directly, rather this is used a prior specification 

\vfill

The impropriety can be remedied by defining a parameter $\rho$ such that


\vfill


\vfill

With or without $\rho,$ $p(\boldsymbol{y})$ (or the Bayesian posterior when the CAR specification is placed on the spatial random effects) is proper.

\vfill

When using $\rho$, the full conditional becomes 

\vfill

## Simultaneous Autoregression Model

Rather than specifying the distribution on $\boldsymbol{Y}$, as in the CAR specification, the distribution can be specified for $\boldsymbol{\epsilon}$ which induces a distribution for $\boldsymbol{Y}$.

\vfill


\vfill


\vfill

\newpage
## SAR Model

If the matrix $(I - B)$ is full rank, then 
$$\boldsymbol{Y} \sim N \left(\boldsymbol{0},(I - B)^{-1} \tilde{D} ((I - B)^{-1})^T \right)$$

\vfill

If $\tilde{D} = \sigma^2 I$, then $\boldsymbol{Y} \sim N \left(\boldsymbol{0},\sigma^2 \left[(I - B)  (I - B)^T \right]^{-1} \right)$

\vfill

#### Choosing B

There are two common approaches for choosing B

\vfill


\vfill


\vfill

SAR Models are often introduced in a regression context, where the residuals $(\boldsymbol{U})$ follow a SAR model.

\vfill

Let $\boldsymbol{U} = \boldsymbol{Y} - X \boldsymbol{\beta}$ and then $\boldsymbol{U} = B \boldsymbol{U} + \boldsymbol{\epsilon}$ which results in
$$\boldsymbol{Y} = B \boldsymbol{Y} + (I-B) X \boldsymbol{\beta} + \boldsymbol{\epsilon}$$

\vfill

Hence the model contains a spatial weighting of neighbors $(B \boldsymbol{Y})$ and a regression component $((I-B) X \boldsymbol{\beta} )$.

\vfill


\vfill


\vfill