Model-based Geostatistics for Global Public Health
Lancaster University
Pearson Residuals: \[\frac{y_i - \hat{y}_i}{\sqrt{\text{Var}(\hat{y}_i)}}\]
Deviance Residuals: \[\text{sign}(y_i - \hat{y}_i) \times \sqrt{d_i}\] where \(d_i\) is the deviance contribution of observation \(i\).
Random Effects Residuals: \(\hat{Z}_i\), i.e. the estimated random effect for each location/household/cluster/village.
The Matérn family includes two important special cases:
Definition: Distance (\(h^*\)) where correlation drops to 0.05 (5%)
General Solution: \[ \rho(h^*) = 0.05 \] Solve for \(h^*\) to find practical range
Exponential Covariance Example: \[ \rho(h^*) = \exp\left(-\frac{h^*}{\phi}\right) = 0.05 \] \[ \Rightarrow -\frac{h^*}{\phi} = \log(0.05) \approx -3 \] \[ \Rightarrow h^* \approx 3\phi \]
The variogram measures spatial dependence: \[ \gamma(h) = \frac{1}{2} \mathbb{E} \left[\left\{S(x) + Z(x) - S(x') - Z(x')\right\}^2\right] \]
Where \(Z(x)\) is Gaussian noise with mean 0 and variance \(\tau^2\)
The expression of the variogram changes to \[ \gamma(h) = \tau^2 + \sigma^2 \left( 1 - C(h) \right), h = ||x-x'|| \] . . .
\[ Y_i = d(x_i)\top \beta + S(x_i) + U_i \]
where: \(d(x_i)\) are covariates, \(S(x_i)\) is a stationary and isotropic spatial Gaussian process and \(U_i\) is Gaussian noise.
\[ \mathbf{Y} \sim \text{MVN}\left(D\beta, \Sigma\right) \]
where the covariance matrix is given by
\[ \Sigma = \sigma^2 R(\phi) + \tau^2 I_n. \]
\[ \ell(\theta) = -\frac{1}{2} \left[ n \log(2\pi) + \log |\Sigma| + (\mathbf{Y} - D\beta)^T \Sigma^{-1} (\mathbf{Y} - D\beta) \right]. \]
The likelihood function for parameters \(\theta = (\beta, \sigma^2, \phi, \tau^2)\) is given by:
\[ L(\theta) = \int N(W; D\beta, \Omega) f(y| W) dW \]
where \(\Omega = \sigma^2 R(\phi) + \tau^2I_n\).
We approximate this using Monte Carlo integration:
\[ L_m(\theta) = \frac{1}{B} \sum_{j=1}^{B} \frac{N\left(W^{(j)}; D\beta, \Omega \right)}{N\left(W^{(j)}; D\beta_0, \Omega_0\right)} \] where \(W^{(i)}\) are sampled from the distribution of \(W\) given \(y\) using an MCMC algorithm.