University of Birmingham
Lund University
Antigen
A molecule (often a protein or polysaccharide) from a pathogen that is recognized by the immune system and can trigger an immune response.
Antibody
A protein produced by B cells that specifically binds to an antigen, reflecting current or past exposure to the pathogen.

Denning DW, Kilcoyne A, Ucer C (2020). Non-infectious status indicated by detectable IgG antibody to SARS-CoV-2.
British Dental Journal, 229:521–524. Link to article
Infections with many asymptomatic cases
malaria, dengue, chikungunya, Zika
Acute infections with short diagnostic windows
SARS-CoV-2, influenza, yellow fever
Diseases with repeated or cumulative exposure
malaria, schistosomiasis, soil-transmitted helminths, onchocerciasis
Chronic infection or elimination settings
trachoma, lymphatic filariasis, onchocerciasis
Not informative when cell-mediated immunity is dominant
tuberculosis, leishmaniasis
Seronegativity
Absence of detectable antibody response to a given antigen.
Seropositivity
Antibody concentration exceeding a predefined assay-specific threshold, interpreted as evidence of prior exposure or immunity.
Common approaches to serostatus classification
\[ f(y) = \pi_0 \mathcal{N}(y ; \mu_0, \sigma_0^2) + \pi_1 \mathcal{N}(y ; \mu_1, \sigma_1^2), \quad \pi_0 + \pi_1 = 1 \]


Classical antibody acquisition model
(Yman et al., 2016) \[
\mathbb{E}[Y;a] = f(a) = \mu_0 + (\mu_1 - \mu_0)\{1 - \exp(-r a)\},
\] with age \(a\) and acquisition rate \(r > 0\).
Interpretation in the latent sero-reactivity framework \[ \mathbb{E}[T;a] = \frac{f(a)-\mu_0}{\mu_1-\mu_0} = 1 - \exp(-r a), \]
Resulting expectation of the antibody level \[ \mathbb{E}[Y;a] = \mu_0 + (\mu_1-\mu_0)\,\mathbb{E}[T;a]. \]
| Parameter | Estimate | SD | 2.5% | 50% | 97.5% |
|---|---|---|---|---|---|
| Distribution of Y | T | |||||
| μ₀ | -3.427 | 0.077 | -3.626 | -3.470 | -3.311 |
| μ₁ | 0.790 | 0.016 | 0.774 | 0.808 | 0.838 |
| σ₀ | 0.606 | 0.041 | 0.537 | 0.609 | 0.695 |
| σ₁ | 0.114 | 0.022 | 0.077 | 0.122 | 0.164 |
| Distribution of T (age < 15) | |||||
| α₂ | 1.337 | 0.074 | 1.248 | 1.341 | 1.563 |
| λ | 0.224 | 0.032 | 0.173 | 0.227 | 0.299 |
| Distribution of T (age ≥ 15) | |||||
| ϕ | 3.423 | 0.259 | 3.078 | 3.520 | 4.117 |
| η₁ | -0.096 | 0.124 | -0.399 | -0.097 | 0.110 |
The estimate of \(\lambda\) suggests that on average it takes about 4.5 years for individuals to reach higher levels of sero-reactivity.
| Parameter | Mean | SD | 2.5% | 50% | 97.5% |
|---|---|---|---|---|---|
| Distribution of Y | T | |||||
| μ0 | -4.481 | 0.042 | -4.567 | -4.479 | -4.404 |
| μ1 | 1.255 | 0.026 | 1.205 | 1.255 | 1.307 |
| log σ0 | -0.677 | 0.043 | -0.764 | -0.676 | -0.599 |
| log σ1 | -5.716 | 0.536 | -6.892 | -5.666 | -4.874 |
| Distribution of T | |||||
| α0 | 0.093 | 0.036 | 0.025 | 0.093 | 0.166 |
| γ1 | 0.277 | 0.011 | 0.256 | 0.277 | 0.297 |
| β0 | 0.755 | 0.037 | 0.684 | 0.754 | 0.827 |
| δ1 | 0.110 | 0.018 | 0.075 | 0.110 | 0.145 |
| τcp | 11.623 | 0.406 | 11.004 | 11.667 | 12.197 |
| δ2 | -0.061 | 0.010 | -0.080 | -0.061 | -0.042 |
🔗 giorgistat.github.io
📧 e.giorgi@bham.ac.uk
📍 BESTEAM, Department of Applied Health Sciences, University of Birmingham
Two complementary inference approaches are used:
Conditional on the latent immune state \(T\), antibody concentrations are Gaussian with mean and variance interpolating between low and high sero-reactivity extremes.
The marginal density of \(Y\) is obtained by integrating out \(T\): \[
f(y;\boldsymbol\theta,\boldsymbol\psi)=\int_0^1
\phi\!\left(y;(1-t)\mu_0+t\mu_1,(1-t)\sigma_0^2+t\sigma_1^2\right)
\,g_T(t;\boldsymbol\psi)\,dt.
\]
Exact maximum likelihood is based on \[ \ell(\boldsymbol\theta,\boldsymbol\psi)=\sum_{i=1}^n\log f(y_i;\boldsymbol\theta,\boldsymbol\psi), \] but direct maximisation is computationally intensive due to repeated numerical integration.
To reduce computation, data are summarised into a histogram.
Let \(\widehat f_j = n_j/(n\Delta_j)\) be the empirical density in bin \(j\), and approximate model probabilities by evaluation at bin midpoints: \[
p_j(\boldsymbol\theta,\boldsymbol\psi)\approx f(m_j;\boldsymbol\theta,\boldsymbol\psi)\Delta_j.
\]
Parameters are estimated by minimising an \(L_2\) distance between empirical and model densities: \[ Q(\boldsymbol\theta,\boldsymbol\psi)=\sum_{j=1}^J\{\widehat f_j-f(m_j;\boldsymbol\theta,\boldsymbol\psi)\}^2. \] This yields a robust minimum-distance estimator and is computationally efficient when \(J\ll n\).
