-
Notifications
You must be signed in to change notification settings - Fork 2
Home
- Biases and variation in methodology for key population data which vary by country. Survey estimates have more comparable methodology but depending on KP features (for example proportion in households included in survey sampling frame) may have varying bias. See working paper "Laga - Mapping the population size of female sex worker in countries across sub-Saharan Africa"
- Survey question "did you have sex in exchange for money or goods" has been critisied. Likely too broad with regard to key population of female sex workers
- Only the most recent round of DHS has this survey question (change was made in 2013 and started to be implemented in 2015, although this may vary by country with some countries still not using this question). Alternative question regards last three sexual partners
- Other sources of data about key populations
- The UNAIDS Key Population Atlas
- Johnston et al. (2021, preprint) Deriving and interpreting population size estimates for adolescent and young key populations at higher risk of HIV transmission: men who have sex with men, female sex workers and transgender persons
- Disaggregates the UNAIDS published population size estimates by age using proportion of sexually active adults
- Kinh is a coauthor
- Warns that the estimates should be seen as expert opinion rather than based on data
- Several countries had no data
- Rounding up when the number is too small
- Laga et al. (2021, preprint) Mapping female sex worker prevalence (aged 15-49 years) in sub-Saharan Africa
- Has code and data
- Jeff is a coauthor
- Variation across countries may be implausibly large
- Uses study type random effects but implementation differences even within studies belonging to the same group likely to be large
- Possible approaches
- Move all
sexpaid12m
intosexnonreg
, then get the FSW estimates from other data sources. Is there a way to integrate this data in a coherent way? - Use the
sexpaid12m
data to learn the spatial pattern and the other data sources to learn the level
- Move all
- Is there a coherent way to use existing estimates? Penalise distance from existing estimates equivalent to placing a prior on estimate?
- Sounds similar to Bayesian melding, but implementing Bayesian melding is intractable for all but the simplest models
- Can we get distributions or standard errors on the existing estimates? Not for Johnston
- Work of Jon Wakefield / Taylor Okonek on calibration of estimates?
- Other possible data source on men who paid for sex
- See Hodgkins et al. (2021, preprint)
- The proportion of men who pay for sex (CFSW) can be estimated from the data, and then this can be linked to the proportion of FSW by some model like
p_{CFSW} = B * p_{FSW}
where a strongly informative prior is placed onB
(around 10 say)
-
Fully Bayesian benchmarking of small area estimation models (Zhang and Bryant, 2020)
- Zhang and Bryant have quite a few papers which look interesting
- Which logit to use SE question
- The Poisson transform for unnormalised statistical models slides by Chopin
- Nested logit model from EPFL MOOC
- DHS Recode manual
- Multinomial Response Models by Rodriguez
- Ordinal Regression case study by Betancourt
- Poisson GLMs and the Multinomial model lecture notes from Cambridge
- Online lecture material from PennState
-
orderly
documentation - Example using survey weight in multinomial model, where they put the weights in the log-likelihood
- How to use
rdhs
-
Separable models using the
group
option from Bayesian inference with INLA by Virgilio Gómez-Rubio - Gaussian Kronecker product Markov random fields presentation by Andrea Riebler
- Grouped models presentation by Daniel Simpson
- Primer on crashing INLA models
- Thread on multinomial logit models in Stan
- ...you might like to give a talk about how priors are useful for modelling spatial data but we certainly would not hold you to that
- Some topics in inference with
R-INLA
- The AIDS Data Repository
- KP Atlas
- Bayesian and frequentist approaches to multinomial count models in ecology
- A tour of regression models for explaining shares
- Bayesian spatial and spatio-temporal approaches to modelling dengue fever: a systematic review
- Dependent Multinomial Models Made Easy: Stick Breaking with the Polya-Gamma Augmentation
- A tutorial in spatial and spatio-temporal models with
R-INLA
makemyprior
svydesign
- Multinomial to Poisson transformation
- A tutorial in spatial and spatio-temporal models with
R-INLA
- Overview of the 2020-2022 Allocations and Catalytic Investments
survey
package- UN Inter-agency Group for Child Mortality Estimation and their report on Subnational Under-five Mortality Estimates, 1990–2019
- Splines in Stan
- 2025 AIDS targets
- Thirty Years of The Network Scale-up Method
- Indicators for monitoring the 2016 Political Declaration on Ending AIDS
- End Inequalities. End AIDS. Global AIDS Strategy 2021–2026
- Constraints in Knorr-Held Type IV
R-INLA
's group argument allows specifying Gaussian Kronecker product random fields with covariance given as the Kronecker product of between group and within group covariance matrices.
If A (m x n) and B (p x q) are matrices then their Kronecker product C (pm x qn) is the block matrix
C = [a_11 B ... a_1n B]
[... ... ...]
[a_m1 B ... a_mn B]
Within-group is controlled by f(), and between group is controlled by the control.group argument. See https://becarioprecario.bitbucket.io/inla-gitbook/ch-temporal.html#sec:spacetime.
Often the group argument is used to define (separable) spatiotemporal covariance structures. Following e.g. Blangiardo and Cameletti (2015), let delta_it be spatio-temporal interaction random effects. Knorr-Held (2000) present four ways to specify the structure matrix R_delta, where in the following R_space and R_time refer to spatially or temporally structured random effects and I_space and I_time unstructured random effects:
- Type I: I_space (x) I_time
f(spacetime, model = "iid")
- Type II: I_space (x) R_time
f(space, model = "iid", group = time, control.group = list(model = "rw1"))
- Type III: R_space (x) I_time
f(time, model = "iid", group = space, control.group = list(model = "besag"))
- Type IV: R_space (x) R_time
f(space, model = "besag", group = time, control.group = list(model = "rw1"))
We use the group option to define random effects for each of the multinomial categories.
For example, setting f(sur_idx)
with group = cat_idx
gives the grouped survey random effects for each category.
Interaction random effects should be constrained such that the sum over the non-category index is zero. For example, in each category the sum over ages of \alpha_{ak} should be zero:
\sum_a \alpha_{ak} = 0 \forall k = 1, ..., K
Intuition for this constraint is as follows. Suppose that the sum over age groups in category k is non-zero.
\sum_a \alpha_{a1} = C_1,
...
\sum_a \alpha_{ak} = C_k,
...
\sum_a \alpha_{aK} = C_K,
If C_1 =/= C_k =/= C_K, then the age x category interactions have the effect of increasing the likelihood of a particular category. This isn't the desired effect: increasing the likelihood of any of the categories relative to the others should be left to the category random effects \beta_k.
Additional linear constraints may be enforced on random effects in R-INLA
using extraconstr = list(A = A, e = e)
See https://becarioprecario.bitbucket.io/inla-gitbook/ch-INLAfeatures.html#sec:constraints.
A
should be a matrix which has ncol(A) = length(u)
and nrow(A)
equal to the number of constraints required.
e
should have length equal to the number of constraints required.