In this demo, we show a simple example of how to use nimble. We use the survival example with \(y = 19\) alive individuals out of \(n = 57\) released. A binomial likelihood is assumed, with a uniform prior on survival.
First load the nimble package.
library(nimble)
Then define the model, likelihood and prior.
naive.survival.model <- nimbleCode({
# prior
phi ~ dunif(0, 1)
# likelihood
y ~ dbinom(phi, n)
})
Store the constants and data in lists.
my.constants <- list(n = 57)
my.data <- list(y = 19)
Write a function to pick initial values for survival probability.
initial.values <- function() list(phi = runif(1,0,1))
Try the function.
initial.values()
$phi
[1] 0.2105773
Specify the parameter you’d like to monitor, and for which you’d like to carry out posterior inference.
parameters.to.save <- c("phi")
Set the details of the MCMC run. Here we will use 5000 iterations, including 1000 for the burn-in period, and run two chains.
n.iter <- 5000
n.burnin <- 1000
n.chains <- 2
n.thin <- 1
Run nimble.
mcmc.output <- nimbleMCMC(code = naive.survival.model, # model code
data = my.data, # data
constants = my.constants, # constants
inits = initial.values, # initial values
monitors = parameters.to.save, # parameters to monitor
thin = n.thin, # thinning interval (default = 1)
niter = n.iter, # nb iterations
nburnin = n.burnin, # length of the burn-in
nchains = n.chains) # nb of chains
|-------------|-------------|-------------|-------------|
|-------------------------------------------------------|
|-------------|-------------|-------------|-------------|
|-------------------------------------------------------|
First, we have a look to the structure of the outputs. We have a list with two elements, each one is a chain with 4000 values (number of iterations minus the iterations we used for the burn-in) for survival.
str(mcmc.output)
List of 2
$ chain1: num [1:4000, 1] 0.359 0.359 0.369 0.24 0.24 ...
..- attr(*, "dimnames")=List of 2
.. ..$ : NULL
.. ..$ : chr "phi"
$ chain2: num [1:4000, 1] 0.328 0.328 0.308 0.308 0.332 ...
..- attr(*, "dimnames")=List of 2
.. ..$ : NULL
.. ..$ : chr "phi"
Let’s have a look to the first chain.
head(mcmc.output$chain1)
phi
[1,] 0.3587653
[2,] 0.3587653
[3,] 0.3694220
[4,] 0.2395594
[5,] 0.2395594
[6,] 0.2395594
We build a histogram of the values of the first chain, generated from the survival posterior distribution.
mcmc.output %>%
as_tibble() %>%
janitor::clean_names() %>%
ggplot() +
geom_histogram(aes(x = chain1[,"phi"]), color = "white") +
labs(x = "survival probability")
`stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
There are several packages that can take care of the post-processing
for you. We will use MCMCvis
developed by Casey Youngflesh.
Let’s get the numerical summaries of the posterior distribution, along with the R-hat and the effective sample size.
library(MCMCvis)
MCMCsummary(mcmc.output, round = 2)
mean sd 2.5% 50% 97.5% Rhat n.eff
phi 0.34 0.06 0.23 0.33 0.46 1 1715
Now the trace plots and (a kernel density estimate of) the posterior distribution of survival.
MCMCtrace(mcmc.output,
pdf = FALSE)
You may add the value of R-hat and the effective sample size on the graph.
MCMCtrace(mcmc.output,
pdf = FALSE,
ind = TRUE,
Rhat = TRUE,
n.eff = TRUE)
Being able to reproduce the exactly same initial values and MCMC chains can be very useful for pinpointing the source of errors, and also when publishing your code/workflows.
Here are three easy steps to make your workflow more reproducible:
1) Setting a defined initial seed at the start of your code
mySeed <- 0
set.seed(mySeed)
2) Using “pre-sampled” initial values
initVals <- c(initial.values(), initial.values())
mcmc.output <- nimbleMCMC(code = naive.survival.model,
data = my.data,
constants = my.constants,
inits = initVals, # pre-sampled initial values
monitors = parameters.to.save,
thin = n.thin,
niter = n.iter,
nburnin = n.burnin,
nchains = n.chains)
3) Setting the MCMC seed
mcmc.output <- nimbleMCMC(code = naive.survival.model,
data = my.data,
constants = my.constants,
inits = initVals, # pre-sampled initial values
monitors = parameters.to.save,
thin = n.thin,
niter = n.iter,
nburnin = n.burnin,
nchains = n.chains,
setSeed = mySeed) # defined seed