Tutorial

Introduction

This tutorial is designed to provide new users with a step-by-step guide to running an analysis in BUGS. It is not intended to be prescriptive, but rather to introduce you to the main tools needed to run an MCMC simulation in BUGS, and give some guidance on appropriate usage of the software.

The Seeds example from Volume I of the BUGS examples will be used throughout this tutorial. This example is taken from Table 3 of Crowder (1978) and concerns the proportion of seeds that germinated on each of 21 plates arranged according to a 2 by 2 factorial layout by seed and type of root extract. The data are shown below, where r_i and n_i are the number of germinated and the total number of seeds on the i^th plate, i = 1, ..., N. These data are also analysed by, for example, Breslow and Clayton (1993).

[tutorial1]

The model is essentially a random effects logistic regression, allowing for over-dispersion. If p_i is the probability of germination on the i^th plate, we assume

   r_i ~ Binomial(p_i, n_i)

   logit(p_i) = a₀ + a₁x₁_i + a₂x₂_i + a₁₂x₁_ix₂_i + b_i

   b_i ~ Normal(0, τ)

where x₁_i and x₂_i are the seed type and root extract of the i ^th plate, and an interaction term a₁₂x₁_ix₂_i is included.

Specifying a model in the BUGS language

The BUGS language allows a concise expression of the model, using the 'twiddles' symbol ~ to denote stochastic (probabilistic) relationships, and the left arrow ('<' sign followed by '-' sign) to denote deterministic (logical) relationships. The stochastic parameters a₀, a₁, a₂, a₁₂, and τ are given proper but minimally informative prior distributions, while the logical expression for sigma allows the standard deviation (of the random effects distribution) to be estimate

   model 
   {
      for (i in 1:N) {
         r[i] ~ dbin(p[i], n[i])
         b[i] ~ dnorm(0, tau)
         logit(p[i]) <- alpha0 + alpha1 * x1[i] + alpha2 * x2[i]
                     + alpha12 * x1[i] * x2[i] + b[i]
      }
      alpha0 ~ dnorm(0, 1.0E-6)
      alpha1 ~ dnorm(0, 1.0E-6)
      alpha2 ~ dnorm(0, 1.0E-6)
      alpha12 ~ dnorm(0, 1.0E-6)
      tau ~ dgamma(0.001, 0.001)
      sigma <- 1 / sqrt(tau)
   }

More detailed descriptions of the BUGS language along with lists of the available logical functions and stochastic distributions can be found in Model Specification, Distributions, and Functions and functionals. See also the included examples: Volume I, Volume 2, Volume 3 and Volume 4. An alternative way of specifying a model in BUGS is to use the graphical interface known as DoodleBUGS.

Running a model in BUGS

The BUGS software uses compound documents , which comprise various different types of information (formatted text, tables, formulae, plots, graphs, etc.) displayed in a single window and stored in a single file. This means that it is possible to run the model for the "seeds" example directly from this tutorial document, since the model code can be made 'live' just by highlighting it. However, it is more usual when creating your own models to have the model code and data etc. in separate files. We have therefore created a separate file with the model code in it for this tutorial - you will find the file in Examples/Seedsmodel.

Step 1: Open the Seedsmodel file as follows:

* Choose the Open... option from the File menu.
* Select the Examples directory and double-click on the Seeds file to open.

Step 2: To run the model, we first need to check that the model description does fully define a probability model:

* Choose the Specification option from the Model menu.
* Focus the window containing the model code by clicking once (left mouse button) anywhere in the window - the top panel of the window should then become highlighted in blue (usually) to indicate that the window is currently in focus.
* Highlight the word model at the beginning of the code by double clicking on the word (left mouse button).
* Check the model syntax by clicking once on the check model button in the Specification Tool window. A message saying "model is syntactically correct" should appear in the bottom left of the BUGS program window.

Step 3: We next need to load in the data. The data can be represented using S-Plus object notation (file Example/Seedsdata), or as a combination of an S-Plus object and a rectangular array with labels at the head of each column.

* Open one of the data files now.

* To load the data in file Examples/Seedsdata:
- Highlight the word list at the beginning of the data file.
- Click once on the load data button in the Specification Tool window. A message saying "data loaded" should appear in the bottom left of the BUGS program window.

Step 4: Now we need to select the number of chains (i.e. sets of samples to simulate). The default is 1, but we will use 2 chains for this tutorial, since running multiple chains is one way to check the convergence of your MCMC simulations.

* Type the number 2 in the white box labelled num of chains in the Specification Tool window. In practice, if you have a fairly complex model, you may wish to do a pilot run using a single chain to check that the model compiles and runs and obtain an estimate of the time taken per iteration. Once you are happy with the model, re-run it using multiple chains (say 2-5 chains) to obtain a final set of posterior estimates.

Step 5: Next compile the model by clicking on the compile button in the Specification Tool window. A message saying "model compiled" should appear in the bottom left of the BUGS program window. This sets up the internal data structures and chooses the specific MCMC updating algorithms to be used by BUGS for your particular model.

Step 6: Finally the MCMC sampler must be given some initial values for each stochastic node. These can be arbitrary values, although in practice, convergence can be poor if wildly inappropriate values are chosen. You will need a different set of initial values for each chain, i.e. two sets are needed for this tutorial since we have specified two chains - these are stored in files Examples/Seedsinits and Examples/Seedsinits1.

* Open thess files now.
* To load the initial values:
- Highlight the word list at the beginning of the set of initial values.
- Click once on the load inits button in the Specification Tool window. A message saying "initial values loaded: this or another chain contains uninitialized nodes" should appear in the bottom left of the BUGS program window.- Repeat this process for the second initial values file. A message saying "initial values loaded: model initialized" should now appear in the bottom left of the BUGS program window.

Note that you do not need to provide a list of initial values for every parameter in your model. You can get BUGS to generate initial values for any stochastic parameter not already initialized by clicking on the gen inits button in the Specification Tool window. BUGS generates initial values by forward sampling from the prior distribution for each parameter. Therefore, you are advised to provide your own initial values for parameters with vague prior distributions to avoid wildly inappropriate values.

Step 7: Close the Specification Tool window. You are now ready to start running the simulation. However, before doing so, you will probably want to set some monitors to store the sampled values for selected parameters. For the seeds example, set monitors for the parameters alpha0, alpha1, alpha2, alpha12 and sigma - see here for details on how to do this.

To run the simulation:

* Select the Update... option from the Model menu.
* Type the number of updates (iterations of the simulation) you require in the appropriate white box (labelled updates) in the Update Tool window - the default value is 1000.
* Click once on the update button: the program will now start simulating values for each parameter in the model. This may take a few seconds - the box marked iteration will tell you how many updates have currently been completed. The number of times this value is revised depends on the value you have set for the refresh option in the white box above the iteration box. The default is every 100 iterations, but you can ask the program to report more frequently by changing refresh to, say, 10 or 1. A sensible choice will depend on how quickly the program runs. For the seeds example, experiment with changing the refresh option from 100 to 10 and then 1.
* When the updates are finished, the message "updates took *** s" will appear in the bottom left of the BUGS program window (where *** is the number of seconds taken to complete the simulation).
* If you previously set monitors for any parameters you can now check convergence and view graphical and numerical summaries of the samples. Do this now for the parameters you monitored in the seeds example - see Checking convergencefor tips on how to do this (for the seeds example, you should find that at least 2000 iterations are required for convergence).
* Once you're happy that your simulation has converged, you will need to run some further updates to obtain a sample from the posterior distribution. The section How many iterations after convergence? provides tips on deciding how many more updates you should run. For the seeds example, try running a further 10000 updates.
* Once you have run enough updates to obtain an appropriate sample from the posterior distribution, you may summarise these samples numerically and graphically (see section Obtaining summaries of the posterior distribution for details on how to do this). For the seeds example, summary statistics for the monitored parameters are shown below:

      mean   sd   MC_error   val2.5pc   median   val97.5pc   start   sample
   alpha0   -0.5473   0.1935   0.003929   -0.9384   -0.5448   -0.1678   4001   14000
   alpha1   0.06959   0.3203   0.006121   -0.5839   0.08103   0.667   4001   14000
   alpha12   -0.8165   0.4377   0.008459   -1.69   -0.8186   0.05156   4001   14000
   alpha2   1.352   0.2718   0.005731   0.8208   1.349   1.906   4001   14000
   sigma   0.2943   0.1435   0.005438   0.05322   0.2827   0.6112   4001   14000

* To save any files created during your BUGS run, focus the window containing the information you want to save, and select the Save As... option from the File menu.
* To quit BUGS, select the Exit option from the File menu.

Monitoring parameter values

In order to check convergence and obtain posterior summaries of the model parameters, you first need to set monitors for each parameter of interest. This tells BUGS to store the values sampled for those parameters; otherwise, BUGS automatically discards the simulated values.

There are two types of monitor in BUGS:

Samples monitors: Setting a samples monitor tells BUGS to store every value it simulates for that parameter. You will need to set a samples monitor if you want to view trace plots of the samples to check convergence (see Checking convergence ) or if you want to obtain 'exact' posterior quantiles, for example, the posterior 95% Bayesian credible interval for that parameter. (Note that you can also obtain approximate 2.5%, 50% and 97.5% quantiles of the posterior distribution for each parameter using summary monitors.)

To set a samples monitor:

* Select Samples... from the Inference menu;
* Type the name of the parameter to be monitored in the white box marked node in the Sample Monitor Tool window
* Click once on the button marked set
* Repeat for each parameter to be monitored.

Summary monitors: Setting a summary monitor tells BUGS to store the running mean and standard deviation for the parameter, plus approximate running quantiles (2.5%, 50% and 97.5%). The values saved contain less information than saving each individual sample in the simulation, but require much less storage. This is an important consideration when running long simulations (e.g. 1000's of iterations) and storing values for many variables.

We recommend setting summary monitors on long vectors of parameters such as random effects in order to store posterior summaries, and then also setting full samples monitors on a small subset of the random effects, plus other relevant parameters (e.g. means and variances), to check convergence.

To set a summary monitor:

* Select Summary... from the Inference menu;
* Type the name of the parameter to be monitored in the white box marked node in the Summary Monitor Tool window
* Click once on the button marked set
* Repeat for each parameter to be monitored.

Note: you should not set a summary monitor until you are happy that convergence has been reached (see Checking convergence), since it is not possible to discard any of the pre-convergence ('burn-in') values from the summary once it is set, other than to clear the monitor and re-set it.

Checking convergence

Checking convergence requires considerable care. It is very difficult to say conclusively that a chain (simulation) has converged, only to diagnose when it definitely hasn't!

The following are practical guidelines for assessing convergence:

* For models with many parameters, it is impractical to check convergence for every parameter, so just choose a selection of relevant parameters to monitor. For example, rather than checking convergence for every element of a vector of random effects, just choose a random subset (say, the first 5 or 10).
* Examine trace plots of the sample values versus iteration to look for evidence of when the simulation appears to have stabilised:

To obtain 'live' trace plots for a parameter:

- Select Samples... from the Inference menu.
- Type the name of the parameter in the white box marked node in the Sample Monitor Tool window
- Click once on the button marked trace: an empty graphics window will appear on screen.
- Repeat for each parameter of interest.
- Once you start running the simulations (using the Update... tool from the Model menu), trace plots for these parameters will appear 'live' in the graphics windows.

To obtain a trace plot showing the full history of the samples for any parameter for which you have previously set a samples monitor and carried out some updates:

- Select Samples... from the Inference menu.
- Type the name of the parameter in the white box marked node (or select the name from the pull down list of currently monitored nodes - click once on the downward-facing arrowhead to the immediate right of the node field) in the Sample Monitor Tool window.
- Click once on the button marked history: a graphics window showing the sample trace will appear.
- Repeat for each parameter of interest.

The following plots are examples of: (i) chains for which convergence (in the pragmatic sense) looks reasonable (top two plots); and (ii) chains which have clearly not reached convergence (bottom two plots).

[tutorial2]

If you are running more than one chain simultaneously, the trace and history plots will show each chain in a different colour. In this case, we can be reasonably confident that convergence has been achieved if all the chains appear to be overlapping one another.

The following plots are examples of: (i) multiple chains for which convergence looks reasonable (top); and (ii) multiple chains which have clearly not reached convergence (bottom).

[tutorial6]

For a more formal approach to convergence diagnosis the software also provides an implementation (see here) of the techniques described in Brooks & Gelman (1998), and a facility for outputting monitored samples in a format that is compatible with the CODA software - see here.

How many iterations after convergence

Once you are happy that convergence has been achieved, you will need to run the simulation for a further number of iterations to obtain samples that can be used for posterior inference. The more samples you save, the more accurate will be your posterior estimates.

One way to assess the accuracy of the posterior estimates is by calculating the Monte Carlo error for each parameter. This is an estimate of the difference between the mean of the sampled values (which we are using as our estimate of the posterior mean for each parameter) and the true posterior mean.

As a rule of thumb, the simulation should be run until the Monte Carlo error for each parameter of interest is less than about 5% of the sample standard deviation. The Monte Carlo error (MC error) and sample standard deviation (SD) are reported in the summary statistics table (see the next section).

Obtaining summaries of the posterior distribution

The posterior samples may be summarised either graphically, e.g. by kernel density plots, or numerically, by calculating summary statistics such as the mean, variance and quantiles of the sample.

To obtain summaries of the monitored samples, to be used for posterior inference:

* Select Samples... from the Inference menu.
* Type the name of the parameter in the white box marked node (or select the name from the pull down list, or type "*" (star) to select all monitored parameters) in the Sample Monitor Tool window . * Type the iteration number that you want to start your summary from in the white box marked beg: this allows the pre-convergence 'burn-in' samples to be discarded.
* Click once on the button marked stats: a table reporting various summary statistics based on the sampled values of the selected parameter will appear.
* Click once on the button marked density: a window showing kernel density plots based on the sampled values of the selected parameter will appear.

Please see the manual entry Samples... for a detailed description of the Sample Monitor Tool window (i.e. that which appears when Samples... is chosen from the Inference menu).

Contents