Markov Chain Monte Carlo Approaches to Analysis of Genetic and Environmental Components of Human Developmental Change and G E Interaction

Size: px
Start display at page:

Download "Markov Chain Monte Carlo Approaches to Analysis of Genetic and Environmental Components of Human Developmental Change and G E Interaction"

Transcription

1 Behavior Genetics, Vol. 33, No. 3, May 2003 ( 2003) Markov Chain Monte Carlo Approaches to Analysis of Genetic and Environmental Components of Human Developmental Change and G E Interaction Lindon Eaves 1,3 and Alaattin Erkanli 2 Received 12 Sept Final 10 Oct The linear structural model has provided the statistical backbone of the analysis of twin and family data for 25 years. A new generation of questions cannot easily be forced into the framework of current approaches to modeling and data analysis because they involve nonlinear processes. Maximizing the likelihood with respect to parameters of such nonlinear models is often cumbersome and does not yield easily to current numerical methods. The application of Markov Chain Monte Carlo (MCMC) methods to modeling the nonlinear effects of genes and environment in MZ and DZ twins is outlined. Nonlinear developmental change and genotype environment interaction in the presence of genotype-environment correlation are explored in simulated twin data. The MCMC method recovers the simulated parameters and provides estimates of error and latent (missing) trait values. Possible limitations of MCMC methods are discussed. Further studies are necessary explore the value of an approach that could extend the horizons of research in developmental genetic epidemiology. KEY WORDS: Growth curves; Bayesian inference; Gibbs sampling; Markov Chain Monte Carlo methods; twins; longitudinal studies; G E interaction, hierarchical mixed models. INTRODUCTION From relatively modest and controversial beginnings (Jinks and Fulker, 1970), the last quarter-century has seen the emergence of linear structural modeling as little short of an industry in behavioral genetics and genetic epidemiology to the point where it has supplanted almost all other approaches to the analysis of family resemblance. The reasons for its success are fairly clear. The approach is flexible, allowing for a very wide range of models to be specified for the effects of genes and environment. Models have been developed and tested for biological and cultural inheritance, various patterns 1 Virginia Institute for Psychiatric and Behavioral Genetics, Department of Human Genetics, Virginia Commonwealth University. 2 Department of Biostatistics and Bioinformatics, Duke University Medical Center. 3 To whom correspondence should be addressed at Virginia Institute for Psychiatric and Behavioral Genetics, PO Box , Virginia Commonwealth University, Richmond, Virginia of assortative mating, and developmental change in longitudinal data. One major factor accounting for its appeal has been the ready extension of models for family resemblance to incorporate multivariate measures (Martin and Eaves, 1977). On the assumption of underlying multivariate normality (i.e., probit regression), the approach can be further extended to encompass dichotomous (Fulker, 1973) and other categorical data (Eaves et al., 1978) in a rich variety of complex patterns (Neale and Kendler, 1995). The widespread adoption of structural modeling techniques has been further facilitated by the development and dissemination of remarkably flexible, wellsupported, robust, efficient and user-friendly software such as the Mx package (Neale et al., 1999). Over the last decade, structural modeling has provided the unifying platform for teaching the analysis of family resemblance at a widely-subscribed series of international workshops that are now credited with more than 500 alumni /03/ / Plenum Publishing Corporation

2 280 Eaves and Erkanli The structural modeling approach is typically based in maximizing the likelihood of data with respect to parameters of one or more theoretical models. Thus, the approach yields tests of goodness of fit and standard errors of parameter estimates, minimizing, though not removing, one element of subjectivity in deciding between alternative hypotheses. The transformation of many areas of statistical genetic research accomplished by these approaches over the last three decades has been remarkable and due in no small measure to the parallel development of computer hardware and software for numerical analysis that has rendered as matters of routine analyses that were scarcely conceivable 30 years ago. In few areas has the impact of these developments been greater than that of behavioral and psychiatric genetics. When a method works well and is being productive, there is a danger that limitations may be ignored and significant scientific questions be deferred because there are many others that can be answered with the existing methods. Nonlinear models incorporating random genetic and environmental effects comprise one significant arena in which there are serious scientific questions vying for the attention of behavioral geneticists. Examples of issues requiring such models are the analysis of non-linear developmental change (e.g., genetic differences in growth patterns) and non-additive effects (e.g., epistasis and G E interaction). Such processes cannot readily be forced into the Procrustean bed of the linear structural models that have, historically, proved so productive. Although it is a relatively easy matter to write the likelihood functions associated with such nonlinear models (see, e.g., Eaves et al., 1986), obtaining parameter estimates has been a laborious process because the likelihood involves integration over the unknown random effects in the model. For example, in a (multivariate) nonlinear model, such as a survival model or growth curve model, the likelihood of a twin pair depends on the values of the random genetic and environmental effects of the individual twins for all the parameters of the growth curve model (e.g., initial value, asymptote, slope, inflection point). The likelihood of the pair requires that the value of the likelihood of the phenotype, given particular (unknown) genetic and environmental effects be integrated over all values of the latent genetic and environmental variables. This integration has as many dimensions as there are latent genetic and environmental affects in the model. Thus, a four-parameter growth curve model in which the growth parameters of twins are each influenced by separate genetic and environmental factors requires integration over dimensions. Even though this is numerically tractable in simple cases through methods such as Gaussian quadrature, it rapidly becomes tedious in practically important contexts, such as the analysis nonlinear growth curves in twins, because of the relatively large number of dimensions and the lack of robust, accurate and relatively general adaptive procedures for numerical integration. The problem of multidimensional integration is encountered in the context of hierarchial mixed models in which there are random individual differences in parameters of a nonlinear model for a response variable (e.g., Lindstrom and Bates, 1990). Even in those cases where estimates can be obtained numerically, the additional computation required to obtain estimates of confidence intervals is prohibitive. In this paper, we introduce a fairly general Markov Chain Monte Carlo (MCMC) framework of the Bayesian methodology (see, e.g., Gilks et al., 1996) to the analysis of twin data that is free of many of the limitations inherent in some of the current widely used methods. In addition to providing numerical estimates of the parameters of complex models for twin resemblance, the MCMC approach provides, at little extra computational cost, information that is more difficult to obtain through the classic likelihood-based approaches, such as the joint posterior probability distribution of a latent trait for a twin pair. The use of noninformative prior distributions provide inferences that are comparable with the classic MLE approaches, and additionally, if there is prior information available, it too can be incorporated into the existing models through the use of informative prior distributions; such information cannot be utilized with MLE estimation. At one level, we may think of MCMC as an approach to parameter estimation that uses a carefully constructed sequence of Monte Carlo simulations to construct the integrals that are so elusive in ML approaches. However, the way the sequence of simulations is constructed provides a wealth of information and possible insight that is not available through the usual approaches to numerical integration. Although MCMC has been used quite widely in the analysis of linkage and pedigree data (see, e.g., Thomas and Gauderman, 1996), it has not been applied much outside that arena. Part of the reason for this is almost certainly the convenience of linear structural modeling and the conceptual impact of its applications in a wide range of contexts over the last quarter century. Do et al. s (2000) application of

3 Markov Chain Monte Carlo 281 MCMC to survival analysis of twin data is a notable exception that also reflects the relative intractability of nonlinear genetic models to the usual likelihood-based approaches. MARKOV CHAIN MONTE CARLO METHODS Since their inception in physics in early 1950s (Metropolis et al., 1953; Hastings, 1970), MCMC methods have been used in a variety of areas requiring complex statistical modeling such as image analysis (Besag and Green, 1993; Gelfand and Smith, 1990; Geman and Geman, 1984) and generalized linear mixed models (Clayton, 1996; Zeger and Karim, 1991). Although the approach has been implemented in some genetic contexts (see, e.g., Shoemaker et al., 1999, and citations in Thomas and Gauderman, 1996), there have been relatively few published examples of its application to twin data (e.g., Burton et al., 1999; Do et al., 2000). The latter application was developed appropriately in the context of survival analysis that has hitherto proved numerically cumbersome (Meyer and Eaves, 1988; Meyer et al., 1991) for the same reasons that compelled us to explore the application of MCMC methods to the problems described here. Apart from apparently solving some practical problems, MCMC methods commend themselves intellectually because they provide a unifying framework within which many complex problems can be analyzed using generic software (Gilks et al., 1996, p. 1). Indeed, the feasibility of applying MCMC methods to behavior-genetic applications is enhanced enormously by the free dissemination of a windows version (WinBUGS 1.3) of the program BUGS ( Bayesian Inference Using Gibbs Sampling ; Spiegelhalter et al., 2000) that we used in all the applications described here. Superficially, we may regard MCMC methods as one further way of obtaining estimates of parameters of a genetic model when the usual approaches of maximum-likelihood are too tedious. Thus, MCMC methods use Monte Carlo methods to approximate the integrals that have proved tiresome in likelihood-based approaches, such as obtaining the normalizing constants in Bayesian analyses, and marginal likelihood functions in the generalized linear mixed-effects models. However, from another perspective, MCMC s foundation in a Bayesian approach to modeling raises more profound questions about the theoretical framework within which the task of modeling is conceived (see, e.g., Gelman et al., 1995, Sivia, 1996). In the classic ML framework, we seek the values of model parameters that maximize the likelihood of the data, given certain assumptions about the distribution of the data and the underlying parameters. Within the Bayesian framework, we seek the joint posterior distribution of the unknown parameters given the data under, it is hoped, appropriate assumptions about the prior distribution of the parameters and data. Whether or not we choose explicitly to adopt a Bayesian paradigm for genetic modeling, it turns out that some of the algorithms developed within the Bayesian context have advantages for exploring a range of models that appear less tractable within the conventional context of maximum likelihood. Briefly, the MCMC approach constructs a Markov Chain on the (parameter) space of unknown quantities such that, starting with a series of trial values (e.g. means, regressions, genetic variances, genetic and environmental effects etc.), after an initial series of iterations (the burn-in ) successive iterations represent samples from the unknown joint distribution. This is the so-called stationary distribution of the Markov Chain, and in the Bayesian context, it is the joint posterior distribution of all the parameters. We note that in this context, the parameters do not just include the usual parameters of the structural model (means, genetic and environmental variances, etc.) but also the latent genetic and environmental deviations of the individual twins and any missing values. The MCMC iterations are furnished by simulating values for the unknown parameters, conditional upon the given data, using specially catered transition probability kernels (also called proposal distributions) that are not only easy to simulate from, but also guarantee the convergence (in distribution) of the simulated Markov Chain to the true joint posterior distribution. After a burn-in period the probability distribution, and its moments like the expected value of any function, of the unknown quantity is obtained to any desired degree of precision by taking an (ergodic) average of the successive values over a sufficiently large number of iterations. The Gibbs sampler (Creutz, 1979; Gelfand and Smith, 1990; Geman and Geman, 1984; Ripley, 1979) is perhaps the most popular MCMC approach to construct a Markov chain with the desired properties. In the Gibbs sampling approach, the Markov kernels consist of the conditional distributions of each variables of interest given all the other variables. As an example consider random variables X and Y having an unknown joint distribution [X, Y], and assume further that each of the conditional distributions [XƒY] and [YƒX] are available in analytically closed form. Here the Markov kernels are the conditionals [XƒY] and [YƒX]. So, if X 0

4 282 Eaves and Erkanli and Y 0 are initial values, then the Markov Chain is constructed on the XY space by simulating successively a sequence of {X r, Y r from the known conditionals [YƒX r-1 ] and [XƒY r-1 ] for r 1,2,..., R. It can be shown that the joint distribution of the sequence {X r, Y r converges to the joint distribution [X, Y] as R tends toward infinity, as long as these conditionals are bonafide probabilitydistributions. Thus, for a sufficiently large R, the {X r, Y r resemble draws from the true joint distribution [X, Y]. Within the Bayesian context, X and Y are usually unknown parameters (e.g., the mean and variance), and [XƒY] and [YƒX] (suppressing the conditioning on the data) are the conditional posterior distributions, and [X, Y] is the joint posterior distribution of X and Y, respectively. For example, the marginal posterior distribution [X] of X can be approximated by the Monte Carlo integration, for each X x, [x] 1/R r [x ƒ Y r ], where the summation is over r 1,2,..., R. Similarly, the expectation of a function g (X) is approximated by the Monte Carlo average Eg(X) 1/R r g(x r ). Note that a desired byproduct of MCMC approach is that not only the expectations, but the entire posterior distribution of g(x) is approximated by using the sequence {g(x r ). There are also several other ways, such as general Metropolis-Hasting algorithms, to construct an MCMC sampler; in fact the Gibbs sampling is a special case of these general algorithms. A more thorough account of the approach, which is beyond the scope of this paper, may be found in Tierney (1994), Gilks et al. (1996), and Brooks (1998). A recent paper by Besag (2000) reviews several MCMC approaches and provides a comprehensive list of references. Example 1: The Classic Multivariate Genetic Model for MZ and DZ Twins It is convenient to introduce the approach with an example familiar to those with practice in linear structural models, that is the estimation of the additive genetic and within-family environmental covariance matrices, from data on a pair of variables measured on samples of 100 MZ and 100 DZ twins. The data were simulated using SAS on the assumption of multivariate normality. Table I summarizes the population parameter values used to generate the simulated data and the observed mean vectors and 4 4 covariance matrices for the MZ and DZ twins. The table also shows the maximum- Table I. Population Parameter Values Used in Simulation of Bivariate Twin Data and Values Realized Using Mx for ML Estimation (n 100 MZ and 100 DZ Pairs) Parameter ML estimate Population value mu[1] mu[2] sigma2.g[1,1] sigma2.g[1,2] sigma2.g[2,2] sigma2.e[1,1] sigma2.e[1,2] sigma2.e[2,2] likelihood estimates of the MZ and DZ means and the additive genetic and within-family environmental covariance matrices obtained by fitting the structural model to the raw data vectors in Mx (Neale et al., 1999). The same data were analyzed by MCMC using WinBUGS to implement the Gibbs sampler. The first step is to construct a directed graph (Fig. 1) that expresses the logical and stochastic process by which the twin data are generated. WinBUGS provides a graphical user interface (GUI) that allows most of the elements of the model to be specified graphically as doodles. Subsequently the graphical model may be automatically translated into a script with syntax similar to S-Plus, and compiled with object data prior to initialization with trial values and execution. The automated code may be modified manually to carry out side-computations that cannot be implemented directly in the GUI, such as the computation of standardized estimates. The underlying model follows that of Jinks and Fulker (1970) in recognizing that each twin of a DZ pair is realized, in the absence of shared environmental effects, by sampling three separate (normal) variables: a between families genetic component, g 2 ; a within families genetic effect, g 1, and a within families environmental effect, e 1. The expected variances of the between and within family genetic effects may be parameterized in terms of additive and non-additive genetic effects. If genetic effects are additive and mating is random, the variances of g 2 and g 1 are equal. The process that generates MZ twins is similar, but each twin of a pair share the same within and between-family genetic effects (so the genetic effects of MZs are identical). Each MZ twin still receives its own unique within-family environmental effect, e 1. The logic of the genetic model follows closely the underlying process by which genetic and environmental effects arise. The between family genetic effects

5 Markov Chain Monte Carlo 283 Fig. 1. Graphical model for random additive genetic and within-family environmental effects on multivariate MZ and DZ twin data. reflect the average genetic differences between parents and the within-family effects those of genetic segregation. The environment within families is thus a further process that differentiates between individuals of known genetic constitution. This natural logic is mirrored in the way in which BUGS models are written for twin data (Fig. 1). Figure 1 represents many, but not all, of the features of a BUGS model. A fuller description and examples are provided in the manuals accompanying the downloaded WinBUGS software. (Spiegelhalter et al., 2000). Constants are represented by rectangles and variables by ellipses. Stochastic dependence is represented by a single one-headed arrow. The distribution of stochastic nodes is not shown in the figure, but is selected from a menu of options available when building the doodle interactively. The rectangles ( plates ) represent the domain over which subscripts vary in subscripted variables and translate into For loops in the code derived from the doodle. The graphical model is in two parts. The left half of the diagram represents the process by which data on DZ pairs are generated. The element ydz[l,j,k] contains the observed value of the jth twin of the ith pair on the kth variable. The model allows generally for N variables. Similarly, the corresponding typical datum for an MZ individual is stored in ymz[l,j,k] represented on the right of the diagram. Elements such as ydz[l,j,k] are termed nodes and are represented in ellipses in the BUGS GUI. The observed values of the twins depend stochastically on prior nodes whose interrelationships are specified in the graph. Starting at the top, we have the population mean vector, mu, it is assumed initially that these means have a relatively uninformative prior distribution and are sampled from a multivariate normal distribution with mean vector mean (0,0) and precision matrix (precis). The precision matrix is equal to the inverse of the prior 2 2 covariance matrix of the population means. These so-called meta-parameters are typically chosen, in our application, to reflect our relative lack of prior information about the population parameters. This amounts to making the diagonal elements of precis fairly small (we used ). These prior values are supplied as data to BUGS. Notice that with non-informative priors, the Bayesian inference usually produces results that are comparable to those obtained using MLE since the joint posterior distribution is highly dominated by the likelihood function relative to the prior distribution. The next layer of the process in DZ twins, which corresponds to the way twin pairs are produced, comprises generation of the means of the ndz twin pairs. The node g2dz[i,k] denotes the expected value of the

6 284 Eaves and Erkanli mean of the jth pair on the kth variable. The vector of between family effects is generated from the multivariate normal distribution with mean vector mu and precision matrix tau.g. The matrix tau.g is the inverse of half the additive genetic covariance matrix. Our initial model assumes that this matrix is of general positive definite form. It is possible to devise models that reflect other hypotheses about the genetic covariance structure. In the GUI notation of WinBUGS, a singleheaded arrow represents stochastic dependence. This should not be confused with the path coefficient familiar in LISREL formulations. For simplicity we ignore the effects of the shared environment in this early example, but there is no difficulty adding an environmental component to the differences between families. Our choice of g2 to denote the between-pair genetic effects emphasizes the parallel between the development of the MCMC model for twin data and the early formulation of the variance components model for twin data by Jinks and Fulker (1970). The fact that each ith pair of the set of ndz DZ pairs has its own genetic deviation is represented by the large rectangle ( plate ) encompassing the g2dz[i,1:n]. The plate corresponds to a for loop in many programming languages. The expected values of the individual DZ twins, g1dz[i,j,k] are obtained by adding bivariate normal within-pair genetic deviations to the expected values of the corresponding pair. The genetic deviations are again sampled from a multivariate normal distribution with covariance matrix equal to half the additive genetic variance. Thus, in our GUI, the individual expected values of the DZ twins depend stochastically on the g2dz[i,k] and the genetic precision matrix, tau.g. The fact that the single headed arrow ( edge ) links both g2dz and g1dz to tau.g automatically imposes the constraint that the genetic variances are equal within and between DZ (sib) pairs when gene action is additive. Finally, the individual twin observations (ydzil,j,k]) are represented as samples from the multivariate normal distribution with expected values equal to the genetic effects, g1dz, and precision equal to the inverse, tau.e, of the within-family environmental covariance matrix. The fact that the individual twins are nested within pairs is represented in the graph by the second plate nested within the outer plate. The inner plate contains the expected individual values, g1dz, and the observations, ydz, but excludes the g2dz that are only captured within the domain of the outer plate. The graph is completed by representing, on the right-hand side of Figure 1, the process that generates pairs of MZ twins. The MZ part of the graph differs only in the fact that both the between-family ( g 2 ) and within-family ( g 1 ) genetic deviations contribute to expected values of the twin pair in MZ twins, because they have identical alleles segregating from their parental genotypes. The individual MZ twin observations are assumed to be multivariate normal with expected values equal to the pair means (g1mz[i,k]) and dispersion equal to the within family environmental covariance matrix. The fact that both between and within-family genetic deviations contribute to differences between MZ pairs is indicated by excluding the g1mz effects from the inner plate in the case of MZ pairs (Fig. 1). Note that the nodes on the MZ part of the diagram depend stochastically (have arrows coming from) the same precision nodes as those for DZ twins. By putting MZ and DZ twins on the same diagram in this way, the implied equalities of the components of covariance in MZ and DZ twins are automatically imposed. The BUGS code generated automatically from the graph was amended to invert the genetic and environmental precision matrices to yield the additive genetic and within-family environmental covariance matrices. The code is supplied in (Appendix 1). Electronic copies of this and other doodles, code and simulated data used in this paper can be obtained from the first author. The MCMC simulation was started with trial values of (15,15for the mean vector, the identity matrix, I, for the genetic precision matrix (tau.g) and 6I for the environmental precision matrix (tau.e). These values were selected to be of the right order but sufficiently far from the ML estimates as to pose a realistic challenge to the MCMC algorithm. The MCMC algorithm converged very rapidly, but a 2000 iteration burnin preceded a further 5000 iterations that were sampled to characterize the stationary distribution. With the relatively small data set and simple model the CPU time was very short on a lap-top computer, though significantly slower than Mx applied to the raw observations. WinBUGS offers a rich range of tools for monitoring the progress of the algorithm, including active traces of the iterations of any desired nodes, histories of selected bands of iterations, kernel density plots and a variety of summary statistics. Table II gives the statistical summaries of the first 5000 MCMC iterations after a 2000 iteration burn-in. The first part of the table corresponds to Table I and gives the MCMC values of the parameters that were also obtained by ML using the conventional structural

7 Markov Chain Monte Carlo 285 Table II. Summary Statistics for 5000 MCMC Iterations of Bivariate AE Model after 2000 Iteration Burn in Node Mean SD MC error 2.5% Median 97.5% deviance mu[1] mu[2] sigma2.g[1,1] sigma2.g[1,2] sigma2.g[2,2] sigma2.e[1,1] sigma2.e[1,2] E sigma2.e[2,2] g1mz[1,1] g1mz[1,2] g1mz[2,1] g1mz[2,2] g1mz[3,1] g1mz[3,2] g1mz[4,1] g1mz[4,2] g1mz[5,1] g1mz[5,2] g1dz[1,1,1] g1dz[1,1,2] g1dz[1,2,1] g1dz[1,2,2] g1dz[2,1,1] g1dz[2,1,2] g1dz[2,2,1] g1dz[2,2,2] g1dz[3,1,1] g1dz[3,1,2] g1dz[3,2,1] g1dz[3,2,2] g1dz[4,1,1] g1dz[4,1,2] g1dz[4,2,1] g1dz[4,2,2] g1dz[5,1,1] g1dz[5,1,2] g1dz[5,2,1] g1dz[5,2,2] Note: The nodes are defined as follows: deviance minus twice the log-likelihood; mu[1],mu[2] means of first and second variable; sigma2.g elements of 2 2 genetic covariance matrix; sigma2.e elements of 2 2 environmental covariance matrix; g1mz[i,k] genetic effect on kth variable in ith MZ twin pair ( genetic score ); g1dz[i,j,k] genetic effect on kth variable in jth twin of ith DZ pair. modeling approach. Note that the agreement between the ML estimates and those obtained by averaging the sequence of Monte Carlo simulations is very close. Table II however, gives a number of other statistics that are not readily available from the ML algorithm. The MCMC algorithm gives the median parameter values and the upper and lower 2.5% confidence intervals. Mx experienced difficulty in obtaining these confidence intervals for this small sample. The standard deviations of the parameters are also obtained directly from the MCMC algorithm. These may be time-consuming to calculate numerically within the ML algorithm since they require numerical computation of the Hessian matrix that usually requires a series of additional steps after the ML estimates have been obtained.

8 286 Eaves and Erkanli The flexibility of the MCMC method, however, is illustrated by the fact that it yields almost as a byproduct, estimates of the individual genetic deviations of the MZ and DZ twins, together with a variety of statistics for evaluating their precision. The fact that MCMC is a carefully constructed simulation algorithm means that, at each iteration, the latent genetic and environmental effects of individual twins are simulated conditional on their phenotypes and the parameters of the genetic model. These latent scores and, indeed, any missing values are thus treated like every other parameter in the MCMC model and, after the algorithm has converged, can be sampled and summarized to provide mean values and estimates of error for the genetic and environmental deviations of individual twins. In multivariate genetic analyses, MCMC can be expected to evaluate genetic or environmental factor scores (see, e.g., Molenaar et al., 1990) and their confidence intervals at the same time as fitting the genetic factor model at very little extra computational cost. We regard the availability of estimates of individual scores and missing values as a significant bonus of the Bayesian approach. These estimates are given for the first 10 MZ and DZ pairs in Table II. We note in passing that, although we did not simulate missing data in this basic example, in principle the MCMC method also can also routinely simulate missing values at no additional computational cost. However, in the current version of BUGS, the simulation of multivariate normal deviates cannot accommodate missing values. The method thus appears to accomplish in a relatively straightforward manner much of what can be achieved by other tailor-made algorithms. Example 2: Nonlinear Growth Curve Models for Twin Data The graph in Figure 1 is modified simply to take into account a non-linear developmental model in which there is random genetic and environmental variation in the parameters of a non-linear growth curve (Fig. 2). Introducing a third plate, nested within individual twins, to reflect the repeated measures of the outcome variable extends the model. In the figure, the response of the jth twin of the ith DZ pair on the kth occasion is denoted by rdz[l,j,k]. The responses for MZ twins are denoted by the rmz[l,j,k]. The model assumes that the responses on the kth occasion have expected values ydz[l,j,k] and ymz[l,j,k]. The residual variances are assumed, for simplicity, to be constant and independent over occasions (i.e. any correlation across time is explained by the underlying developmental model). The residual variance is the inverse of the precision tau.res that is assumed to be the same for MZ and DZ twins. The expected values of the MZ and DZ responses are assumed to be logical functions of underlying random latent variables whose (multivariate) genetic and environmental structure can be captured by the model in Figure 1. In this case we assume that the responses are measured at ten equally spaced intervals (k ), and that the temporal change within individuals follows a four-parameter logistic function with random genetic and environmental variation between individuals in each of the four parameters. Thus we write (for DZ twins) rdz[l,j,k] y[l,j,1] y[l,j,2]/ {1 exp( y[l,j,4] (k y[l,j,3])). The individual parameters thus correspond to initial value, y[l,j,1], range from initial value to asymptotic value, y[l,j,2], time of maximum rate of change, y[l,j,3] and rate of change, y[l,j,4]. Similar parameters are defined for MZ pairs. Other functional forms can be incorporated by relatively minor changes in the code given in Appendix 2. Data were simulated using SAS for 500 pairs each of MZ and DZ twins on the assumption that the four growth curve parameters each had independent additive genetic (A) and within-family environmental (E) components. The heritabilities of the four components were all assumed to be 0.8. Population parameter values were assumed to be N[1, 1], ranges were N[10, 1], times of maximum growth were N[5.5, 1], and rates of change were N[1,0.1]. The residual within-occasion error was assumed to be 0.5. The genetic and environmental correlations between the random growth-curve parameters were all assumed to be zero in the original data simulation, but the genetic and environmental covariances were allowed to be free parameters in the Bayesian analysis. Noninformative multivariate normal priors were assumed for the mean response vectors and Wishart priors (omega.g and omega.e respectively) were assumed for the precisions of the genetic and environmental covariance matrices. The 10,000 MCMC iterations took approximately 45 minutes on a laptop computer. The statistics for the principal parameters of the model sampled over the last 2000 MCMC iterations are summarized in Table III. The means of the growth curve parameters and error variances are reproduced quite precisely. The heritabilities of the four random components (based on the median values of the genetic and environmental components) are: initial value 0.932; range 0.825; time of maximum change 0.762; and rate of change The apparent

9 Markov Chain Monte Carlo 287 Fig. 2. Graphical model for random additive genetic and environmental effects in four-paramater logistic growth curve model for MZ and DZ twin data. upward bias in the estimated heritability of the initial value is not resolved, but cannot be explained simply by the correlations between twins observed in the simulated initial values since these were (MZ) and (DZ) respectively, close to their expected values. The MCMC estimates of the genetic and environmental covariances are all close to the zero values assumed in the simulation. As in the case of the basic multivariate model (Example 1), the MCMC analysis automatically estimates the genetic effects contributing to growth curve parameters of the individual twins but illustrative values are not presented here because of space limitations. These individual effects, for example, can be further analyzed to identify interesting sub groups or populations of twins that are hidden in the observed data, which would be extremely difficult to do in a standard MLE approach. Example 3: Genotype Environment Interaction and Correlation Our third example addresses a challenging problem that has been frustratingly insoluble within the framework of structural modeling or conventional regression analysis, namely that of the genetic control of sensitivity to the environment (G E interaction). Available methods for the analysis of G E have depended on stratification of a sample of relatives by values of an environmental covariate (e.g., twins discordant for an environmental factor or siblings stratified by a putative environmental covariate) and testing for heterogeneity among genetic parameters within strata. This approach assumes that the environmental measures are fixed rather than random and independent of genetic effects on the outcome of interest. Environments are not fixed and their independence can seldom be guaranteed in practice when, for example, aspects of the family environment that interact with genetic liability are themselves correlated with genetic liability. Our preliminary studies suggest that the MCMC approach may provide a framework for removing this restriction and enhancing our capability of a more rigorous and flexible analysis of G E interaction. Clearly, there are many different ways in which nonadditive effects, including G E, may feature in a family study. We consider one example likely to be relevant to future analyses of developmental twin data. We assume that we have measured an outcome, symptoms of depression, for example, that is influenced by genetic and environmental factors that may interact. We further assume that we have measured a covariate (endophenotype) that is itself partly genetically deter-

10 288 Eaves and Erkanli Table III. Summary Statistics for Four-Parameter Logistic Model for 10-Occasion Longitudinal MZ and DZ Twin Data from 2000 MCMC Iterations after 8000 Iteration Burn-in Node Mean SD MC error 2.5% Median 97.5% mu[1] mu[2] mu[3] mu[4] E sigma2.g[1,1] sigma2.g[1,2] sigma2.g[1,3] sigma2.g[1,4] E sigma2.g[2,2] sigma2.g[2,3] sigma2.g[2,4] E sigma2.g[3,3] sigma2.g[3,4] E sigma2.g[4,4] E sigma2.e[1,1] E sigma2.e[1,2] E sigma2.e[1,3] E sigma2.e[1,4] E sigma2.e[2,2] E sigma2.e[2,3] E sigma2.e[2,4] E sigma2.e[3,3] sigma2.e[3,4] E sigma2.e[4,4] E sigma2.res E Notes: Subscripts on estimates of means (mu) and genetic and environmental covariances (sigma2.g and sigma2.e) refer to parameters of four-parameter logistic model as follows: 1 initial value (1.0); 2 total growth (final asymptotic value-initial value, 10.0); 3 age of maximum rate of change (5.5); 4 growth rate (1.0). True values are given in parentheses. Sigma2.res is the residual, within occasion, variance (0.5), assumed to be constant across occasions. Residual effects are assumed to be uncorrelated across occasions. mined and indexes the genetic sensitivity of the individual to a specific environmental factor. Finally, we assume that we have measured a variable that is hypothesized to be a covariate (partly environmental ) of the outcome. The regression of the outcome on the environmental covariate varies between subjects as a function of differential (possibly genetic) sensitivity of individuals to the measured environmental covariate. We could, for example, envision a system in which pre-pubertal anxiety was genetically correlated with post-pubertal depression, but was also an index of genetic sensitivity to the impact of life events. The problem may be further complicated by the fact that there may be genetic effects on exposure to life events and that these may correlate genetically with anxiety and or depression. Figure 3 represents the graph that specifies the model for this particular problem. As before, the model is based on the underlying multivariate linear model represented in Figure 1, with modifications to allow for the unique aspects of the GxE model. As before, the model comprises two similar components representing the relationships among the nodes for DZ (left side of the figure) and MZ twin individuals nested within pairs. In this case, we define three variables each having their own independent errors. For DZ twins, we define rdz[i,j] as the response variable for the jth twin in the ith DZ pair, the covariates sdz[i,j] and edz[i,j] represent the corresponding index of sensitivity to the environment and the measured environment respectively. The model for MZ manifest variables follows the same basic pattern, with modifications to allow for the fact that both within-family and betweenfamily genetic differences contribute to differences between MZ pairs (see Fig. 3, cf. Jinks and Fulker, 1970). The DZ responses have expected values erdz[i,j] g1dz[i,j,3] esdz[i,j]edz[i,j] and precision tau.r equal to the inverse of the withinfamily environmental variance in the responses sigma2.r. The product term represents the interaction

11 Markov Chain Monte Carlo 289 Fig. 3. Graphical model for random additive genetic effects and within-family environmental effects on sensitivity to partly heritable environmental effects (G E interaction in presence of genotype-environment correlation). between genes and the measured environment, edz[l,j], as the product of the measured environment and a random coefficient, esdz[i,j], that depends on the genotype of the individual such that esdz[l,j] g1dz[i,j,2]. The environmental measures, edz[i,j], have expected values eedz[i,j] g1dz[i,j,1] and precision tau.e equal to the inverse of the within-family environmental effect on the environmental measure. The sensitivities to the environment, esdz[i,j], are assumed to be indexed by the measured variable sdz[i,j] that have expected values esindz[i,j] beta[1] beta[2]esdz[l,j] and precision tau.s equal to the withinfamily environmental variance in the index of sensitivity to the environment, sigma2.s. The coefficients beta[1] and beta[2] are the intercept and slope respectively of the regression of the index of sensitivity on the latent genetic sensitivity to the environment. The structure of the three latent genetic variables, g1dz[l,j,k],k , follows that already described above for the basic multivariate linear structural model for MZ and DZ twin data, requiring specification of their mean vector, mu, and the inverse, tau.g, of half the additive genetic covariance matrix, sigma2.g. In its current form, the model assumes that there is no withinfamily environmental correlation between the three manifest variables except for that introduced between the response variable and the environmental measure by virtue of the regression of the former on the latter. Other parameterizations may be devised that enable this assumption to be relaxed. Appendix 3 reproduces the code generated by WinBUGS, with some minor modifications to allow direct inspection of genetic covariances and environmental variances. The population parameters used to generate the simulated data are provided in Table V. The three latent variables are all assumed to be independent with unit total variance. The assumption of independence is made in the simulation to simplify inspection of the simulated data points but not in the data analysis. Thus, although the data do not include G-E correlation in the first example, they do allow for it in the analysis through the estimation of the positive-definite genetic covariance matrix. If the analysis is correct, the covariances between the latent variables should all be zero. The simulations assumed that (additive) genetic effects contributed to random variation in sensitivity to the environment (h 2 0.4), the environmental index (h 2 0.4) and the outcome variable prior to GxE interaction (h 2 0.8). Although the mean sensitivity to the environment is zero,

12 290 Eaves and Erkanli there is considerable variation, some of it genetic, among individuals in sensitivity to the environment. In this case, some individuals will show a positive regression of phenotype on the environment and others a negative response. The latent variables are assumed to be normal in the current formulation. Table IV summarizes the statistics realized in a simulation of 1000 MZ and 1000 DZ pairs from these population parameters. We note that the simulated values for the measured environment, sensitivity to the environment and the outcome prior to GxE yield statistics quite close to those expected. The response variable, which includes the effects of GxE interaction, is significantly less correlated in MZ and DZ twins than the raw outcome variable in the absence of GxE. However, we note that GxE interaction adds about 40% to the variance of the raw outcome in the absence of GxE. Table V summarizes the results of the MCMC analysis of the simulated data when allowance is made in the model for random effects on sensitivity to the environment. We note that BUGS correctly recovers the latent genetic covariance structure of the influences on the outcome, the environment and sensitivity to the environment, and correctly characterizes the relationship between the index of sensitivity to the environment and the latent sensitivity to the environment. The estimated genetic contribution to the environment and the genetic contribution to sensitivity are about right and the genetic covariances between the measures are also about zero as assumed in the simulation. The analysis of GxE does not address directly the question of the impact of removing GxE in the model for data in which there is known to be substantial GxE. Each of the MCMC iteration in BUGS computes an approximate realization of the posterior distribution of deviance (minus twice the logarithm of the likelihood) that can be traced and summarized over iterations in exactly the same way as any other node in the model. Subsequently, different models can be compared based on their deviance distributions. Alternatively, one can use the DIC (Deviance Information Criterion) which is a penalized version of the posterior mean of the deviance function (Spiegelhalter et al., 1998). For the full GxE model, the average deviance over 5000 iterations (Table V) was The crucial element of our GxE model is the random component in sensitivity to the environment. We thus recast the model of Figure 3 to retain a fixed effect of the measured environment (Fig. 4), but removing the random effect. After a 2000 iteration burn-in we obtained the revised parameter estimates also given in Table V. The average deviance was , which is significantly higher than the value obtained under the full model indicating strong support for GxE in the simulated example. Table IV. Summary Statistics for Simulated Data on 1000 MZ and 1000 DZ Twins in Presence of G E and Genetic Effects on Environmental Index (MZ Correlations Are in the Upper Triangle and DZ Correlations in the Lower Triangle) Correlations V11 V21 V31 S1 Resp1 V12 V22 V32 S2 Resp2 V V V S Resp V V V S Resp MZ Mean MZ S.D DZ Mean DZ S.D Note: Variables are labeled as follows: V11 Measured environment on first twin; V21 Index of environmental sensitivity of first twin; V31 Phenotype of first twin in absence of G E (not used in analysis); S1 genetic sensitivity of first twin to the environment; Resp1 Phenotype of first twin. V12, V22, V32, S2 and Resp2 are the corresponding variables for the second twin.

Ordinal Data Modeling

Ordinal Data Modeling Valen E. Johnson James H. Albert Ordinal Data Modeling With 73 illustrations I ". Springer Contents Preface v 1 Review of Classical and Bayesian Inference 1 1.1 Learning about a binomial proportion 1 1.1.1

More information

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Journal of Social and Development Sciences Vol. 4, No. 4, pp. 93-97, Apr 203 (ISSN 222-52) Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Henry De-Graft Acquah University

More information

Data Analysis Using Regression and Multilevel/Hierarchical Models

Data Analysis Using Regression and Multilevel/Hierarchical Models Data Analysis Using Regression and Multilevel/Hierarchical Models ANDREW GELMAN Columbia University JENNIFER HILL Columbia University CAMBRIDGE UNIVERSITY PRESS Contents List of examples V a 9 e xv " Preface

More information

Combining Risks from Several Tumors Using Markov Chain Monte Carlo

Combining Risks from Several Tumors Using Markov Chain Monte Carlo University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln U.S. Environmental Protection Agency Papers U.S. Environmental Protection Agency 2009 Combining Risks from Several Tumors

More information

A Brief Introduction to Bayesian Statistics

A Brief Introduction to Bayesian Statistics A Brief Introduction to Statistics David Kaplan Department of Educational Psychology Methods for Social Policy Research and, Washington, DC 2017 1 / 37 The Reverend Thomas Bayes, 1701 1761 2 / 37 Pierre-Simon

More information

Bias in Correlations from Selected Samples of Relatives: The Effects of Soft Selection

Bias in Correlations from Selected Samples of Relatives: The Effects of Soft Selection Behavior Genetics, Vol. 19, No. 2, 1989 Bias in Correlations from Selected Samples of Relatives: The Effects of Soft Selection M. C. Neale, ~ L. J. Eaves, 1 K. S. Kendler, 2 and J. K. Hewitt 1 Received

More information

Individual Differences in Attention During Category Learning

Individual Differences in Attention During Category Learning Individual Differences in Attention During Category Learning Michael D. Lee (mdlee@uci.edu) Department of Cognitive Sciences, 35 Social Sciences Plaza A University of California, Irvine, CA 92697-5 USA

More information

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill)

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill) Advanced Bayesian Models for the Social Sciences Instructors: Week 1&2: Skyler J. Cranmer Department of Political Science University of North Carolina, Chapel Hill skyler@unc.edu Week 3&4: Daniel Stegmueller

More information

Advanced Bayesian Models for the Social Sciences

Advanced Bayesian Models for the Social Sciences Advanced Bayesian Models for the Social Sciences Jeff Harden Department of Political Science, University of Colorado Boulder jeffrey.harden@colorado.edu Daniel Stegmueller Department of Government, University

More information

Mediation Analysis With Principal Stratification

Mediation Analysis With Principal Stratification University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 3-30-009 Mediation Analysis With Principal Stratification Robert Gallop Dylan S. Small University of Pennsylvania

More information

Hierarchical Bayesian Modeling of Individual Differences in Texture Discrimination

Hierarchical Bayesian Modeling of Individual Differences in Texture Discrimination Hierarchical Bayesian Modeling of Individual Differences in Texture Discrimination Timothy N. Rubin (trubin@uci.edu) Michael D. Lee (mdlee@uci.edu) Charles F. Chubb (cchubb@uci.edu) Department of Cognitive

More information

Method Comparison for Interrater Reliability of an Image Processing Technique in Epilepsy Subjects

Method Comparison for Interrater Reliability of an Image Processing Technique in Epilepsy Subjects 22nd International Congress on Modelling and Simulation, Hobart, Tasmania, Australia, 3 to 8 December 2017 mssanz.org.au/modsim2017 Method Comparison for Interrater Reliability of an Image Processing Technique

More information

Resolving multiple epigenetic pathways to adolescent depression

Resolving multiple epigenetic pathways to adolescent depression Journal of Child Psychology and Psychiatry 44:7 (2003), pp 1006 1014 Resolving multiple epigenetic pathways to adolescent depression Lindon Eaves, 1 Judy Silberg, 1 and Alaattin Erkanli 2 1 Virginia Institute

More information

It is shown that maximum likelihood estimation of

It is shown that maximum likelihood estimation of The Use of Linear Mixed Models to Estimate Variance Components from Data on Twin Pairs by Maximum Likelihood Peter M. Visscher, Beben Benyamin, and Ian White Institute of Evolutionary Biology, School of

More information

A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) *

A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) * A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) * by J. RICHARD LANDIS** and GARY G. KOCH** 4 Methods proposed for nominal and ordinal data Many

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Introduction to Bayesian Analysis 1

Introduction to Bayesian Analysis 1 Biostats VHM 801/802 Courses Fall 2005, Atlantic Veterinary College, PEI Henrik Stryhn Introduction to Bayesian Analysis 1 Little known outside the statistical science, there exist two different approaches

More information

Bayesian Methods p.3/29. data and output management- by David Spiegalhalter. WinBUGS - result from MRC funded project with David

Bayesian Methods p.3/29. data and output management- by David Spiegalhalter. WinBUGS - result from MRC funded project with David Bayesian Methods - I Dr. David Lucy d.lucy@lancaster.ac.uk Lancaster University Bayesian Methods p.1/29 The package we shall be looking at is. The Bugs bit stands for Bayesian Inference Using Gibbs Sampling.

More information

Application of Bayesian Inference using Gibbs Sampling to Item-Response Theory Modeling of Multi-Symptom Genetic Data

Application of Bayesian Inference using Gibbs Sampling to Item-Response Theory Modeling of Multi-Symptom Genetic Data Behavior Genetics, Vol. 35, No. 6, November 2005 (Ó 2005) DOI: 10.1007/s10519-005-7284-z Application of Bayesian Inference using Gibbs Sampling to Item-Response Theory Modeling of Multi-Symptom Genetic

More information

Bayesian Estimation of a Meta-analysis model using Gibbs sampler

Bayesian Estimation of a Meta-analysis model using Gibbs sampler University of Wollongong Research Online Applied Statistics Education and Research Collaboration (ASEARC) - Conference Papers Faculty of Engineering and Information Sciences 2012 Bayesian Estimation of

More information

Michael Hallquist, Thomas M. Olino, Paul A. Pilkonis University of Pittsburgh

Michael Hallquist, Thomas M. Olino, Paul A. Pilkonis University of Pittsburgh Comparing the evidence for categorical versus dimensional representations of psychiatric disorders in the presence of noisy observations: a Monte Carlo study of the Bayesian Information Criterion and Akaike

More information

Multilevel IRT for group-level diagnosis. Chanho Park Daniel M. Bolt. University of Wisconsin-Madison

Multilevel IRT for group-level diagnosis. Chanho Park Daniel M. Bolt. University of Wisconsin-Madison Group-Level Diagnosis 1 N.B. Please do not cite or distribute. Multilevel IRT for group-level diagnosis Chanho Park Daniel M. Bolt University of Wisconsin-Madison Paper presented at the annual meeting

More information

Score Tests of Normality in Bivariate Probit Models

Score Tests of Normality in Bivariate Probit Models Score Tests of Normality in Bivariate Probit Models Anthony Murphy Nuffield College, Oxford OX1 1NF, UK Abstract: A relatively simple and convenient score test of normality in the bivariate probit model

More information

An Introduction to Bayesian Statistics

An Introduction to Bayesian Statistics An Introduction to Bayesian Statistics Robert Weiss Department of Biostatistics UCLA Fielding School of Public Health robweiss@ucla.edu Sept 2015 Robert Weiss (UCLA) An Introduction to Bayesian Statistics

More information

Univariate modeling. Sarah Medland

Univariate modeling. Sarah Medland Univariate modeling Sarah Medland Starting at the beginning Data preparation The algebra style used in Mx expects line per case/family (Almost) limitless number of families and variables Missing data Default

More information

Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers

Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers Dipak K. Dey, Ulysses Diva and Sudipto Banerjee Department of Statistics University of Connecticut, Storrs. March 16,

More information

THE INDIRECT EFFECT IN MULTIPLE MEDIATORS MODEL BY STRUCTURAL EQUATION MODELING ABSTRACT

THE INDIRECT EFFECT IN MULTIPLE MEDIATORS MODEL BY STRUCTURAL EQUATION MODELING ABSTRACT European Journal of Business, Economics and Accountancy Vol. 4, No. 3, 016 ISSN 056-6018 THE INDIRECT EFFECT IN MULTIPLE MEDIATORS MODEL BY STRUCTURAL EQUATION MODELING Li-Ju Chen Department of Business

More information

Bayesian Joint Modelling of Longitudinal and Survival Data of HIV/AIDS Patients: A Case Study at Bale Robe General Hospital, Ethiopia

Bayesian Joint Modelling of Longitudinal and Survival Data of HIV/AIDS Patients: A Case Study at Bale Robe General Hospital, Ethiopia American Journal of Theoretical and Applied Statistics 2017; 6(4): 182-190 http://www.sciencepublishinggroup.com/j/ajtas doi: 10.11648/j.ajtas.20170604.13 ISSN: 2326-8999 (Print); ISSN: 2326-9006 (Online)

More information

Studying the effect of change on change : a different viewpoint

Studying the effect of change on change : a different viewpoint Studying the effect of change on change : a different viewpoint Eyal Shahar Professor, Division of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health, University of Arizona

More information

Methods Research Report. An Empirical Assessment of Bivariate Methods for Meta-Analysis of Test Accuracy

Methods Research Report. An Empirical Assessment of Bivariate Methods for Meta-Analysis of Test Accuracy Methods Research Report An Empirical Assessment of Bivariate Methods for Meta-Analysis of Test Accuracy Methods Research Report An Empirical Assessment of Bivariate Methods for Meta-Analysis of Test Accuracy

More information

For general queries, contact

For general queries, contact Much of the work in Bayesian econometrics has focused on showing the value of Bayesian methods for parametric models (see, for example, Geweke (2005), Koop (2003), Li and Tobias (2011), and Rossi, Allenby,

More information

MEA DISCUSSION PAPERS

MEA DISCUSSION PAPERS Inference Problems under a Special Form of Heteroskedasticity Helmut Farbmacher, Heinrich Kögel 03-2015 MEA DISCUSSION PAPERS mea Amalienstr. 33_D-80799 Munich_Phone+49 89 38602-355_Fax +49 89 38602-390_www.mea.mpisoc.mpg.de

More information

Chapter 2 Interactions Between Socioeconomic Status and Components of Variation in Cognitive Ability

Chapter 2 Interactions Between Socioeconomic Status and Components of Variation in Cognitive Ability Chapter 2 Interactions Between Socioeconomic Status and Components of Variation in Cognitive Ability Eric Turkheimer and Erin E. Horn In 3, our lab published a paper demonstrating that the heritability

More information

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n. University of Groningen Latent instrumental variables Ebbes, P. IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

Bayesian Analysis of Between-Group Differences in Variance Components in Hierarchical Generalized Linear Models

Bayesian Analysis of Between-Group Differences in Variance Components in Hierarchical Generalized Linear Models Bayesian Analysis of Between-Group Differences in Variance Components in Hierarchical Generalized Linear Models Brady T. West Michigan Program in Survey Methodology, Institute for Social Research, 46 Thompson

More information

An Empirical Assessment of Bivariate Methods for Meta-analysis of Test Accuracy

An Empirical Assessment of Bivariate Methods for Meta-analysis of Test Accuracy Number XX An Empirical Assessment of Bivariate Methods for Meta-analysis of Test Accuracy Prepared for: Agency for Healthcare Research and Quality U.S. Department of Health and Human Services 54 Gaither

More information

We expand our previous deterministic power

We expand our previous deterministic power Power of the Classical Twin Design Revisited: II Detection of Common Environmental Variance Peter M. Visscher, 1 Scott Gordon, 1 and Michael C. Neale 1 Genetic Epidemiology, Queensland Institute of Medical

More information

Biostatistics II

Biostatistics II Biostatistics II 514-5509 Course Description: Modern multivariable statistical analysis based on the concept of generalized linear models. Includes linear, logistic, and Poisson regression, survival analysis,

More information

Introduction. Patrick Breheny. January 10. The meaning of probability The Bayesian approach Preview of MCMC methods

Introduction. Patrick Breheny. January 10. The meaning of probability The Bayesian approach Preview of MCMC methods Introduction Patrick Breheny January 10 Patrick Breheny BST 701: Bayesian Modeling in Biostatistics 1/25 Introductory example: Jane s twins Suppose you have a friend named Jane who is pregnant with twins

More information

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data Missing data Patrick Breheny April 3 Patrick Breheny BST 71: Bayesian Modeling in Biostatistics 1/39 Our final topic for the semester is missing data Missing data is very common in practice, and can occur

More information

MISSING DATA AND PARAMETERS ESTIMATES IN MULTIDIMENSIONAL ITEM RESPONSE MODELS. Federico Andreis, Pier Alda Ferrari *

MISSING DATA AND PARAMETERS ESTIMATES IN MULTIDIMENSIONAL ITEM RESPONSE MODELS. Federico Andreis, Pier Alda Ferrari * Electronic Journal of Applied Statistical Analysis EJASA (2012), Electron. J. App. Stat. Anal., Vol. 5, Issue 3, 431 437 e-issn 2070-5948, DOI 10.1285/i20705948v5n3p431 2012 Università del Salento http://siba-ese.unile.it/index.php/ejasa/index

More information

Behavioral genetics: The study of differences

Behavioral genetics: The study of differences University of Lethbridge Research Repository OPUS Faculty Research and Publications http://opus.uleth.ca Lalumière, Martin 2005 Behavioral genetics: The study of differences Lalumière, Martin L. Department

More information

Liability Threshold Models

Liability Threshold Models Liability Threshold Models Frühling Rijsdijk & Kate Morley Twin Workshop, Boulder Tuesday March 4 th 2008 Aims Introduce model fitting to categorical data Define liability and describe assumptions of the

More information

Impact and adjustment of selection bias. in the assessment of measurement equivalence

Impact and adjustment of selection bias. in the assessment of measurement equivalence Impact and adjustment of selection bias in the assessment of measurement equivalence Thomas Klausch, Joop Hox,& Barry Schouten Working Paper, Utrecht, December 2012 Corresponding author: Thomas Klausch,

More information

Graphical Modeling Approaches for Estimating Brain Networks

Graphical Modeling Approaches for Estimating Brain Networks Graphical Modeling Approaches for Estimating Brain Networks BIOS 516 Suprateek Kundu Department of Biostatistics Emory University. September 28, 2017 Introduction My research focuses on understanding how

More information

Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions

Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions J. Harvey a,b, & A.J. van der Merwe b a Centre for Statistical Consultation Department of Statistics

More information

With the rapid advances in molecular biology, the near

With the rapid advances in molecular biology, the near Theory and Practice in Quantitative Genetics Daniëlle Posthuma 1, A. Leo Beem 1, Eco J. C. de Geus 1, G. Caroline M. van Baal 1, Jacob B. von Hjelmborg 2, Ivan Iachine 3, and Dorret I. Boomsma 1 1 Department

More information

Multiple trait model combining random regressions for daily feed intake with single measured performance traits of growing pigs

Multiple trait model combining random regressions for daily feed intake with single measured performance traits of growing pigs Genet. Sel. Evol. 34 (2002) 61 81 61 INRA, EDP Sciences, 2002 DOI: 10.1051/gse:2001004 Original article Multiple trait model combining random regressions for daily feed intake with single measured performance

More information

Modeling Multitrial Free Recall with Unknown Rehearsal Times

Modeling Multitrial Free Recall with Unknown Rehearsal Times Modeling Multitrial Free Recall with Unknown Rehearsal Times James P. Pooley (jpooley@uci.edu) Michael D. Lee (mdlee@uci.edu) Department of Cognitive Sciences, University of California, Irvine Irvine,

More information

Genomics Research. May 31, Malvika Pillai

Genomics Research. May 31, Malvika Pillai Genomics Research May 31, 2018 Malvika Pillai Outline for Research Discussion Why Informatics Read journal articles! Example Paper Presentation Research Pipeline How To Read A Paper If I m aiming to just

More information

Ecological Statistics

Ecological Statistics A Primer of Ecological Statistics Second Edition Nicholas J. Gotelli University of Vermont Aaron M. Ellison Harvard Forest Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Brief Contents

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

Detection of Unknown Confounders. by Bayesian Confirmatory Factor Analysis

Detection of Unknown Confounders. by Bayesian Confirmatory Factor Analysis Advanced Studies in Medical Sciences, Vol. 1, 2013, no. 3, 143-156 HIKARI Ltd, www.m-hikari.com Detection of Unknown Confounders by Bayesian Confirmatory Factor Analysis Emil Kupek Department of Public

More information

An Exercise in Bayesian Econometric Analysis Probit and Linear Probability Models

An Exercise in Bayesian Econometric Analysis Probit and Linear Probability Models Utah State University DigitalCommons@USU All Graduate Plan B and other Reports Graduate Studies 5-1-2014 An Exercise in Bayesian Econometric Analysis Probit and Linear Probability Models Brooke Jeneane

More information

Bayesian Mediation Analysis

Bayesian Mediation Analysis Psychological Methods 2009, Vol. 14, No. 4, 301 322 2009 American Psychological Association 1082-989X/09/$12.00 DOI: 10.1037/a0016972 Bayesian Mediation Analysis Ying Yuan The University of Texas M. D.

More information

Simultaneous Equation and Instrumental Variable Models for Sexiness and Power/Status

Simultaneous Equation and Instrumental Variable Models for Sexiness and Power/Status Simultaneous Equation and Instrumental Variable Models for Seiness and Power/Status We would like ideally to determine whether power is indeed sey, or whether seiness is powerful. We here describe the

More information

Bayesian and Frequentist Approaches

Bayesian and Frequentist Approaches Bayesian and Frequentist Approaches G. Jogesh Babu Penn State University http://sites.stat.psu.edu/ babu http://astrostatistics.psu.edu All models are wrong But some are useful George E. P. Box (son-in-law

More information

Understanding. Regression Analysis

Understanding. Regression Analysis Understanding Regression Analysis Understanding Regression Analysis Michael Patrick Allen Washington State University Pullman, Washington Plenum Press New York and London Llbrary of Congress Cataloging-in-Publication

More information

How do we combine two treatment arm trials with multiple arms trials in IPD metaanalysis? An Illustration with College Drinking Interventions

How do we combine two treatment arm trials with multiple arms trials in IPD metaanalysis? An Illustration with College Drinking Interventions 1/29 How do we combine two treatment arm trials with multiple arms trials in IPD metaanalysis? An Illustration with College Drinking Interventions David Huh, PhD 1, Eun-Young Mun, PhD 2, & David C. Atkins,

More information

A test of quantitative genetic theory using Drosophila effects of inbreeding and rate of inbreeding on heritabilities and variance components #

A test of quantitative genetic theory using Drosophila effects of inbreeding and rate of inbreeding on heritabilities and variance components # Theatre Presentation in the Commision on Animal Genetics G2.7, EAAP 2005 Uppsala A test of quantitative genetic theory using Drosophila effects of inbreeding and rate of inbreeding on heritabilities and

More information

How few countries will do? Comparative survey analysis from a Bayesian perspective

How few countries will do? Comparative survey analysis from a Bayesian perspective Survey Research Methods (2012) Vol.6, No.2, pp. 87-93 ISSN 1864-3361 http://www.surveymethods.org European Survey Research Association How few countries will do? Comparative survey analysis from a Bayesian

More information

Model-based quantification of the relationship between age and anti-migraine therapy

Model-based quantification of the relationship between age and anti-migraine therapy 6 Model-based quantification of the relationship between age and anti-migraine therapy HJ Maas, M Danhof, OE Della Pasqua Submitted to BMC. Clin. Pharmacol. Migraine is a neurological disease that affects

More information

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are

More information

Remarks on Bayesian Control Charts

Remarks on Bayesian Control Charts Remarks on Bayesian Control Charts Amir Ahmadi-Javid * and Mohsen Ebadi Department of Industrial Engineering, Amirkabir University of Technology, Tehran, Iran * Corresponding author; email address: ahmadi_javid@aut.ac.ir

More information

WinBUGS : part 1. Bruno Boulanger Jonathan Jaeger Astrid Jullion Philippe Lambert. Gabriele, living with rheumatoid arthritis

WinBUGS : part 1. Bruno Boulanger Jonathan Jaeger Astrid Jullion Philippe Lambert. Gabriele, living with rheumatoid arthritis WinBUGS : part 1 Bruno Boulanger Jonathan Jaeger Astrid Jullion Philippe Lambert Gabriele, living with rheumatoid arthritis Agenda 2 Introduction to WinBUGS Exercice 1 : Normal with unknown mean and variance

More information

PSYCH-GA.2211/NEURL-GA.2201 Fall 2016 Mathematical Tools for Cognitive and Neural Science. Homework 5

PSYCH-GA.2211/NEURL-GA.2201 Fall 2016 Mathematical Tools for Cognitive and Neural Science. Homework 5 PSYCH-GA.2211/NEURL-GA.2201 Fall 2016 Mathematical Tools for Cognitive and Neural Science Homework 5 Due: 21 Dec 2016 (late homeworks penalized 10% per day) See the course web site for submission details.

More information

Numerical Integration of Bivariate Gaussian Distribution

Numerical Integration of Bivariate Gaussian Distribution Numerical Integration of Bivariate Gaussian Distribution S. H. Derakhshan and C. V. Deutsch The bivariate normal distribution arises in many geostatistical applications as most geostatistical techniques

More information

Applying Bayesian Ordinal Regression to ICAP Maladaptive Behavior Subscales

Applying Bayesian Ordinal Regression to ICAP Maladaptive Behavior Subscales Brigham Young University BYU ScholarsArchive All Theses and Dissertations 2007-10-25 Applying Bayesian Ordinal Regression to ICAP Maladaptive Behavior Subscales Edward P. Johnson Brigham Young University

More information

Path Analysis, SEM and the Classical Twin Model

Path Analysis, SEM and the Classical Twin Model Path Analysis, SEM and the Classical Twin Model Michael Neale & Frühling Rijsdijk 2 Virginia Institute for Psychiatric and Behavioral Genetics Virginia Commonwealth University 2 MRC SGDP Centre, Institute

More information

Reveal Relationships in Categorical Data

Reveal Relationships in Categorical Data SPSS Categories 15.0 Specifications Reveal Relationships in Categorical Data Unleash the full potential of your data through perceptual mapping, optimal scaling, preference scaling, and dimension reduction

More information

CHAPTER VI RESEARCH METHODOLOGY

CHAPTER VI RESEARCH METHODOLOGY CHAPTER VI RESEARCH METHODOLOGY 6.1 Research Design Research is an organized, systematic, data based, critical, objective, scientific inquiry or investigation into a specific problem, undertaken with the

More information

Estimating drug effects in the presence of placebo response: Causal inference using growth mixture modeling

Estimating drug effects in the presence of placebo response: Causal inference using growth mixture modeling STATISTICS IN MEDICINE Statist. Med. 2009; 28:3363 3385 Published online 3 September 2009 in Wiley InterScience (www.interscience.wiley.com).3721 Estimating drug effects in the presence of placebo response:

More information

Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002

Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002 DETAILED COURSE OUTLINE Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002 Hal Morgenstern, Ph.D. Department of Epidemiology UCLA School of Public Health Page 1 I. THE NATURE OF EPIDEMIOLOGIC

More information

Accommodating informative dropout and death: a joint modelling approach for longitudinal and semicompeting risks data

Accommodating informative dropout and death: a joint modelling approach for longitudinal and semicompeting risks data Appl. Statist. (2018) 67, Part 1, pp. 145 163 Accommodating informative dropout and death: a joint modelling approach for longitudinal and semicompeting risks data Qiuju Li and Li Su Medical Research Council

More information

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business Applied Medical Statistics Using SAS Geoff Der Brian S. Everitt CRC Press Taylor Si Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Croup, an informa business A

More information

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling Olli-Pekka Kauppila Daria Kautto Session VI, September 20 2017 Learning objectives 1. Get familiar with the basic idea

More information

Small-area estimation of mental illness prevalence for schools

Small-area estimation of mental illness prevalence for schools Small-area estimation of mental illness prevalence for schools Fan Li 1 Alan Zaslavsky 2 1 Department of Statistical Science Duke University 2 Department of Health Care Policy Harvard Medical School March

More information

Small Sample Bayesian Factor Analysis. PhUSE 2014 Paper SP03 Dirk Heerwegh

Small Sample Bayesian Factor Analysis. PhUSE 2014 Paper SP03 Dirk Heerwegh Small Sample Bayesian Factor Analysis PhUSE 2014 Paper SP03 Dirk Heerwegh Overview Factor analysis Maximum likelihood Bayes Simulation Studies Design Results Conclusions Factor Analysis (FA) Explain correlation

More information

Quantitative Trait Analysis in Sibling Pairs. Biostatistics 666

Quantitative Trait Analysis in Sibling Pairs. Biostatistics 666 Quantitative Trait Analsis in Sibling Pairs Biostatistics 666 Outline Likelihood function for bivariate data Incorporate genetic kinship coefficients Incorporate IBD probabilities The data Pairs of measurements

More information

Bayesian Hierarchical Models for Fitting Dose-Response Relationships

Bayesian Hierarchical Models for Fitting Dose-Response Relationships Bayesian Hierarchical Models for Fitting Dose-Response Relationships Ketra A. Schmitt Battelle Memorial Institute Mitchell J. Small and Kan Shao Carnegie Mellon University Dose Response Estimates using

More information

Identification of Tissue Independent Cancer Driver Genes

Identification of Tissue Independent Cancer Driver Genes Identification of Tissue Independent Cancer Driver Genes Alexandros Manolakos, Idoia Ochoa, Kartik Venkat Supervisor: Olivier Gevaert Abstract Identification of genomic patterns in tumors is an important

More information

Bayesian Inference Bayes Laplace

Bayesian Inference Bayes Laplace Bayesian Inference Bayes Laplace Course objective The aim of this course is to introduce the modern approach to Bayesian statistics, emphasizing the computational aspects and the differences between the

More information

NEW METHODS FOR SENSITIVITY TESTS OF EXPLOSIVE DEVICES

NEW METHODS FOR SENSITIVITY TESTS OF EXPLOSIVE DEVICES NEW METHODS FOR SENSITIVITY TESTS OF EXPLOSIVE DEVICES Amit Teller 1, David M. Steinberg 2, Lina Teper 1, Rotem Rozenblum 2, Liran Mendel 2, and Mordechai Jaeger 2 1 RAFAEL, POB 2250, Haifa, 3102102, Israel

More information

A COMPARISON OF BAYESIAN MCMC AND MARGINAL MAXIMUM LIKELIHOOD METHODS IN ESTIMATING THE ITEM PARAMETERS FOR THE 2PL IRT MODEL

A COMPARISON OF BAYESIAN MCMC AND MARGINAL MAXIMUM LIKELIHOOD METHODS IN ESTIMATING THE ITEM PARAMETERS FOR THE 2PL IRT MODEL International Journal of Innovative Management, Information & Production ISME Internationalc2010 ISSN 2185-5439 Volume 1, Number 1, December 2010 PP. 81-89 A COMPARISON OF BAYESIAN MCMC AND MARGINAL MAXIMUM

More information

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES 24 MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES In the previous chapter, simple linear regression was used when you have one independent variable and one dependent variable. This chapter

More information

Exploring the Influence of Particle Filter Parameters on Order Effects in Causal Learning

Exploring the Influence of Particle Filter Parameters on Order Effects in Causal Learning Exploring the Influence of Particle Filter Parameters on Order Effects in Causal Learning Joshua T. Abbott (joshua.abbott@berkeley.edu) Thomas L. Griffiths (tom griffiths@berkeley.edu) Department of Psychology,

More information

Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study

Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study Sampling Weights, Model Misspecification and Informative Sampling: A Simulation Study Marianne (Marnie) Bertolet Department of Statistics Carnegie Mellon University Abstract Linear mixed-effects (LME)

More information

Understandable Statistics

Understandable Statistics Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement

More information

Bayesian growth mixture models to distinguish hemoglobin value trajectories in blood donors

Bayesian growth mixture models to distinguish hemoglobin value trajectories in blood donors Bayesian growth mixture models to distinguish hemoglobin value trajectories in blood donors Kazem Nasserinejad 1 Joost van Rosmalen 1 Mireille Baart 2 Katja van den Hurk 2 Dimitris Rizopoulos 1 Emmanuel

More information

A Bayesian Approach to Measurement Error Problems in Epidemiology Using Conditional Independence Models

A Bayesian Approach to Measurement Error Problems in Epidemiology Using Conditional Independence Models American Journal of Epidemlotogy Vol 138, No 6 Copyright 1993 by The Johns Hopkins University School of Hygiene and Public Health Printed in U SA. All rights reserved A Bayesian Approach to Measurement

More information

Bayesian hierarchical modelling

Bayesian hierarchical modelling Bayesian hierarchical modelling Matthew Schofield Department of Mathematics and Statistics, University of Otago Bayesian hierarchical modelling Slide 1 What is a statistical model? A statistical model:

More information

Frequency of church attendance in Australia and the United States: models of family resemblance

Frequency of church attendance in Australia and the United States: models of family resemblance Twin Research (1999) 2, 99 107 1999 Stockton Press All rights reserved 1369 0523/99 $12.00 http://www.stockton-press.co.uk/tr Frequency of church attendance in Australia and the United States: models of

More information

Introduction to diagnostic accuracy meta-analysis. Yemisi Takwoingi October 2015

Introduction to diagnostic accuracy meta-analysis. Yemisi Takwoingi October 2015 Introduction to diagnostic accuracy meta-analysis Yemisi Takwoingi October 2015 Learning objectives To appreciate the concept underlying DTA meta-analytic approaches To know the Moses-Littenberg SROC method

More information

Abstract. Introduction A SIMULATION STUDY OF ESTIMATORS FOR RATES OF CHANGES IN LONGITUDINAL STUDIES WITH ATTRITION

Abstract. Introduction A SIMULATION STUDY OF ESTIMATORS FOR RATES OF CHANGES IN LONGITUDINAL STUDIES WITH ATTRITION A SIMULATION STUDY OF ESTIMATORS FOR RATES OF CHANGES IN LONGITUDINAL STUDIES WITH ATTRITION Fong Wang, Genentech Inc. Mary Lange, Immunex Corp. Abstract Many longitudinal studies and clinical trials are

More information

Introduction to Meta-analysis of Accuracy Data

Introduction to Meta-analysis of Accuracy Data Introduction to Meta-analysis of Accuracy Data Hans Reitsma MD, PhD Dept. of Clinical Epidemiology, Biostatistics & Bioinformatics Academic Medical Center - Amsterdam Continental European Support Unit

More information

Hierarchical Bayesian Methods for Estimation of Parameters in a Longitudinal HIV Dynamic System

Hierarchical Bayesian Methods for Estimation of Parameters in a Longitudinal HIV Dynamic System Hierarchical Bayesian Methods for Estimation of Parameters in a Longitudinal HIV Dynamic System Yangxin Huang, Dacheng Liu and Hulin Wu Department of Biostatistics & Computational Biology University of

More information

Discriminant Analysis with Categorical Data

Discriminant Analysis with Categorical Data - AW)a Discriminant Analysis with Categorical Data John E. Overall and J. Arthur Woodward The University of Texas Medical Branch, Galveston A method for studying relationships among groups in terms of

More information

SLAUGHTER PIG MARKETING MANAGEMENT: UTILIZATION OF HIGHLY BIASED HERD SPECIFIC DATA. Henrik Kure

SLAUGHTER PIG MARKETING MANAGEMENT: UTILIZATION OF HIGHLY BIASED HERD SPECIFIC DATA. Henrik Kure SLAUGHTER PIG MARKETING MANAGEMENT: UTILIZATION OF HIGHLY BIASED HERD SPECIFIC DATA Henrik Kure Dina, The Royal Veterinary and Agricuural University Bülowsvej 48 DK 1870 Frederiksberg C. kure@dina.kvl.dk

More information

Selecting a research method

Selecting a research method Selecting a research method Tomi Männistö 13.10.2005 Overview Theme Maturity of research (on a particular topic) and its reflection on appropriate method Validity level of research evidence Part I Story

More information

A Race Model of Perceptual Forced Choice Reaction Time

A Race Model of Perceptual Forced Choice Reaction Time A Race Model of Perceptual Forced Choice Reaction Time David E. Huber (dhuber@psyc.umd.edu) Department of Psychology, 1147 Biology/Psychology Building College Park, MD 2742 USA Denis Cousineau (Denis.Cousineau@UMontreal.CA)

More information