A Handbook of Statistical Analyses Using R. Brian S. Everitt and Torsten Hothorn

Similar documents
Regression so far... Lecture 22 - Logistic Regression. Odds. Recap of what you should know how to do... At this point we have covered: Sta102 / BME102

GENERALIZED ESTIMATING EQUATIONS FOR LONGITUDINAL DATA. Anti-Epileptic Drug Trial Timeline. Exploratory Data Analysis. Exploratory Data Analysis

Logistic regression. Department of Statistics, University of South Carolina. Stat 205: Elementary Statistics for the Biological and Life Sciences

Self-assessment test of prerequisite knowledge for Biostatistics III in R

Poisson regression. Dae-Jin Lee Basque Center for Applied Mathematics.

Analysis of bivariate binomial data: Twin analysis

4. STATA output of the analysis

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale.

OPCS coding (Office of Population, Censuses and Surveys Classification of Surgical Operations and Procedures) (4th revision)

Today Retrospective analysis of binomial response across two levels of a single factor.

Data Analysis in the Health Sciences. Final Exam 2010 EPIB 621

LOGLINK Example #1. SUDAAN Statements and Results Illustrated. Input Data Set(s): EPIL.SAS7bdat ( Thall and Vail (1990)) Example.

Regression models, R solution day7

Use of GEEs in STATA

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business

Notes for laboratory session 2

A Handbook of Statistical Analyses Using R. Brian S. Everitt and Torsten Hothorn

Generalized Estimating Equations for Depression Dose Regimes

Stats for Clinical Trials, Math 150 Jo Hardin Logistic Regression example: interaction & stepwise regression

Biostatistics II

SCHOOL OF MATHEMATICS AND STATISTICS

Content. Basic Statistics and Data Analysis for Health Researchers from Foreign Countries. Research question. Example Newly diagnosed Type 2 Diabetes

Normal Q Q. Residuals vs Fitted. Standardized residuals. Theoretical Quantiles. Fitted values. Scale Location 26. Residuals vs Leverage

STAT 503X Case Study 1: Restaurant Tipping

Daniel Boduszek University of Huddersfield

2. Scientific question: Determine whether there is a difference between boys and girls with respect to the distance and its change over time.

Practical Multivariate Analysis

1. Objective: analyzing CD4 counts data using GEE marginal model and random effects model. Demonstrate the analysis using SAS and STATA.

Age (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized)

Lecture 21. RNA-seq: Advanced analysis

Tutorial #7A: Latent Class Growth Model (# seizures)

In this module I provide a few illustrations of options within lavaan for handling various situations.

Two-phase designs in epidemiology

Today. HW 1: due February 4, pm. Matched case-control studies

Review and Wrap-up! ESP 178 Applied Research Methods Calvin Thigpen 3/14/17 Adapted from presentation by Prof. Susan Handy

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach

Data Analysis Using Regression and Multilevel/Hierarchical Models

Statistical reports Regression, 2010

How to analyze correlated and longitudinal data?

Regression analysis of mortality with respect to seasonal influenza in Sweden

Lab 2: Non-linear regression models

Determinants and Status of Vaccination in Bangladesh

Table. A [a] Multiply imputed. Outpu

5 To Invest or not to Invest? That is the Question.

Analyzing binary outcomes, going beyond logistic regression

Analytic Strategies for the OAI Data

ANALYSING COGNITIVE TEST DATA: DISTRIBUTIONS AND NON-PARAMETRIC RANDOM EFFECTS

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H

Chapter 13 Estimating the Modified Odds Ratio

Package frailtyhl. February 19, 2015

CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS

Clincial Biostatistics. Regression

One-Way Independent ANOVA

Math 215, Lab 7: 5/23/2007

Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations)

Meta-analysis using individual participant data: one-stage and two-stage approaches, and why they may differ

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012

Joint Modelling of Event Counts and Survival Times: Example Using Data from the MESS Trial

Report Perception Experiment

Name: emergency please discuss this with the exam proctor. 6. Vanderbilt s academic honor code applies.

Package HAP.ROR. R topics documented: February 19, 2015

Cancerclass: An R package for development and validation of diagnostic tests from high-dimensional molecular data

m 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers

Correlation and regression

An Examination of Factors Affecting Incidence and Survival in Respiratory Cancers. Katie Frank Roberto Perez Mentor: Dr. Kate Cowles.

Sample size and power calculations in Mendelian randomization with a single instrumental variable and a binary outcome

Kidane Tesfu Habtemariam, MASTAT, Principle of Stat Data Analysis Project work

Prediction Model For Risk Of Breast Cancer Considering Interaction Between The Risk Factors

FACULTY OF SCIENCES Master of Statistics: Biostatistics

What Else do Epileptic Data Reveal

Introduction to diagnostic accuracy meta-analysis. Yemisi Takwoingi October 2015

Modeling Binary outcome

Constructing a mixed model using the AIC

Dan Byrd UC Office of the President

Stat Wk 9: Hypothesis Tests and Analysis

Analysis of variance and regression. Other types of regression models

Part 8 Logistic Regression

Fitting discrete-data regression models in social science

EXECUTIVE SUMMARY DATA AND PROBLEM

Problem 1) Match the terms to their definitions. Every term is used exactly once. (In the real midterm, there are fewer terms).

A Handbook of Statistical Analyses using SAS

bivariate analysis: The statistical analysis of the relationship between two variables.

Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3

Lisa Yelland. BMa&CompSc (Hons)

A peer-reviewed version of this preprint was published in PeerJ on 23 May 2018.

Dr. Kelly Bradley Final Exam Summer {2 points} Name

Package xseq. R topics documented: September 11, 2015

IAPT: Regression. Regression analyses

Introduction. Overview

Answer to exercise: Growth of guinea pigs

Department of Statistics, Biostatistics & Informatics, University of Dhaka, Dhaka-1000, Bangladesh. Abstract

Package inctools. January 3, 2018

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

Modelling Research Productivity Using a Generalization of the Ordered Logistic Regression Model

Index. Springer International Publishing Switzerland 2017 T.J. Cleophas, A.H. Zwinderman, Modern Meta-Analysis, DOI /

Biostat/Stat 571: Coursework 5

THE UNIVERSITY OF OKLAHOMA HEALTH SCIENCES CENTER GRADUATE COLLEGE A COMPARISON OF STATISTICAL ANALYSIS MODELING APPROACHES FOR STEPPED-

Chapter 2 Organizing and Summarizing Data. Chapter 3 Numerically Summarizing Data. Chapter 4 Describing the Relation between Two Variables

Basic Biostatistics. Chapter 1. Content

Vessel wall differences between middle cerebral artery and basilar artery. plaques on magnetic resonance imaging

Transcription:

A Handbook of Statistical Analyses Using R Brian S. Everitt and Torsten Hothorn

CHAPTER 11 Analysing Longitudinal Data II Generalised Estimation Equations: Treating Respiratory Illness and Epileptic Seizures 11.1 Introduction 11.2 Generalised Estimating Equations 11.3 Analysis Using R 11.3.1 Beat the Blues Revisited To use the gee function, package gee (Carey et al., 2006) has to be installed and attached: R> library("gee") The gee function is used in a similar way to the lme function met in Chapter 10, with the addition of the features of the glm function that specify the appropriate error distribution for the response and the implied link function, and an argument to specify the structure of the working correlation matrix. Here we will fit an independence structure and then an exchangeable structure. The R code for fitting generalised estimation equations to the BtheB_long data (as constructed in Chapter 10, with idenity working correlation matrix is as follows (note that the gee function assumes the rows of the data.frame BtheB_long to be ordered with respect to subjects) R> osub <- order(as.integer(btheb_long$subject)) R> BtheB_long <- BtheB_long[osub,] R> btb_gee <- gee(bdi ~ bdi.pre + treatment + length + drug, + data = BtheB_long, id = subject, family = gaussian, + corstr = "independence") and with exchangeable correlation matrix R> btb_gee1 <- gee(bdi ~ bdi.pre + treatment + length + drug, + data = BtheB_long, id = subject, family = gaussian, + corstr = "exchangeable") The summary method can be used to inspect the fitted models; the results are shown in Figures 11.1 and 11.2 3

4 ANALYSING LONGITUDINAL DATA II R> summary(btb_gee) GEE: GENERALIZED LINEAR MODELS FOR DEPENDENT DATA gee S-function, version 4.13 modified 98/01/27 (1998) Model: Link: Identity Variance to Mean Relation: Gaussian Correlation Structure: Independent gee(formula = bdi ~ bdi.pre + treatment + length + drug, id = subject, data = BtheB_long, family = gaussian, corstr = "independence") Summary of Residuals: -21.6497810-5.8485100 0.1131663 5.5838383 28.1871039 Estimate Naive S.E. Naive z Robust S.E. (Intercept) 3.5686314 1.4833349 2.405816 2.26947617 bdi.pre 0.5818494 0.0563904 10.318235 0.09156455 treatmentbtheb -3.2372285 1.1295569-2.865928 1.77459534 length>6m 1.4577182 1.1380277 1.280916 1.48255866 drugyes -3.7412982 1.1766321-3.179667 1.78271179 Robust z (Intercept) 1.5724472 bdi.pre 6.3545274 treatmentbtheb -1.8242066 length>6m 0.9832449 drugyes -2.0986557 Estimated Scale Parameter: 79.25813 Number of Iterations: 1 Working Correlation [,1] [,2] [,3] [,4] [1,] 1 0 0 0 [2,] 0 1 0 0 [3,] 0 0 1 0 [4,] 0 0 0 1 Figure 11.1 R output of the summary method for the btb_gee model. 11.3.2 Respiratory Illness The baseline status, i.e., the status for month == 0, will enter the models as an explanatory variable and thus we have to rearrange the data.frame respiratory in order to create a new variable baseline: R> data("respiratory", package = "HSAUR") R> resp <- subset(respiratory, month > "0") R> resp$baseline <- rep(subset(respiratory, month == "0")$status, rep(4, 111)) R> resp$nstat <- as.numeric(resp$status == "good") The new variable nstat is simply a dummy coding for a poor respiratory status. Now we can use the data resp to fit a logistic regression model and GEE models with an independent and an exchangeable correlation structure as follows;

ANALYSIS USING R 5 R> summary(btb_gee1) GEE: GENERALIZED LINEAR MODELS FOR DEPENDENT DATA gee S-function, version 4.13 modified 98/01/27 (1998) Model: Link: Identity Variance to Mean Relation: Gaussian Correlation Structure: Exchangeable gee(formula = bdi ~ bdi.pre + treatment + length + drug, id = subject, data = BtheB_long, family = gaussian, corstr = "exchangeable") Summary of Residuals: -23.955980-6.643864-1.109741 4.257688 25.452310 Estimate Naive S.E. Naive z Robust S.E. (Intercept) 3.0231602 2.30390185 1.31219140 2.23204410 bdi.pre 0.6479276 0.08228567 7.87412417 0.08351405 treatmentbtheb -2.1692863 1.76642861-1.22806339 1.73614385 length>6m -0.1112910 1.73091679-0.06429596 1.55092705 drugyes -2.9995608 1.82569913-1.64296559 1.73155411 Robust z (Intercept) 1.3544357 bdi.pre 7.7583066 treatmentbtheb -1.2494854 length>6m -0.0717577 drugyes -1.7322940 Estimated Scale Parameter: 81.7349 Number of Iterations: 5 Working Correlation [,1] [,2] [,3] [,4] [1,] 1.0000000 0.6757951 0.6757951 0.6757951 [2,] 0.6757951 1.0000000 0.6757951 0.6757951 [3,] 0.6757951 0.6757951 1.0000000 0.6757951 [4,] 0.6757951 0.6757951 0.6757951 1.0000000 Figure 11.2 R output of the summary method for the btb_gee1 model. R> resp_glm <- glm(status ~ centre + treatment + sex + baseline + + age, data = resp, family = "binomial") R> resp_gee1 <- gee(nstat ~ centre + treatment + sex + baseline + + age, data = resp, family = "binomial", id = subject, + corstr = "independence", scale.fix = TRUE, scale.value = 1) R> resp_gee2 <- gee(nstat ~ centre + treatment + sex + baseline + + age, data = resp, family = "binomial", id = subject, + corstr = "exchangeable", scale.fix = TRUE, scale.value = 1)

6 ANALYSING LONGITUDINAL DATA II R> summary(resp_glm) glm(formula = status ~ centre + treatment + sex + baseline + age, family = "binomial", data = resp) Deviance Residuals: -2.3146-0.8551 0.4336 0.8953 1.9246 Estimate Std. Error z value Pr(> z ) (Intercept) -0.900171 0.337653-2.666 0.00768 ** centre2 0.671601 0.239567 2.803 0.00506 ** treatmenttreatment 1.299216 0.236841 5.486 4.12e-08 *** sexmale 0.119244 0.294671 0.405 0.68572 baselinegood 1.882029 0.241290 7.800 6.20e-15 *** age -0.018166 0.008864-2.049 0.04043 * --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 (Dispersion parameter for binomial family taken to be 1) Null deviance: 608.93 on 443 degrees of freedom Residual deviance: 483.22 on 438 degrees of freedom AIC: 495.22 Number of Fisher Scoring iterations: 4 Figure 11.3 R output of the summary method for the resp_glm model.

ANALYSIS USING R 7 R> summary(resp_gee1) GEE: GENERALIZED LINEAR MODELS FOR DEPENDENT DATA gee S-function, version 4.13 modified 98/01/27 (1998) Model: Link: Logit Variance to Mean Relation: Binomial Correlation Structure: Independent gee(formula = nstat ~ centre + treatment + sex + baseline + age, id = subject, data = resp, family = "binomial", corstr = "independence", scale.fix = TRUE, scale.value = 1) Summary of Residuals: -0.93134415-0.30623174 0.08973552 0.33018952 0.84307712 Estimate Naive S.E. Naive z (Intercept) -0.90017133 0.337653052-2.665965 centre2 0.67160098 0.239566599 2.803400 treatmenttreatment 1.29921589 0.236841017 5.485603 sexmale 0.11924365 0.294671045 0.404667 baselinegood 1.88202860 0.241290221 7.799854 age -0.01816588 0.008864403-2.049306 Robust S.E. Robust z (Intercept) 0.46032700-1.9555041 centre2 0.35681913 1.8821889 treatmenttreatment 0.35077797 3.7038127 sexmale 0.44320235 0.2690501 baselinegood 0.35005152 5.3764332 age 0.01300426-1.3969169 Estimated Scale Parameter: 1 Number of Iterations: 1 Working Correlation [,1] [,2] [,3] [,4] [1,] 1 0 0 0 [2,] 0 1 0 0 [3,] 0 0 1 0 [4,] 0 0 0 1 Figure 11.4 R output of the summary method for the resp_gee1 model.

8 ANALYSING LONGITUDINAL DATA II R> summary(resp_gee2) GEE: GENERALIZED LINEAR MODELS FOR DEPENDENT DATA gee S-function, version 4.13 modified 98/01/27 (1998) Model: Link: Logit Variance to Mean Relation: Binomial Correlation Structure: Exchangeable gee(formula = nstat ~ centre + treatment + sex + baseline + age, id = subject, data = resp, family = "binomial", corstr = "exchangeable", scale.fix = TRUE, scale.value = 1) Summary of Residuals: -0.93134415-0.30623174 0.08973552 0.33018952 0.84307712 Estimate Naive S.E. Naive z (Intercept) -0.90017133 0.4784634-1.8813796 centre2 0.67160098 0.3394723 1.9783676 treatmenttreatment 1.29921589 0.3356101 3.8712064 sexmale 0.11924365 0.4175568 0.2855747 baselinegood 1.88202860 0.3419147 5.5043802 age -0.01816588 0.0125611-1.4462014 Robust S.E. Robust z (Intercept) 0.46032700-1.9555041 centre2 0.35681913 1.8821889 treatmenttreatment 0.35077797 3.7038127 sexmale 0.44320235 0.2690501 baselinegood 0.35005152 5.3764332 age 0.01300426-1.3969169 Estimated Scale Parameter: 1 Number of Iterations: 1 Working Correlation [,1] [,2] [,3] [,4] [1,] 1.0000000 0.3359883 0.3359883 0.3359883 [2,] 0.3359883 1.0000000 0.3359883 0.3359883 [3,] 0.3359883 0.3359883 1.0000000 0.3359883 [4,] 0.3359883 0.3359883 0.3359883 1.0000000 Figure 11.5 R output of the summary method for the resp_gee2 model.

ANALYSIS USING R 9 The estimated treatment effect taken from the exchangeable structure GEE model is 1.299 which, using the robust standard errors, has an associated 95% confidence interval R> se <- summary(resp_gee2)$coefficients["treatmenttreatment", + "Robust S.E."] R> coef(resp_gee2)["treatmenttreatment"] + + c(-1, 1) * se * qnorm(0.975) [1] 0.6117037 1.9867281 These values reflect effects on the log-odds scale. Interpretation becomes simpler if we exponentiate the values to get the effects in terms of odds. This gives a treatment effect of 3.666 and a 95% confidence interval of R> exp(coef(resp_gee2)["treatmenttreatment"] + + c(-1, 1) * se * qnorm(0.975)) [1] 1.843570 7.291637 The odds of achieving a good respiratory status with the active treatment is between about twice and seven times the corresponding odds for the placebo. 11.3.3 Epilepsy Moving on to the count data in epilepsy from Table??, we begin by calculating the means and variances of the number of seizures for all treatment / period interactions R> data("epilepsy", package = "HSAUR") R> itp <- interaction(epilepsy$treatment, epilepsy$period) R> tapply(epilepsy$seizure.rate, itp, mean) placebo.1 Progabide.1 placebo.2 Progabide.2 placebo.3 9.357143 8.580645 8.285714 8.419355 8.785714 Progabide.3 placebo.4 Progabide.4 8.129032 7.964286 6.709677 R> tapply(epilepsy$seizure.rate, itp, var) placebo.1 Progabide.1 placebo.2 Progabide.2 placebo.3 102.75661 332.71828 66.65608 140.65161 215.28571 Progabide.3 placebo.4 Progabide.4 193.04946 58.18386 126.87957 Some of the variances are considerably larger than the corresponding means, which for a Poisson variable may suggest that overdispersion may be a problem, see Chapter??. We can now fit a Poisson regression model to the data assuming independence using the glm function. We also use the GEE approach to fit an independence structure, followed by an exchangeable structure using the following R code:

10 ANALYSING LONGITUDINAL DATA II R> layout(matrix(1:2, nrow = 1)) R> ylim <- range(epilepsy$seizure.rate) R> placebo <- subset(epilepsy, treatment == "placebo") R> progabide <- subset(epilepsy, treatment == "Progabide") R> boxplot(seizure.rate ~ period, data = placebo, + ylab = "Number of seizures", + xlab = "Period", ylim = ylim, main = "Placebo") R> boxplot(seizure.rate ~ period, data = progabide, + main = "Progabide", ylab = "Number of seizures", + xlab = "Period", ylim = ylim) Placebo Progabide Number of seizures 0 20 40 60 80 100 Number of seizures 0 20 40 60 80 100 1 2 3 4 Period 1 2 3 4 Period Figure 11.6 Boxplots of numbers of seizures in each two-week period post randomisation for placebo and active treatments. R> per <- rep(log(2),nrow(epilepsy)) R> epilepsy$period <- as.numeric(epilepsy$period) R> fm <- seizure.rate ~ base + age + treatment + offset(per) R> epilepsy_glm <- glm(fm, data = epilepsy, family = "poisson") R> epilepsy_gee1 <- gee(fm, data = epilepsy, family = "poisson", + id = subject, corstr = "independence", scale.fix = TRUE, + scale.value = 1) R> epilepsy_gee2 <- gee(fm, data = epilepsy, family = "poisson", + id = subject, corstr = "exchangeable", scale.fix = TRUE, + scale.value = 1) R> epilepsy_gee3 <- gee(fm, data = epilepsy, family = "poisson",

ANALYSIS USING R 11 R> layout(matrix(1:2, nrow = 1)) R> ylim <- range(log(epilepsy$seizure.rate + 1)) R> boxplot(log(seizure.rate + 1) ~ period, data = placebo, + main = "Placebo", ylab = "Log number of seizures", + xlab = "Period", ylim = ylim) R> boxplot(log(seizure.rate + 1) ~ period, data = progabide, + main = "Progabide", ylab = "Log number of seizures", + xlab = "Period", ylim = ylim) Placebo Progabide Log number of seizures 0 1 2 3 4 Log number of seizures 0 1 2 3 4 1 2 3 4 Period 1 2 3 4 Period Figure 11.7 Boxplots of log of numbers of seizures in each two-week period post randomisation for placebo and active treatments. + id = subject, corstr = "exchangeable", scale.fix = FALSE, + scale.value = 1) As usual we inspect the fitted models using the summary method, the results are given in Figures 11.8, 11.9, 11.10, and 11.11.

12 ANALYSING LONGITUDINAL DATA II R> summary(epilepsy_glm) glm(formula = fm, family = "poisson", data = epilepsy) Deviance Residuals: -4.4360-1.4034-0.5029 0.4842 12.3223 Estimate Std. Error z value Pr(> z ) (Intercept) -0.1306156 0.1356191-0.963 0.3355 base 0.0226517 0.0005093 44.476 < 2e-16 *** age 0.0227401 0.0040240 5.651 1.59e-08 *** treatmentprogabide -0.1527009 0.0478051-3.194 0.0014 ** --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 (Dispersion parameter for poisson family taken to be 1) Null deviance: 2521.75 on 235 degrees of freedom Residual deviance: 958.46 on 232 degrees of freedom AIC: 1732.5 Number of Fisher Scoring iterations: 5 Figure 11.8 R output of the summary method for the epilepsy_glm model.

ANALYSIS USING R 13 R> summary(epilepsy_gee1) GEE: GENERALIZED LINEAR MODELS FOR DEPENDENT DATA gee S-function, version 4.13 modified 98/01/27 (1998) Model: Link: Logarithm Variance to Mean Relation: Poisson Correlation Structure: Independent gee(formula = fm, id = subject, data = epilepsy, family = "poisson", corstr = "independence", scale.fix = TRUE, scale.value = 1) Summary of Residuals: -4.9195387 0.1808059 1.7073405 4.8850644 69.9658560 Estimate Naive S.E. Naive z (Intercept) -0.13061561 0.1356191185-0.9631062 base 0.02265174 0.0005093011 44.4761250 age 0.02274013 0.0040239970 5.6511312 treatmentprogabide -0.15270095 0.0478051054-3.1942393 Robust S.E. Robust z (Intercept) 0.365148155-0.3577058 base 0.001235664 18.3316325 age 0.011580405 1.9636736 treatmentprogabide 0.171108915-0.8924196 Estimated Scale Parameter: 1 Number of Iterations: 1 Working Correlation [,1] [,2] [,3] [,4] [1,] 1 0 0 0 [2,] 0 1 0 0 [3,] 0 0 1 0 [4,] 0 0 0 1 Figure 11.9 R output of the summary method for the epilepsy_gee1 model.

14 ANALYSING LONGITUDINAL DATA II R> summary(epilepsy_gee2) GEE: GENERALIZED LINEAR MODELS FOR DEPENDENT DATA gee S-function, version 4.13 modified 98/01/27 (1998) Model: Link: Logarithm Variance to Mean Relation: Poisson Correlation Structure: Exchangeable gee(formula = fm, id = subject, data = epilepsy, family = "poisson", corstr = "exchangeable", scale.fix = TRUE, scale.value = 1) Summary of Residuals: -4.9195387 0.1808059 1.7073405 4.8850644 69.9658560 Estimate Naive S.E. Naive z (Intercept) -0.13061561 0.2004416507-0.6516391 base 0.02265174 0.0007527342 30.0926122 age 0.02274013 0.0059473665 3.8235638 treatmentprogabide -0.15270095 0.0706547450-2.1612270 Robust S.E. Robust z (Intercept) 0.365148155-0.3577058 base 0.001235664 18.3316325 age 0.011580405 1.9636736 treatmentprogabide 0.171108915-0.8924196 Estimated Scale Parameter: 1 Number of Iterations: 1 Working Correlation [,1] [,2] [,3] [,4] [1,] 1.0000000 0.3948033 0.3948033 0.3948033 [2,] 0.3948033 1.0000000 0.3948033 0.3948033 [3,] 0.3948033 0.3948033 1.0000000 0.3948033 [4,] 0.3948033 0.3948033 0.3948033 1.0000000 Figure 11.10 R output of the summary method for the epilepsy_gee2 model.

ANALYSIS USING R 15 R> summary(epilepsy_gee3) GEE: GENERALIZED LINEAR MODELS FOR DEPENDENT DATA gee S-function, version 4.13 modified 98/01/27 (1998) Model: Link: Logarithm Variance to Mean Relation: Poisson Correlation Structure: Exchangeable gee(formula = fm, id = subject, data = epilepsy, family = "poisson", corstr = "exchangeable", scale.fix = FALSE, scale.value = 1) Summary of Residuals: -4.9195387 0.1808059 1.7073405 4.8850644 69.9658560 Estimate Naive S.E. Naive z (Intercept) -0.13061561 0.45219954-0.2888451 base 0.02265174 0.00169818 13.3388301 age 0.02274013 0.01341735 1.6948302 treatmentprogabide -0.15270095 0.15939823-0.9579840 Robust S.E. Robust z (Intercept) 0.365148155-0.3577058 base 0.001235664 18.3316325 age 0.011580405 1.9636736 treatmentprogabide 0.171108915-0.8924196 Estimated Scale Parameter: 5.089608 Number of Iterations: 1 Working Correlation [,1] [,2] [,3] [,4] [1,] 1.0000000 0.3948033 0.3948033 0.3948033 [2,] 0.3948033 1.0000000 0.3948033 0.3948033 [3,] 0.3948033 0.3948033 1.0000000 0.3948033 [4,] 0.3948033 0.3948033 0.3948033 1.0000000 Figure 11.11 R output of the summary method for the epilepsy_gee3 model.

Bibliography Carey, V. J., Lumley, T., and Ripley, B. D.(2006), gee: Generalized Estimation Equation Solver, URL http://cran.r-project.org, R package version 4.13-13.