REPRODUCTIVE ENDOCRINOLOGY

Size: px
Start display at page:

Download "REPRODUCTIVE ENDOCRINOLOGY"

Transcription

1 FERTILITY AND STERILITY VOL. 74, NO. 2, AUGUST 2000 Copyright 2000 American Society for Reproductive Medicine Published by Elsevier Science Inc. Printed on acid-free paper in U.S.A. REPRODUCTIVE ENDOCRINOLOGY The alternating-sequence design (or multiple-period crossover) trial for evaluating treatment efficacy in infertility Geoffrey R. Norman, Ph.D., a and Salim Daya, M.B., Ch.B., M.Sc. a,b McMaster University, Hamilton, Ontario, Canada Objective: To determine whether a constant-sequence or an alternating-sequence design is better for the evaluation of infertility treatment efficacy when multiple cycles of treatment are undertaken. Design: A simulation exercise using analytical methods. Setting: University medical center. Patient(s): A hypothetical, heterogeneous population of infertile patients participating in a randomized trial comparing an experimental treatment, with effectiveness of 2.0, to no treatment. Intervention(s): Comparison of a constant-sequence design in which the subject receives the same intervention or the alternating-sequence design in which experimental and control treatments are crossed over after each successive cycle. Main Outcome Measure(s): Relative risks of pregnancy per cycle and overall after a maximum of five cycles of treatment. Result(s): With both designs, the pregnancy rates in experimental and control groups showed a consistent decrease with each successive cycle. The overall effectiveness in the constant-sequence design was underestimated at 1.83, whereas in the alternating-sequence design it was overestimated at However, by restricting the analysis in the latter design only to the odd-numbered cycles, the relative risk was precisely correct at Conclusion(s): When multiple cycles of treatment are undertaken to evaluate the efficacy of infertility therapy, the alternating-sequence design with restriction of the analysis to only the odd-numbered treatment cycles provides an unbiased estimation of the treatment effect. (Fertil Steril 2000;74: by American Society for Reproductive Medicine.) Key Words: Infertility, efficacy, randomized controlled trial, constant-sequence design, alternating-sequence design Received October 20, 1999; revised and accepted January 31, Reprint requests: Salim Daya, M.B., Ch.B., M.Sc., Department of Obstetrics and Gynaecology, McMaster University, 1200 Main Street West, Hamilton, Ontario, Canada L8N 3Z5 (FAX: ; dayas@fhs.csu.mcmaster.ca). a Department of Clinical Epidemiology and Biostatistics. b Department of Obstetrics and Gynaecology /00/$20.00 PII S (00) The rapid development in assisted reproductive technology has provided infertile couples with a variety of therapeutic options to achieve pregnancy. Together with this change has been a shift in the focus of reporting success rates of treatment from a per-patient basis to a percycle basis. This change has made the evaluation of treatment efficacy more complicated because of variability in the numbers of cycles of treatment per patient and in the length of time patients may have to wait between successive cycles of treatment. In the past, the pregnancy rates with treatments, such as tuboplasty, donor insemination, and ovulation induction, were compared with those in a control group (often receiving no treatment) at a specified time period (e.g., 1 year after initiating treatment). With this approach, patients in the experimental group were exposed to a single type of intervention. Today, the situation is different because the approach is to offer several cycles of treatment with assisted reproductive technology (ART), the nature of which may vary from cycle to cycle, and to evaluate the outcome per treatment cycle. In the one-cycle, randomized, paralleldesign trial, the experimental treatment is administered in one cycle in one group of patients, and the control treatment is administered in one cycle in a second group of patients. This design permits comparison of pregnancy rates between patients and is the best design for evaluating treatment efficacy, but a large sample size is usually required because, in general, 319

2 small differences (e.g., 5% 10%) in pregnancy rates are expected. Furthermore, compliance may be more difficult to achieve, especially in patients allocated to the control group. In contrast, the two-period (or two-cycle) crossover design is becoming more popular among clinical investigators and may be more appealing to patients because they have the opportunity to receive the experimental treatment in a second cycle if they had received the control treatment in the first cycle. In this way, all patients would have access to the experimental treatment, and the sample size would be much smaller than in the parallel design because a within-patient comparison is undertaken. However, a major problem with this design is that women who become pregnant in the first period will be classified as dropouts from the second period, thereby not permitting a within-patient comparison (1 6). In addition, the likelihood of becoming pregnant may not be constant from one period to the next because women who fail to conceive in the first period may have a lower probability of success in the second period. Consequently, by pooling the data obtained over the two study periods, a larger estimate of the effect of treatment is obtained than that obtained with a one-cycle parallel design trial (7). This issue of possible bias resulting from study design has been explored further with computer simulation and extending the number of cycles per patient to six (8). With use of a constant-treatment sequence design (in which a parallel group design is used but patients undertake repeated cycles of the same treatment within each group), the results were observed to be about the same as those with an alternatingsequence design (in which experimental and control treatments are crossed over after each successive cycle). Although the pregnancy rates in the two designs using the computer simulation seem close, it is difficult to know from the simulation whether they are the same or different. The computed odds ratios (ORs) in the cycles of the constant-sequence arm ranged from 1.76 to 2.25, whereas in the alternating-sequence arm, the ORs ranged from 2.18 to Thus, there was considerable variability in the results, such that small but systematic differences may have been obscured. This variability is a natural consequence of the statistical simulation technique, in which, even with a starting set of 2,000 observations, individual data may vary considerably, simply from sampling variation. An alternative approach is to use a somewhat simpler form of simulation in which results are computed directly from assumed probability distributions. The objective of this study was to undertake such a simulation exercise, with parameters chosen so that real differences would emerge, but with a form sufficiently simplified that analytical methods could be used to solve the question of whether a constantsequence design or an alternating-sequence design is better for the evaluation of efficacy of infertility therapy when multiple cycles of treatment are undertaken. MATERIALS AND METHODS We conducted an analytical simulation starting with the assumption that we are dealing with an experimental treatment with effectiveness of 2.0. The heterogeneity of the sample was modeled in a simple manner by assuming that each cohort consisted of two subpopulations: 20% of the sample consisted of fertile women who have a spontaneous fecundity rate of 40% per cycle, and 80% of the sample consisted of relatively less fertile couples with a spontaneous fecundity rate of 10% per cycle. Thus, based on the effectiveness of 2.0, the fecundity rates in the experimental arm of the model would be 80% (from the fertile 20% of the sample) and 20% (from the remaining, relatively less fertile, 80% of the sample). In addition, for convenience, the starting sample size in each arm was set at 1,000 couples who will each undergo up to a maximum of five cycles of treatment. Although the fecundity rates were chosen to be unrealistically high to make the effects clear, the issue of varying the control group pregnancy rates was also evaluated with use of algebra (as indicated below). The analysis was directed at determining the estimated effectiveness of the experimental treatment over successive cycles to identify whether the presumed bias of both the constant-sequence and alternating-sequence designs are present. In this analysis, the effectiveness of treatment was assessed with use of the relative risk (RR) (i.e., risk of pregnancy in the experimental group/risk of pregnancy in the control group) rather than the OR, because the RR is independent of prevalence, whereas the OR is affected by the overall prevalence and would be expected to become smaller with successive cycles. The results were computed with a handheld calculator with use of a simple approach. For the first treatment cycle, the number of pregnancies was calculated in the two subgroups ( in the control group and in the experimental group). These numbers were then subtracted from the starting population sizes to give the new populations for the second cycle (i.e., 120 and 720, respectively, in the control group, and 40 and 640, respectively, in the experimental group). For the constant-sequence design study, the number of pregnancies in the new cohorts were then calculated as ( ) 120 for the control group and ( ) 160 for the experimental group. In the alternating-sequence design study, the populations were crossed over (i.e., in the second cycle the experimental group now receives the control treatment, whereas the control group now receives the experimental treatment). The calculation was then repeated so that the number of pregnancies in the new experimental group was now ( ) 96, and for the new control group it was ( ) 80. This process was iterated for five cycles. For each design, the RR for pregnancy was then calculated per cycle and overall. 320 Norman and Daya The alternating-sequence design trial Vol. 74, No. 2, August 2000

3 FIGURE 1 Relative risk of pregnancy using the constant-sequence design trial. The assumption is that each sample is made up of 20% with a spontaneous fecundity of 40% per cycle and 80% with a spontaneous fecundity of 10% per cycle, and treatment has an effectiveness of 2.0. RR (overall) The results are shown in Figure 1 for the constant-sequence design and in Figure 2 for the alternating-sequence design. In the former design, the proportion of women with high fertility decreases in both arms, but at a much higher rate in the experimental arm, so that after five cycles, although approximately 26 patients remain in the control arm, almost none remains in the experimental arm. Consequently, the pregnancy rate in each group shows a consistent decrease from the first to the last cycle. However, as the numbers of couples in the experimental group diminish, the pregnancy rate asymptotes to 0.20, whereas the rate in the control group continues to fall (Table 1). As a result, the effectiveness (i.e., pregnancy rate in experimental group/pregnancy rate in control group) is at a minimum with the third cycle and increases slightly thereafter. The overall computed effectiveness is 1.83, somewhat smaller than the true value of The alternating-sequence design study shows a different picture. As expected, in the second cycle, the experimental group of patients now contains a relatively higher proportion of fertile couples, so that the effectiveness of the experimental treatment is overestimated at 2.42 (Table 2). However, in the third cycle, the crossover has removed the excess of fertile women from the new control group, so the proportion of fertile couples is exactly the same in both groups. The calculated effectiveness is now precisely correct at In cycle four, the new experimental group again has a relative excess of fertile women, and the calculated effectiveness is high at In cycle five, however, the numbers of fertile and infertile couples in the two groups again are exactly the same, and the calculated effectiveness, once more, is precisely correct at Overall, the effectiveness of the treatment in the alternating-sequence design trial is It is important to note that these results are based on two basic assumptions only: [1] There is no carryover effect from one cycle to the next. That is, the effect of the drug acts only on the cycle in which it is taken, and does not continue into subsequent cycles. [2] The distribution of fertility takes a particularly simple form, consisting of two subpopulations with different fixed fertility rates. Although we have used rates of spontaneous fertility (10% and 40%) that are unrealistically high and a drug FIGURE 2 Relative risk of pregnancy using the alternating-sequence design trial. The assumption is that each sample is made up of 20% with a spontaneous fecundity of 40% per cycle and 80% with a spontaneous fecundity of 10% per cycle, and treatment has an effectiveness of 2.0. RR (overall) RESULTS FERTILITY & STERILITY 321

4 TABLE 1 Pregnancy rate per cycle in the constant-sequence design trial. Cycle number Experimental group Pregnancy risk a Control group Relative risk 1 320/1000 (0.320) 160/1000 (0.160) /680 (0.235) 120/840 (0.143) /520 (0.209) 93.6/720 (0.130) /411.2 (0.202) 75.6/626.4 (0.121) /328 (0.200) 62.8/550.8 (0.114) 1.76 Overall 737.8/ (0.251) 512/ (0.137) 1.83 a Pregnancy risk refers to the number of subjects becoming pregnant out of the total number receiving the experimental or control interventions, and the proportions are indicated in parentheses. efficacy that perhaps is similarly unrealistic, the findings of the simulation related to the unbiased nature of the odd-cycle alternating sequence design do not depend on the values chosen. The high values are used simply to make the results more obvious; the results are the same with any arbitrary set of starting parameters. Furthermore, the assumptions of a constant drug efficacy and a simple distribution of fertility are not necessary. In the Appendix, we show algebraically that, for any distribution of fertility p(f), where F is a number between 0 and 1 and p(f) expresses the proportion of women in a sample with a particular level of fertility, and for any value of drug efficacy, (F), where the efficacy is a function of fertility, the outcome rates in the odd cycles in an alternating sequence are unbiased. In particular, we might presume clinically that the efficacy of the drug would be reduced in high fertility women; this algebraic extension of the basic concept demonstrates that the results will hold true regardless of the relationship between efficacy and fertility. One other issue is whether the possibility of bias justifies rejecting up to half the data. To assess this issue, we recalculated the results with use of the alternating-sequence design after assuming more conservative estimates of fertility (i.e., 2.5% and 10%) in the two subgroups. Under these circumstances, the odd-numbered cycles continued to provide unbiased estimates of efficacy; the even-numbered cycles yielded estimates that were biased upward, but to a lesser degree 2.10 for cycle 2 and 2.05 for cycle 4. The only remaining assumption is the one of lack of carryover effect. If there was a carryover effect, such that the effect of the drug continued beyond the cycle in which it was administered, then the effect would be to increase fertility in the control group at each cycle, resulting in a conservative estimate of treatment effectiveness. Returning to the original, simplified, model, we assumed a carryover effect of 20% (i.e., the fertility rate in the control subsamples would increase from 10% and 40% to 12% and 48%, respectively) and recalculated the results. The calculation yielded an estimate of 1.99 for all cycles and 1.82 for the odd cycles. It is tempting to assume that, in situations in which it is presumed there is a carryover effect, one should use an alternating-sequence design and include all cycles in the analysis. However, the computed efficacy of 1.99 in this situation results from two competing biases that will not cancel out in all circumstances. DISCUSSION This small analytical simulation has led to a remarkable conclusion. When multiple cycles of treatment are undertaken to evaluate the efficacy of infertility therapy, neither the constant-sequence design (in which the experimental group receives the same intervention in successive cycles) nor the alternating-sequence design (in which experimental and control interventions are crossed over at the end of each successive cycle) provides the correct estimate of the true treatment effect. Both designs suffer from systematic bias due to confounding of treatment effect and dropouts that result from pregnancy. However, the use of the alternatingsequence design with restriction of the analysis to only the odd-numbered treatment cycles provides an unbiased estimation of the treatment effect. Although it is tempting to presume that this unexpected finding may represent an idiosyncracy of the simple model, there is every reason to believe that the findings are generalizable to situations in which more than two cycles of treatment are undertaken. The simple reason for this belief is that, in general, after two cycles the increased pregnancy rate of the experimental treatment has been applied to both experimental and control groups so that the distributions are equivalent regarding fertility potential. The analysis we performed in the Appendix shows that the finding is robust over TABLE 2 Pregnancy rate per cycle in the alternating-sequence design trial. Cycle number Experimental group Pregnancy risk a Control group Relative risk 1 320/1000 (0.320) 160/1000 (0.160) /840 (0.286) 80/680 (0.118) /600 (0.224) 67.2/600 (0.112) /532.8 (0.216) 48/465.6 (0.103) /417.6 (0.204) 42.7/417.6 (0.102) 2.00 Overall 894.5/ (0.260) 397.9/ (0.126) 2.06 a Pregnancy risk refers to the number of subjects becoming pregnant out of the total number receiving the experimental or control interventions, and the proportions are indicated in parentheses. 322 Norman and Daya The alternating-sequence design trial Vol. 74, No. 2, August 2000

5 any estimate of the distribution of fertility and treatment effectiveness. Moreover, the results we obtained are remarkably similar to those of Cohlen s more elaborate model (8). In addition, their computer simulation model shows exactly the same pattern we predicted, with even-numbered cycles producing ORs that are systematically higher than those of odd-numbered cycles. In fact, the effectiveness calculated from the odd-numbered cycles of their model average remarkably close to the true value (i.e., cycle 1, 20/ ; cycle 3, 17.9/ ; and cycle 5, 16.3/ ), just as was the case with our simplified model. Neither the constant-sequence nor the alternating-sequence design is immune to the potential for bias from differential numbers of dropout. In both cases, women who are most fertile will be the ones most likely to drop out (i.e., become pregnant). Consequently, the population at each successive cycle constitutes a progressively more infertile group. Moreover, when the treatment is effective, there will be a systematic distortion of the two groups. Thus, with effective treatment, the relatively more fertile patients in the experimental arm in the constant-sequence design will drop out at a higher rate than those in the control arm so that over successive cycles, the treatment will seem to become progressively less effective. This effect of decreasing OR in a linear fashion over successive cycles was also observed in the computer simulation study (8). The net result of this phenomenon is that the constant-sequence design study may underestimate the true therapeutic effect. In the present study, the final estimated RR was 1.83 instead of 2.00, and in the computer simulation study the final estimated OR was 2.05 instead of 2.25 (8). Conversely, in the alternating-sequence design trial, patients who have not conceived in the first cycle are now crossed over into the alternative group. Thus, those who were in the control group are now in the experimental group and vice versa. A potential problem in the opposite direction may now be encountered. If therapy has been effective in the first cycle, the new experimental group will now consist of couples that are systematically more fertile than those in the new control group. Consequently, treatment would appear more effective than it is in reality. Therefore, the alternatingsequence design trial will produce results that may overestimate the true treatment effect. The situation is more complicated than that of the constant-sequence design trial, because the effective treatment is now acting on the subgroup with relatively more fertile women thereby producing, at the end of the second cycle, two groups that are somewhat more similar. This effect is also seen in the computer simulation study in which the successive ORs for the even-numbered cycles were 2.40, 2.36, and 2.36 compared to 2.25, 2.26, 2.18, and 2.30 in the odd-numbered cycles (8). The one caveat to the use of the alternating-sequence design is that, in situations in which there may be carryover effects, the consequence will be to increase fertility in the control group and thereby underestimate the treatment effect. The finding of this study is of particular importance in present-day infertility research because of the increasing emphasis that is being placed on the two-period crossover design trial to evaluate therapy. It can be seen clearly from Table 2 that because the second cycle produces a larger estimate of the treatment effect, the overall conclusion at the end of the two periods (or cycles) will be an overestimation of the true treatment effect, i.e., the estimate in period 1 (or cycle 1) is 2.00 (as expected), but in period 2 (or cycle 2) it is much higher at 2.42, resulting in an overall estimate at the end of the two-period crossover study of A more reliable approach is to conduct a one-period trial in which patients are randomly allocated to receive either treatment or control intervention in one cycle. Such a parallel group design trial, with sufficient power, is the best design to evaluate treatment efficacy and would provide an estimate of treatment effect that is closer to truth. However, it requires a much larger sample size than with the crossover design, and half of the patients would have to be in the control (or nonintervention arm) of the trial. Hence, it is not as popular with patients as is the two-period crossover design, which ensures that patients allocated to the control arm in the first period will receive the experimental treatment in the second period. The objective of obtaining an accurate estimate of the effect of treatment, but also allowing all subjects to have the opportunity to receive the experimental treatment in at least one cycle, can now be achieved with the alternating-sequence design trial. The proviso is that the trial should run for at least three cycles and all data from the even-numbered cycles would have to be excluded from the analysis, which would be restricted only to the odd-numbered cycles. Because up to half of the data have to be discarded with this approach, the design has obvious implications for calculating sample size, which is now expressed in cycles and not in persons. Of course, with lower fecundity rates, the magnitude of the bias is reduced, and it may be that individual investigators may choose to accept the possibility of bias in return for an increase in statistical power. This strategy of using the alternating-sequence design trial may have implications beyond infertility trials. In situations where there is a differential susceptibility in the population and where the outcome leads to a dropout from the trial, such as would arise from either cure or death, then effective treatments inevitably will be applied to progressively more resistant subpopulations as time unfolds. Under these circumstances, it may be wise to induce a period effect, so that the treatment can be applied equally to both cohorts, thereby maintaining similar characteristics in the experimental and control samples over time. FERTILITY & STERILITY 323

6 References 1. Daya S. Is there a place for the crossover design in infertility trials? Fertil Steril 1993;59: Lois TA, Lavori PW, Bailar JC, Polansky M. Crossover and selfcontrolled designs in clinical research. N Engl J Med 1984;310: Hills M, Armitage P. Two-period crossover clinical trial. Br J Pharmacol 1979;82: Petrie A. The crossover design. In: Tygstrup N, Lachin JM, Juhl E, editors. The randomized clinical trial and therapeutic decisions. New York: Marcel Dekker Inc. 1982: Senn S. Cross-over trials in clinical research. Chichester, England: John Wiley and Sons Ltd. 1993:8. 6. Daya S. Differences between crossover and parallel study designs debate? Fertil Steril 1999;71: Khan K, Daya S, Collins JA, Walter SD. Empirical evidence of bias in infertility research: overestimation of treatment effect in crossover trials using pregnancy as the outcome measure. Fertil Steril 1996;65: Cohlen BJ, le Velde ER, Looman CWN, Eijckemans R, Habbema JDF. Crossover or parallel design in infertility trials? The discussion continues. Fertil Steril 1998;70:40 5. APPENDIX Let p(f) be the probability distribution of women as a function of fertility (F), where F ranges from 0 to 1. So, for a sample of size N, the number of women at a given fertility, F is equal to Np(F). Furthermore, in the most general case, the effectiveness of the drug,, will also be related to fertility, so is expressed as (F). So, at each cycle, the number of pregnancies in the control group will be F the number of patients at risk, and in the experimental group will be F (F) the number at risk. At the conclusion of each cycle, then, the number remaining at risk (who will then be crossed over at the next cycle) is (1 F) the number at risk in the control group, and (1 F (F)) the number at risk in the experimental group. We can now compute the number of pregnancies at each cycle for the crossover sequence, as shown in Appendix Table 1. Because the number of patients at fertility F is exactly the same in both groups at the beginning of cycle 3, we have proved that the effectiveness computed from odd-numbered cycles is an unbiased estimate, in the general case of an arbitrary distribution of fertility and a drug effectiveness that is a function of fertility. APPENDIX TABLE 1 Theoretical calculation of the number of pregnancies and number of exposed patients for three cycles of a crossover sequence, for the general case of an arbitrary distribution of fertility and effectiveness related to fertility. Control Treatment Cycle 1 No. at risk N 1c Np(F) N 1t Np(F) No. pregnant F N 1c F N p(f) F (F) N 1t F (F) N p(f) No. at risk at end of cycle (1 F) N 1c (1 F)N p(f) (1 F (F)) N 1t (1 F (F)) N p(f) Cycle 2 No. at risk N 2c (1 F (F)) N p(f) N 2t (1 F) N p(f) No. pregnant F N 2c F(1 F (F)) N p(f) F (F)N 2t F (F)(1 F) N p(f) No. at risk at end of cycle (1 F) N 2c (1 F)(1 F (F)) N p(f) (1 F (F)) N 2t (1 F (F)(1 F) N p(f) Cycle 3 No. at risk N 3c (1 F)(1 F (F)) N p(f) N 3t (1 F (F)(1 F) N p(f) 324 Norman and Daya The alternating-sequence design trial Vol. 74, No. 2, August 2000

COMMITTEE FOR PROPRIETARY MEDICINAL PRODUCTS (CPMP) POINTS TO CONSIDER ON MISSING DATA

COMMITTEE FOR PROPRIETARY MEDICINAL PRODUCTS (CPMP) POINTS TO CONSIDER ON MISSING DATA The European Agency for the Evaluation of Medicinal Products Evaluation of Medicines for Human Use London, 15 November 2001 CPMP/EWP/1776/99 COMMITTEE FOR PROPRIETARY MEDICINAL PRODUCTS (CPMP) POINTS TO

More information

CLINICAL ASSISTED REPRODUCTION

CLINICAL ASSISTED REPRODUCTION CLINICAL ASSISTED REPRODUCTION IVF Births and Pregnancies: An Exploration of Two Methods of Assessment Using Life-Table Analysis R. DEONANDAN, 1,4 M. K. CAMPBELL, 1 T. OSTBYE, 2 I. TUMMON, 3 and J. ROBERTSON

More information

Cochrane Pregnancy and Childbirth Group Methodological Guidelines

Cochrane Pregnancy and Childbirth Group Methodological Guidelines Cochrane Pregnancy and Childbirth Group Methodological Guidelines [Prepared by Simon Gates: July 2009, updated July 2012] These guidelines are intended to aid quality and consistency across the reviews

More information

Revised Cochrane risk of bias tool for randomized trials (RoB 2.0) Additional considerations for cross-over trials

Revised Cochrane risk of bias tool for randomized trials (RoB 2.0) Additional considerations for cross-over trials Revised Cochrane risk of bias tool for randomized trials (RoB 2.0) Additional considerations for cross-over trials Edited by Julian PT Higgins on behalf of the RoB 2.0 working group on cross-over trials

More information

Meta-Analysis. Zifei Liu. Biological and Agricultural Engineering

Meta-Analysis. Zifei Liu. Biological and Agricultural Engineering Meta-Analysis Zifei Liu What is a meta-analysis; why perform a metaanalysis? How a meta-analysis work some basic concepts and principles Steps of Meta-analysis Cautions on meta-analysis 2 What is Meta-analysis

More information

* Present address: Foothills Hospital, Calgary, Alberta, Canada.

* Present address: Foothills Hospital, Calgary, Alberta, Canada. FERTILITY AND STERILITY Copyright 1993 The American Fertility Society Vol. 59, No. 6, June 1993 Printed on acid-free paper in U.S.A. A randomized trial of in vitro fertilization versus conventional treatment

More information

Flexible Matching in Case-Control Studies of Gene-Environment Interactions

Flexible Matching in Case-Control Studies of Gene-Environment Interactions American Journal of Epidemiology Copyright 2004 by the Johns Hopkins Bloomberg School of Public Health All rights reserved Vol. 59, No. Printed in U.S.A. DOI: 0.093/aje/kwg250 ORIGINAL CONTRIBUTIONS Flexible

More information

Methods of Calculating Deaths Attributable to Obesity

Methods of Calculating Deaths Attributable to Obesity American Journal of Epidemiology Copyright 2004 by the Johns Hopkins Bloomberg School of Public Health All rights reserved Vol. 160, No. 4 Printed in U.S.A. DOI: 10.1093/aje/kwh222 Methods of Calculating

More information

Regression Discontinuity Analysis

Regression Discontinuity Analysis Regression Discontinuity Analysis A researcher wants to determine whether tutoring underachieving middle school students improves their math grades. Another wonders whether providing financial aid to low-income

More information

STEP II Conceptualising a Research Design

STEP II Conceptualising a Research Design STEP II Conceptualising a Research Design This operational step includes two chapters: Chapter 7: The research design Chapter 8: Selecting a study design CHAPTER 7 The Research Design In this chapter you

More information

Bias in randomised factorial trials

Bias in randomised factorial trials Research Article Received 17 September 2012, Accepted 9 May 2013 Published online 4 June 2013 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/sim.5869 Bias in randomised factorial trials

More information

BAYESIAN ESTIMATORS OF THE LOCATION PARAMETER OF THE NORMAL DISTRIBUTION WITH UNKNOWN VARIANCE

BAYESIAN ESTIMATORS OF THE LOCATION PARAMETER OF THE NORMAL DISTRIBUTION WITH UNKNOWN VARIANCE BAYESIAN ESTIMATORS OF THE LOCATION PARAMETER OF THE NORMAL DISTRIBUTION WITH UNKNOWN VARIANCE Janet van Niekerk* 1 and Andriette Bekker 1 1 Department of Statistics, University of Pretoria, 0002, Pretoria,

More information

Book review of Herbert I. Weisberg: Bias and Causation, Models and Judgment for Valid Comparisons Reviewed by Judea Pearl

Book review of Herbert I. Weisberg: Bias and Causation, Models and Judgment for Valid Comparisons Reviewed by Judea Pearl Book review of Herbert I. Weisberg: Bias and Causation, Models and Judgment for Valid Comparisons Reviewed by Judea Pearl Judea Pearl University of California, Los Angeles Computer Science Department Los

More information

Appendix G: Methodology checklist: the QUADAS tool for studies of diagnostic test accuracy 1

Appendix G: Methodology checklist: the QUADAS tool for studies of diagnostic test accuracy 1 Appendix G: Methodology checklist: the QUADAS tool for studies of diagnostic test accuracy 1 Study identification Including author, title, reference, year of publication Guideline topic: Checklist completed

More information

Meta-Analysis and Publication Bias: How Well Does the FAT-PET-PEESE Procedure Work?

Meta-Analysis and Publication Bias: How Well Does the FAT-PET-PEESE Procedure Work? Meta-Analysis and Publication Bias: How Well Does the FAT-PET-PEESE Procedure Work? Nazila Alinaghi W. Robert Reed Department of Economics and Finance, University of Canterbury Abstract: This study uses

More information

The comparison or control group may be allocated a placebo intervention, an alternative real intervention or no intervention at all.

The comparison or control group may be allocated a placebo intervention, an alternative real intervention or no intervention at all. 1. RANDOMISED CONTROLLED TRIALS (Treatment studies) (Relevant JAMA User s Guides, Numbers IIA & B: references (3,4) Introduction: The most valid study design for assessing the effectiveness (both the benefits

More information

Statistics for Clinical Trials: Basics of Phase III Trial Design

Statistics for Clinical Trials: Basics of Phase III Trial Design Statistics for Clinical Trials: Basics of Phase III Trial Design Gary M. Clark, Ph.D. Vice President Biostatistics & Data Management Array BioPharma Inc. Boulder, Colorado USA NCIC Clinical Trials Group

More information

Evidence-Based Medicine Journal Club. A Primer in Statistics, Study Design, and Epidemiology. August, 2013

Evidence-Based Medicine Journal Club. A Primer in Statistics, Study Design, and Epidemiology. August, 2013 Evidence-Based Medicine Journal Club A Primer in Statistics, Study Design, and Epidemiology August, 2013 Rationale for EBM Conscientious, explicit, and judicious use Beyond clinical experience and physiologic

More information

CONTINUOUS AND CATEGORICAL TREND ESTIMATORS: SIMULATION RESULTS AND AN APPLICATION TO RESIDENTIAL RADON

CONTINUOUS AND CATEGORICAL TREND ESTIMATORS: SIMULATION RESULTS AND AN APPLICATION TO RESIDENTIAL RADON CONTINUOUS AND CATEGORICAL TREND ESTIMATORS: SIMULATION RESULTS AND AN APPLICATION TO RESIDENTIAL RADON A Schaffrath Rosario 1,2*, J Wellmann 1,3, IM Heid 1 and HE Wichmann 1,2 1 Institute of Epidemiology,

More information

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Lec 02: Estimation & Hypothesis Testing in Animal Ecology Lec 02: Estimation & Hypothesis Testing in Animal Ecology Parameter Estimation from Samples Samples We typically observe systems incompletely, i.e., we sample according to a designed protocol. We then

More information

Measuring cancer survival in populations: relative survival vs cancer-specific survival

Measuring cancer survival in populations: relative survival vs cancer-specific survival Int. J. Epidemiol. Advance Access published February 8, 2010 Published by Oxford University Press on behalf of the International Epidemiological Association ß The Author 2010; all rights reserved. International

More information

Practitioner s Guide To Stratified Random Sampling: Part 1

Practitioner s Guide To Stratified Random Sampling: Part 1 Practitioner s Guide To Stratified Random Sampling: Part 1 By Brian Kriegler November 30, 2018, 3:53 PM EST This is the first of two articles on stratified random sampling. In the first article, I discuss

More information

Title: Who does not participate in a follow-up postal study? A survey of infertile couples treated by in vitro fertilization

Title: Who does not participate in a follow-up postal study? A survey of infertile couples treated by in vitro fertilization Author's response to reviews Title: Who does not participate in a follow-up postal study? A survey of infertile couples treated by in vitro fertilization Authors: Pénélope Troude (penelope.troude@inserm.fr)

More information

UNIT I SAMPLING AND EXPERIMENTATION: PLANNING AND CONDUCTING A STUDY (Chapter 4)

UNIT I SAMPLING AND EXPERIMENTATION: PLANNING AND CONDUCTING A STUDY (Chapter 4) UNIT I SAMPLING AND EXPERIMENTATION: PLANNING AND CONDUCTING A STUDY (Chapter 4) A DATA COLLECTION (Overview) When researchers want to make conclusions/inferences about an entire population, they often

More information

Where does "analysis" enter the experimental process?

Where does analysis enter the experimental process? Lecture Topic : ntroduction to the Principles of Experimental Design Experiment: An exercise designed to determine the effects of one or more variables (treatments) on one or more characteristics (response

More information

Systematic Reviews and Meta- Analysis in Kidney Transplantation

Systematic Reviews and Meta- Analysis in Kidney Transplantation Systematic Reviews and Meta- Analysis in Kidney Transplantation Greg Knoll MD MSc Associate Professor of Medicine Medical Director, Kidney Transplantation University of Ottawa and The Ottawa Hospital KRESCENT

More information

Department of International Health

Department of International Health JOHNS HOPKINS U N I V E R S I T Y Center for Clinical Trials Department of Biostatistics Department of Epidemiology Department of International Health Memorandum Department of Medicine Department of Ophthalmology

More information

Comparison of hysterosalpingography and laparoscopy in predicting fertility outcome

Comparison of hysterosalpingography and laparoscopy in predicting fertility outcome Human Reproduction vol.14 no.5 pp.1237 1242, 1999 Comparison of hysterosalpingography and in predicting fertility outcome Ben W.J.Mol 1,2,5, John A.Collins 3,4, Elizabeth A.Burrows 4, Fulco van der Veen

More information

Gender-Based Differential Item Performance in English Usage Items

Gender-Based Differential Item Performance in English Usage Items A C T Research Report Series 89-6 Gender-Based Differential Item Performance in English Usage Items Catherine J. Welch Allen E. Doolittle August 1989 For additional copies write: ACT Research Report Series

More information

A Decision Tree for Controlled Trials

A Decision Tree for Controlled Trials SPORTSCIENCE Perspectives / Research Resources A Decision Tree for Controlled Trials Alan M Batterham, Will G Hopkins sportsci.org Sportscience 9, 33-39, 2005 (sportsci.org/jour/05/wghamb.htm) School of

More information

Adjusted Crash Odds Ratio Estimates of Driver Behavior Errors: A Re-Analysis of the SHRP2 Naturalistic Driving Study Data

Adjusted Crash Odds Ratio Estimates of Driver Behavior Errors: A Re-Analysis of the SHRP2 Naturalistic Driving Study Data University of Iowa Iowa Research Online Driving Assessment Conference 2017 Driving Assessment Conference Jun 28th, 12:00 AM Adjusted Crash Odds Ratio Estimates of Driver Behavior Errors: A Re-Analysis

More information

EU regulatory concerns about missing data: Issues of interpretation and considerations for addressing the problem. Prof. Dr.

EU regulatory concerns about missing data: Issues of interpretation and considerations for addressing the problem. Prof. Dr. EU regulatory concerns about missing data: Issues of interpretation and considerations for addressing the problem Prof. Dr. Karl Broich Disclaimer No conflicts of interest Views expressed in this presentation

More information

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY Lingqi Tang 1, Thomas R. Belin 2, and Juwon Song 2 1 Center for Health Services Research,

More information

Selected Topics in Biostatistics Seminar Series. Missing Data. Sponsored by: Center For Clinical Investigation and Cleveland CTSC

Selected Topics in Biostatistics Seminar Series. Missing Data. Sponsored by: Center For Clinical Investigation and Cleveland CTSC Selected Topics in Biostatistics Seminar Series Missing Data Sponsored by: Center For Clinical Investigation and Cleveland CTSC Brian Schmotzer, MS Biostatistician, CCI Statistical Sciences Core brian.schmotzer@case.edu

More information

VALIDITY OF QUANTITATIVE RESEARCH

VALIDITY OF QUANTITATIVE RESEARCH Validity 1 VALIDITY OF QUANTITATIVE RESEARCH Recall the basic aim of science is to explain natural phenomena. Such explanations are called theories (Kerlinger, 1986, p. 8). Theories have varying degrees

More information

Chapter Three Research Methodology

Chapter Three Research Methodology Chapter Three Research Methodology Research Methods is a systematic and principled way of obtaining evidence (data, information) for solving health care problems. 1 Dr. Mohammed ALnaif METHODS AND KNOWLEDGE

More information

Helmut Schütz. BioBriges 2018 Prague, September

Helmut Schütz. BioBriges 2018 Prague, September Multi-Group Studies in BE. Multi-Group Studies in Bioequivalence. To pool or not to pool? Helmut Schütz BioBriges 2018 Prague, 26 27 September 2018 1 Remember Whenever a theory appears to you as the only

More information

Evaluating Social Programs Course: Evaluation Glossary (Sources: 3ie and The World Bank)

Evaluating Social Programs Course: Evaluation Glossary (Sources: 3ie and The World Bank) Evaluating Social Programs Course: Evaluation Glossary (Sources: 3ie and The World Bank) Attribution The extent to which the observed change in outcome is the result of the intervention, having allowed

More information

A Case Study: Two-sample categorical data

A Case Study: Two-sample categorical data A Case Study: Two-sample categorical data Patrick Breheny January 31 Patrick Breheny BST 701: Bayesian Modeling in Biostatistics 1/43 Introduction Model specification Continuous vs. mixture priors Choice

More information

Structural Approach to Bias in Meta-analyses

Structural Approach to Bias in Meta-analyses Original Article Received 26 July 2011, Revised 22 November 2011, Accepted 12 December 2011 Published online 2 February 2012 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/jrsm.52 Structural

More information

Appendix Table A Frequency of end-stage liver disease in inception cohort. Reference Exposure followup Frequency ESLD (%) Seeff-1 PTH 24 23/568 (4%) 7

Appendix Table A Frequency of end-stage liver disease in inception cohort. Reference Exposure followup Frequency ESLD (%) Seeff-1 PTH 24 23/568 (4%) 7 Published as supplied by the author Appendix Table A Frequency of end-stage liver disease in inception cohort studies Route of Years Reference Exposure followup Frequency ESLD (%) Locasciulli PTH 1 15

More information

Clinical research in AKI Timing of initiation of dialysis in AKI

Clinical research in AKI Timing of initiation of dialysis in AKI Clinical research in AKI Timing of initiation of dialysis in AKI Josée Bouchard, MD Krescent Workshop December 10 th, 2011 1 Acute kidney injury in ICU 15 25% of critically ill patients experience AKI

More information

Understanding noninferiority trials

Understanding noninferiority trials Review article http://dx.doi.org/10.3345/kjp.2012.55.11.403 Korean J Pediatr 2012;55(11):403-407 eissn 1738-1061 pissn 2092-7258 Understanding noninferiority trials Seokyung Hahn, PhD Department of Medicine,

More information

Lecture 4: Research Approaches

Lecture 4: Research Approaches Lecture 4: Research Approaches Lecture Objectives Theories in research Research design approaches ú Experimental vs. non-experimental ú Cross-sectional and longitudinal ú Descriptive approaches How to

More information

INTERNAL VALIDITY, BIAS AND CONFOUNDING

INTERNAL VALIDITY, BIAS AND CONFOUNDING OCW Epidemiology and Biostatistics, 2010 J. Forrester, PhD Tufts University School of Medicine October 6, 2010 INTERNAL VALIDITY, BIAS AND CONFOUNDING Learning objectives for this session: 1) Understand

More information

Understanding Uncertainty in School League Tables*

Understanding Uncertainty in School League Tables* FISCAL STUDIES, vol. 32, no. 2, pp. 207 224 (2011) 0143-5671 Understanding Uncertainty in School League Tables* GEORGE LECKIE and HARVEY GOLDSTEIN Centre for Multilevel Modelling, University of Bristol

More information

Fundamental Clinical Trial Design

Fundamental Clinical Trial Design Design, Monitoring, and Analysis of Clinical Trials Session 1 Overview and Introduction Overview Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics, University of Washington February 17-19, 2003

More information

A re-randomisation design for clinical trials

A re-randomisation design for clinical trials Kahan et al. BMC Medical Research Methodology (2015) 15:96 DOI 10.1186/s12874-015-0082-2 RESEARCH ARTICLE Open Access A re-randomisation design for clinical trials Brennan C Kahan 1*, Andrew B Forbes 2,

More information

About Reading Scientific Studies

About Reading Scientific Studies About Reading Scientific Studies TABLE OF CONTENTS About Reading Scientific Studies... 1 Why are these skills important?... 1 Create a Checklist... 1 Introduction... 1 Abstract... 1 Background... 2 Methods...

More information

Do the sample size assumptions for a trial. addressing the following question: Among couples with unexplained infertility does

Do the sample size assumptions for a trial. addressing the following question: Among couples with unexplained infertility does Exercise 4 Do the sample size assumptions for a trial addressing the following question: Among couples with unexplained infertility does a program of up to three IVF cycles compared with up to three FSH

More information

Estimation of effect sizes in the presence of publication bias: a comparison of meta-analysis methods

Estimation of effect sizes in the presence of publication bias: a comparison of meta-analysis methods Estimation of effect sizes in the presence of publication bias: a comparison of meta-analysis methods Hilde Augusteijn M.A.L.M. van Assen R. C. M. van Aert APS May 29, 2016 Today s presentation Estimation

More information

Technical Specifications

Technical Specifications Technical Specifications In order to provide summary information across a set of exercises, all tests must employ some form of scoring models. The most familiar of these scoring models is the one typically

More information

Patrick Breheny. January 28

Patrick Breheny. January 28 Confidence intervals Patrick Breheny January 28 Patrick Breheny Introduction to Biostatistics (171:161) 1/19 Recap Introduction In our last lecture, we discussed at some length the Public Health Service

More information

SLAUGHTER PIG MARKETING MANAGEMENT: UTILIZATION OF HIGHLY BIASED HERD SPECIFIC DATA. Henrik Kure

SLAUGHTER PIG MARKETING MANAGEMENT: UTILIZATION OF HIGHLY BIASED HERD SPECIFIC DATA. Henrik Kure SLAUGHTER PIG MARKETING MANAGEMENT: UTILIZATION OF HIGHLY BIASED HERD SPECIFIC DATA Henrik Kure Dina, The Royal Veterinary and Agricuural University Bülowsvej 48 DK 1870 Frederiksberg C. kure@dina.kvl.dk

More information

Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis

Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis EFSA/EBTC Colloquium, 25 October 2017 Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis Julian Higgins University of Bristol 1 Introduction to concepts Standard

More information

How to interpret results of metaanalysis

How to interpret results of metaanalysis How to interpret results of metaanalysis Tony Hak, Henk van Rhee, & Robert Suurmond Version 1.0, March 2016 Version 1.3, Updated June 2018 Meta-analysis is a systematic method for synthesizing quantitative

More information

Introduction to Bayesian Analysis 1

Introduction to Bayesian Analysis 1 Biostats VHM 801/802 Courses Fall 2005, Atlantic Veterinary College, PEI Henrik Stryhn Introduction to Bayesian Analysis 1 Little known outside the statistical science, there exist two different approaches

More information

Dichotomizing partial compliance and increased participant burden in factorial designs: the performance of four noncompliance methods

Dichotomizing partial compliance and increased participant burden in factorial designs: the performance of four noncompliance methods Merrill and McClure Trials (2015) 16:523 DOI 1186/s13063-015-1044-z TRIALS RESEARCH Open Access Dichotomizing partial compliance and increased participant burden in factorial designs: the performance of

More information

Modeling and Environmental Science: In Conclusion

Modeling and Environmental Science: In Conclusion Modeling and Environmental Science: In Conclusion Environmental Science It sounds like a modern idea, but if you view it broadly, it s a very old idea: Our ancestors survival depended on their knowledge

More information

Strategies for handling missing data in randomised trials

Strategies for handling missing data in randomised trials Strategies for handling missing data in randomised trials NIHR statistical meeting London, 13th February 2012 Ian White MRC Biostatistics Unit, Cambridge, UK Plan 1. Why do missing data matter? 2. Popular

More information

Thresholds for statistical and clinical significance in systematic reviews with meta-analytic methods

Thresholds for statistical and clinical significance in systematic reviews with meta-analytic methods Jakobsen et al. BMC Medical Research Methodology 2014, 14:120 CORRESPONDENCE Open Access Thresholds for statistical and clinical significance in systematic reviews with meta-analytic methods Janus Christian

More information

Vocabulary. Bias. Blinding. Block. Cluster sample

Vocabulary. Bias. Blinding. Block. Cluster sample Bias Blinding Block Census Cluster sample Confounding Control group Convenience sample Designs Experiment Experimental units Factor Level Any systematic failure of a sampling method to represent its population

More information

1 Introduction. st0020. The Stata Journal (2002) 2, Number 3, pp

1 Introduction. st0020. The Stata Journal (2002) 2, Number 3, pp The Stata Journal (22) 2, Number 3, pp. 28 289 Comparative assessment of three common algorithms for estimating the variance of the area under the nonparametric receiver operating characteristic curve

More information

The RoB 2.0 tool (individually randomized, cross-over trials)

The RoB 2.0 tool (individually randomized, cross-over trials) The RoB 2.0 tool (individually randomized, cross-over trials) Study design Randomized parallel group trial Cluster-randomized trial Randomized cross-over or other matched design Specify which outcome is

More information

The Exposure-Stratified Retrospective Study: Application to High-Incidence Diseases

The Exposure-Stratified Retrospective Study: Application to High-Incidence Diseases The Exposure-Stratified Retrospective Study: Application to High-Incidence Diseases Peng T. Liu and Debra A. Street Division of Public Health and Biostatistics, CFSAN, FDA 5100 Paint Branch Pkwy, College

More information

The Impact of Continuity Violation on ANOVA and Alternative Methods

The Impact of Continuity Violation on ANOVA and Alternative Methods Journal of Modern Applied Statistical Methods Volume 12 Issue 2 Article 6 11-1-2013 The Impact of Continuity Violation on ANOVA and Alternative Methods Björn Lantz Chalmers University of Technology, Gothenburg,

More information

Comparison of Meta-Analytic Results of Indirect, Direct, and Combined Comparisons of Drugs for Chronic Insomnia in Adults: A Case Study

Comparison of Meta-Analytic Results of Indirect, Direct, and Combined Comparisons of Drugs for Chronic Insomnia in Adults: A Case Study ORIGINAL ARTICLE Comparison of Meta-Analytic Results of Indirect, Direct, and Combined Comparisons of Drugs for Chronic Insomnia in Adults: A Case Study Ben W. Vandermeer, BSc, MSc, Nina Buscemi, PhD,

More information

I N PREVIOUS COMMUNICATIONS, 1. 2

I N PREVIOUS COMMUNICATIONS, 1. 2 Day of Conception in Relation to Length of Menstrual Cycle A Study of 65 Conceptions Resulting from Isolated Coitus DOUGLAS P. MURPHY, M.D., and EDITHA F. TORRANO, M.D. I N PREVIOUS COMMUNICATIONS,. 2

More information

DEFINING THE CASE STUDY Yin, Ch. 1

DEFINING THE CASE STUDY Yin, Ch. 1 Case Study Research DEFINING THE CASE STUDY Yin, Ch. 1 Goals for today are to understand: 1. What is a case study 2. When is it useful 3. Guidelines for designing a case study 4. Identifying key methodological

More information

Evaluation Models STUDIES OF DIAGNOSTIC EFFICIENCY

Evaluation Models STUDIES OF DIAGNOSTIC EFFICIENCY 2. Evaluation Model 2 Evaluation Models To understand the strengths and weaknesses of evaluation, one must keep in mind its fundamental purpose: to inform those who make decisions. The inferences drawn

More information

Mantel-Haenszel Procedures for Detecting Differential Item Functioning

Mantel-Haenszel Procedures for Detecting Differential Item Functioning A Comparison of Logistic Regression and Mantel-Haenszel Procedures for Detecting Differential Item Functioning H. Jane Rogers, Teachers College, Columbia University Hariharan Swaminathan, University of

More information

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1 Welch et al. BMC Medical Research Methodology (2018) 18:89 https://doi.org/10.1186/s12874-018-0548-0 RESEARCH ARTICLE Open Access Does pattern mixture modelling reduce bias due to informative attrition

More information

Controlled Trials. Spyros Kitsiou, PhD

Controlled Trials. Spyros Kitsiou, PhD Assessing Risk of Bias in Randomized Controlled Trials Spyros Kitsiou, PhD Assistant Professor Department of Biomedical and Health Information Sciences College of Applied Health Sciences University of

More information

Differential Item Functioning

Differential Item Functioning Differential Item Functioning Lecture #11 ICPSR Item Response Theory Workshop Lecture #11: 1of 62 Lecture Overview Detection of Differential Item Functioning (DIF) Distinguish Bias from DIF Test vs. Item

More information

Live WebEx meeting agenda

Live WebEx meeting agenda 10:00am 10:30am Using OpenMeta[Analyst] to extract quantitative data from published literature Live WebEx meeting agenda August 25, 10:00am-12:00pm ET 10:30am 11:20am Lecture (this will be recorded) 11:20am

More information

Exploring the Impact of Missing Data in Multiple Regression

Exploring the Impact of Missing Data in Multiple Regression Exploring the Impact of Missing Data in Multiple Regression Michael G Kenward London School of Hygiene and Tropical Medicine 28th May 2015 1. Introduction In this note we are concerned with the conduct

More information

Incomplete working draft, please do not quote without authors permission

Incomplete working draft, please do not quote without authors permission Will fertility of Danish women remain stable due to assisted reproduction? Assessing the role of assisted reproduction in sustaining cohort fertility rates Incomplete working draft, please do not quote

More information

Propensity score methods to adjust for confounding in assessing treatment effects: bias and precision

Propensity score methods to adjust for confounding in assessing treatment effects: bias and precision ISPUB.COM The Internet Journal of Epidemiology Volume 7 Number 2 Propensity score methods to adjust for confounding in assessing treatment effects: bias and precision Z Wang Abstract There is an increasing

More information

Performance of the Trim and Fill Method in Adjusting for the Publication Bias in Meta-Analysis of Continuous Data

Performance of the Trim and Fill Method in Adjusting for the Publication Bias in Meta-Analysis of Continuous Data American Journal of Applied Sciences 9 (9): 1512-1517, 2012 ISSN 1546-9239 2012 Science Publication Performance of the Trim and Fill Method in Adjusting for the Publication Bias in Meta-Analysis of Continuous

More information

Confidence Intervals On Subsets May Be Misleading

Confidence Intervals On Subsets May Be Misleading Journal of Modern Applied Statistical Methods Volume 3 Issue 2 Article 2 11-1-2004 Confidence Intervals On Subsets May Be Misleading Juliet Popper Shaffer University of California, Berkeley, shaffer@stat.berkeley.edu

More information

Epidemiology 2200b Lecture 3 (continued again) Review of sources of bias in Case-control, cohort and ecologic(al) studies

Epidemiology 2200b Lecture 3 (continued again) Review of sources of bias in Case-control, cohort and ecologic(al) studies Epidemiology 2200b Lecture 3 (continued again) Review of sources of bias in Case-control, cohort and ecologic(al) studies Before we start one- and two-sided tests When to we perform a one-sided hypothesis

More information

Proof. Revised. Chapter 12 General and Specific Factors in Selection Modeling Introduction. Bengt Muthén

Proof. Revised. Chapter 12 General and Specific Factors in Selection Modeling Introduction. Bengt Muthén Chapter 12 General and Specific Factors in Selection Modeling Bengt Muthén Abstract This chapter shows how analysis of data on selective subgroups can be used to draw inference to the full, unselected

More information

Methods for Computing Missing Item Response in Psychometric Scale Construction

Methods for Computing Missing Item Response in Psychometric Scale Construction American Journal of Biostatistics Original Research Paper Methods for Computing Missing Item Response in Psychometric Scale Construction Ohidul Islam Siddiqui Institute of Statistical Research and Training

More information

Estimating the number of components with defects post-release that showed no defects in testing

Estimating the number of components with defects post-release that showed no defects in testing SOFTWARE TESTING, VERIFICATION AND RELIABILITY Softw. Test. Verif. Reliab. 2002; 12:93 122 (DOI: 10.1002/stvr.235) Estimating the number of components with defects post-release that showed no defects in

More information

baseline comparisons in RCTs

baseline comparisons in RCTs Stefan L. K. Gruijters Maastricht University Introduction Checks on baseline differences in randomized controlled trials (RCTs) are often done using nullhypothesis significance tests (NHSTs). In a quick

More information

Glossary. Ó 2010 John Wiley & Sons, Ltd

Glossary. Ó 2010 John Wiley & Sons, Ltd Glossary The majority of the definitions within this glossary are based on, but are only a selection from, the comprehensive list provided by Day (2007) in the Dictionary of Clinical Trials. We have added

More information

Types of Data. Systematic Reviews: Data Synthesis Professor Jodie Dodd 4/12/2014. Acknowledgements: Emily Bain Australasian Cochrane Centre

Types of Data. Systematic Reviews: Data Synthesis Professor Jodie Dodd 4/12/2014. Acknowledgements: Emily Bain Australasian Cochrane Centre Early Nutrition Workshop, December 2014 Systematic Reviews: Data Synthesis Professor Jodie Dodd 1 Types of Data Acknowledgements: Emily Bain Australasian Cochrane Centre 2 1 What are dichotomous outcomes?

More information

Attributes Statistical Sampling Tables

Attributes Statistical Sampling Tables Appendix A Attributes Statistical Sampling Tables 129 Attributes Statistical Sampling Tables A.1 Four tables appear at the end of this appendix to assist the auditor in planning and evaluating a statistical

More information

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS)

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS) Chapter : Advanced Remedial Measures Weighted Least Squares (WLS) When the error variance appears nonconstant, a transformation (of Y and/or X) is a quick remedy. But it may not solve the problem, or it

More information

GLOSSARY OF GENERAL TERMS

GLOSSARY OF GENERAL TERMS GLOSSARY OF GENERAL TERMS Absolute risk reduction Absolute risk reduction (ARR) is the difference between the event rate in the control group (CER) and the event rate in the treated group (EER). ARR =

More information

Research Prospectus. Your major writing assignment for the quarter is to prepare a twelve-page research prospectus.

Research Prospectus. Your major writing assignment for the quarter is to prepare a twelve-page research prospectus. Department of Political Science UNIVERSITY OF CALIFORNIA, SAN DIEGO Philip G. Roeder Research Prospectus Your major writing assignment for the quarter is to prepare a twelve-page research prospectus. A

More information

Comments on Significance of candidate cancer genes as assessed by the CaMP score by Parmigiani et al.

Comments on Significance of candidate cancer genes as assessed by the CaMP score by Parmigiani et al. Comments on Significance of candidate cancer genes as assessed by the CaMP score by Parmigiani et al. Holger Höfling Gad Getz Robert Tibshirani June 26, 2007 1 Introduction Identifying genes that are involved

More information

E:\ F SOCI 502\Lectures\Research_Design\Research_Design_Text.wpd SOCI 502: NOTES ON RESEARCH DESIGN

E:\ F SOCI 502\Lectures\Research_Design\Research_Design_Text.wpd SOCI 502: NOTES ON RESEARCH DESIGN 1 E:\02 2004F SOCI 502\Lectures\Research_Design\Research_Design_Text.wpd SOCI 502: NOTES ON RESEARCH DESIGN 2 RESEARCH DESIGN:! A research design is a set of logical procedures that (when followed) enables

More information

MCAS Equating Research Report: An Investigation of FCIP-1, FCIP-2, and Stocking and. Lord Equating Methods 1,2

MCAS Equating Research Report: An Investigation of FCIP-1, FCIP-2, and Stocking and. Lord Equating Methods 1,2 MCAS Equating Research Report: An Investigation of FCIP-1, FCIP-2, and Stocking and Lord Equating Methods 1,2 Lisa A. Keller, Ronald K. Hambleton, Pauline Parker, Jenna Copella University of Massachusetts

More information

Sampling Problems in Estimating Small Mammal Population Size1

Sampling Problems in Estimating Small Mammal Population Size1 Sampling Problems in Estimating Small Mammal Population Size1 George E. Menkens, Jr.2 and Stanley H. Anderson3 Abstract. -Estimates of population size are influenced by four sources of error: measurement,

More information

Epidemiologic Methods and Counting Infections: The Basics of Surveillance

Epidemiologic Methods and Counting Infections: The Basics of Surveillance Epidemiologic Methods and Counting Infections: The Basics of Surveillance Ebbing Lautenbach, MD, MPH, MSCE University of Pennsylvania School of Medicine Nothing to disclose PENN Outline Definitions / Historical

More information

Methodological aspects of non-inferiority and equivalence trials

Methodological aspects of non-inferiority and equivalence trials Methodological aspects of non-inferiority and equivalence trials Department of Clinical Epidemiology Leiden University Medical Center Capita Selecta 09-12-2014 1 Why RCTs Early 1900: Evidence based on

More information

ICH E9(R1) Technical Document. Estimands and Sensitivity Analysis in Clinical Trials STEP I TECHNICAL DOCUMENT TABLE OF CONTENTS

ICH E9(R1) Technical Document. Estimands and Sensitivity Analysis in Clinical Trials STEP I TECHNICAL DOCUMENT TABLE OF CONTENTS ICH E9(R1) Technical Document Estimands and Sensitivity Analysis in Clinical Trials STEP I TECHNICAL DOCUMENT TABLE OF CONTENTS A.1. Purpose and Scope A.2. A Framework to Align Planning, Design, Conduct,

More information

Checking the counterarguments confirms that publication bias contaminated studies relating social class and unethical behavior

Checking the counterarguments confirms that publication bias contaminated studies relating social class and unethical behavior 1 Checking the counterarguments confirms that publication bias contaminated studies relating social class and unethical behavior Gregory Francis Department of Psychological Sciences Purdue University gfrancis@purdue.edu

More information