This is a repository copy of Practical guide to sample size calculations: superiority trials.

Similar documents
This is a repository copy of Practical guide to sample size calculations: non-inferiority and equivalence trials.

White Rose Research Online URL for this paper: Version: Supplemental Material

This is a repository copy of Antibiotic prophylaxis of endocarditis: a NICE mess.

Article: Min, T and Ford, AC (2016) Efficacy of mesalazine in IBS. Gut, 65 (1). pp ISSN

NEED A SAMPLE SIZE? How to work with your friendly biostatistician!!!

This is a repository copy of Ruling Minds: Psychology in the British Empire, by Erik Linstrum.

This is a repository copy of Mobilising identity through social media: psychosocial support for young people with life-limiting conditions.

This is a repository copy of Testing for asymptomatic bacteriuria in pregnancy..

This is a repository copy of Hospice volunteers as facilitators of public engagement in palliative care priority setting research..

White Rose Research Online URL for this paper: Version: Accepted Version

This is a repository copy of Programmed death 1 expressing regulatory T cells in vitiligo.

This is a repository copy of Early deaths from ischaemic heart disease in childhood-onset type 1 diabetes.

Article: Young, R.J. and Woll, P.J. (2016) Eribulin in soft-tissue sarcoma. Lancet, 387 (10028). pp ISSN

This is a repository copy of Treatment of severe, chronic hand eczema. Results from a UK-wide survey..

This is a repository copy of The Database of Abstracts of Reviews of Effects (DARE).

This is a repository copy of Quadrupling or quintupling inhaled corticosteroid doses to prevent asthma exacerbations: The jury's still out.

White Rose Research Online URL for this paper: Version: Accepted Version

Methodological aspects of non-inferiority and equivalence trials

This is a repository copy of Pinaverium in Irritable Bowel Syndrome: Old Drug, New Tricks?.

This is a repository copy of Psoriasis flare with corticosteroid use in psoriatic arthritis.

Sheila Barron Statistics Outreach Center 2/8/2011

Independent computerised aphasia therapy at home:

Fundamental Clinical Trial Design

This is a repository copy of Measuring the effect of public health campaigns on Twitter: the case of World Autism Awareness Day.

Power & Sample Size. Dr. Andrea Benedetti

TruHearing app - Tinnitus Manager user guide

Clinical trial design issues and options for the study of rare diseases

This is a repository copy of Recommendations on multiple testing adjustment in multi-arm trials with a shared control group.

This is a repository copy of How successful will the sugar levy be in improving diet and reducing inequalities in health?.

City, University of London Institutional Repository

White Rose Research Online URL for this paper: Version: Accepted Version

This is a repository copy of Treating active rheumatoid arthritis with Janus kinase inhibitors..

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

COMMITTEE FOR PROPRIETARY MEDICINAL PRODUCTS (CPMP) POINTS TO CONSIDER ON MISSING DATA

Evidence. Advice, guidance and standards. Evidence

Does Menstrual Hygiene Matter?

Sample Size and Power

Welcome to our e-course on research methods for dietitians

GN Hearing app - Tinnitus Manager user guide

This is a repository copy of Development of Nutritools, an interactive dietary assessment tools website, for use in health research.

This is a repository copy of Scalable anthocyanin extraction and purification methods for industrial applications.

White Rose Research Online URL for this paper: Version: Accepted Version

This is a repository copy of Emotion sharing: implications for trainee doctor well-being.

Objectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests

e.com/watch?v=hz1f yhvojr4 e.com/watch?v=kmy xd6qeass

Statistical Considerations in Pilot Studies

Sample Size and Power. Objectives. Take Away Message. Laura Lee Johnson, Ph.D. Statistician

Dr. Kelly Bradley Final Exam Summer {2 points} Name

White Rose Research Online URL for this paper: Version: Supplemental Material

This is a repository copy of Anti-MAG negative distal acquired demyelinating symmetric neuropathy in association with a neuroendocrine tumor..

Sample size calculations: basic principles and common pitfalls

Business Statistics Probability

Adaptive Treatment Arm Selection in Multivariate Bioequivalence Trials

Statistics for Clinical Trials: Basics of Phase III Trial Design

Sample Size Considerations. Todd Alonzo, PhD

PRINCIPLES OF STATISTICS

SMART Clinical Trial Designs for Dynamic Treatment Regimes

CHAPTER OBJECTIVES - STUDENTS SHOULD BE ABLE TO:

SCHOOL OF MATHEMATICS AND STATISTICS

Screening for ovarian cancer Kehoe, Sean

This is a repository copy of The anorexic voice and severity of eating pathology in anorexia nervosa..

Determining the size of a vaccine trial

To open a CMA file > Download and Save file Start CMA Open file from within CMA

This is a repository copy of Benzodiazepines for Psychosis-Induced Aggression or Agitation.

Bayesian and Frequentist Approaches

Sample size for clinical trials

To open a CMA file > Download and Save file Start CMA Open file from within CMA

CHAPTER III METHODOLOGY

App user guide. resound.com

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

User Guide. For Jacoti Hearing Center Version 1.1. Manufacture Year 2016

RAG Rating Indicator Values

GLOBAL HEALTH. PROMIS Pediatric Scale v1.0 Global Health 7 PROMIS Pediatric Scale v1.0 Global Health 7+2

CHAPTER NINE DATA ANALYSIS / EVALUATING QUALITY (VALIDITY) OF BETWEEN GROUP EXPERIMENTS

Creative Commons Attribution-NonCommercial-Share Alike License

Simple Linear Regression the model, estimation and testing

Live life, less complicated. InPen MOBILE APP. Healthcare Provider INSTRUCTIONS FOR USE. CompanionMedical.com

510(k) submissions. Getting US FDA clearance for your device: Improving

Inferential Statistics

Guidance Document for Claims Based on Non-Inferiority Trials

Implementing scientific evidence into clinical practice guidelines

Study Guide for the Final Exam

PushUpMadness. ISDS 3100 Spring /27/2012. Team Five: Michael Alford Jared Falcon Raymond Fuenzalida Dustin Jenkins

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

This is a repository copy of Microarchitecture of bone predicts fractures in older women.

BIOSTATISTICAL METHODS

Overview. Goals of Interpretation. Methodology. Reasons to Read and Evaluate

Statistical Power Sampling Design and sample Size Determination

Hypothesis testing: comparig means

This is a repository copy of Teaching assistants perspectives of deaf students learning experiences in mainstream secondary classrooms.

Clinical Trial Report Synopsis. Patient insights following use of LEO aerosol foam and Daivobet gel in subjects with psoriasis vulgaris

Results. NeuRA Treatments for internalised stigma December 2017

Student Performance Q&A:

Overview. Survey Methods & Design in Psychology. Readings. Significance Testing. Significance Testing. The Logic of Significance Testing

Ten recommendations for Osteoarthritis and Cartilage (OAC) manuscript preparation, common for all types of studies.

Quick guide for Oticon Opn & Oticon ON App 1.8.0

Investigative Biology (Advanced Higher)

Development of the Web Users Self Efficacy scale (WUSE)

Critical Thinking A tour through the science of neuroscience

Transcription:

This is a repository copy of Practical guide to sample size calculations: superiority trials. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/97114/ Version: Accepted Version Article: Flight, L. orcid.org/0000-0002-9569-8290 and Julious, S.A. (2016) Practical guide to sample size calculations: superiority trials. Pharmaceutical Statistics, 15 (1). pp. 75-79. ISSN 1539-1604 https://doi.org/10.1002/pst.1718 This is the peer reviewed version of the following article: Flight, L., and Julious, S. A. (2016) Practical guide to sample size calculations: superiority trials. Pharmaceut. Statist., 15: 75 79. doi: 10.1002/pst.1718., which has been published in final form at http://dx.doi.org/10.1002/pst.1718. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Self-Archiving (http://olabout.wiley.com/wileycda/section/id-820227.html). Reuse Unless indicated otherwise, fulltext items are protected by copyright with all rights reserved. The copyright exception in section 29 of the Copyright, Designs and Patents Act 1988 allows the making of a single copy solely for the purpose of non-commercial research or private study within the limits of fair dealing. The publisher or other rights-holder may allow further reproduction and re-use of this version - refer to the White Rose Research Online record for this item. Where records identify the publisher as the copyright holder, users can verify any specific terms of use on the publisher s website. Takedown If you consider content in White Rose Research Online to be in breach of UK law, please notify us by emailing eprints@whiterose.ac.uk including the URL of the record and the reason for the withdrawal request. eprints@whiterose.ac.uk https://eprints.whiterose.ac.uk/

Practical guide to sample size calculations: superiority trials Laura Flight 1 and Steven. A. Julious 1 March 24, 2016 Abstract A sample size justification is a vital part of any investigation. However, estimating the number of participants required to give meaningful results is not always as straightforward. A number of components are required to facilitate a suitable sample size. Practical advice and examples are provided illustrating how to carry out a the calculations by hand and using the app SampSize. 1 Introduction The introduction paper in this series highlighted the key general components required to estimate the sample size of a clinical trial [1]. This paper provides a practical guide for applying these steps to superiority parallel group clinical trials where the primary endpoint can be assumed to be Normally distributed. In a superiority trial, the objective is to determine whether there is evidence of a difference in the comparator of interest between the regimens, with reference to the null hypothesis that the regimens are the same. This paper will go through the steps of a sample size calculation for a superiority trial first giving an explanation of superiority trials and their hypotheses. The formulae for sample size calculations are then given, highlighting how each of the components forms part of the calculation. Worked examples are used to demonstrate the operational steps in a sample size calculation. The worked examples will use the app SampSize [2] and byhand solutions to emphasise the ease of calculation. Details of how to obtain the app are provided in Table 1. Table 1: How to obtain the app SampSize The SampSize app is available on the Apple App Store to download for free and can be used on ipod Touch, ipad and iphones. The app is also available on the Android Market. It requires Android version 2.3.3 and above. For the calculations in this paper, an ipad is used. 1

2 Components of a Sample Size In the introduction to this series, the steps needed for a sample size calculation are identified. Table 2 gives a summary of these points. An important step before carrying out any sample size calculation is being able to complete this table. Table 2: Summary of steps required for a sample size calculation, Step Summary Objective Is the trial aiming to show superiority, non-inferiority or equivalence? Endpoint What endpoint will be used to show the primary outcome? Normal, binary, ordinal or time to event? Error Type I error: How much chance are you willing to take of rejecting the null when it is actually true? Type II error: How much chance are you willing to take of not rejecting the null when it is actually false? Effect size What is the minimum difference worth detecting? Population Variance What is the population variability? Other Do you need to account for dropouts? How many patients meet the inclusion criteria? 3 Superiority Clinical Trials In a superiority trial, the objective is to determine whether there is evidence of a difference in the desired outcome between treatment A and treatment B with mean response A and B, respectively. The hypotheses under consideration are: H 0 : The two treatments are not different with respect to the mean response µ A = µ B. H 1 : The two treatments are different with respect to the mean response µ A µ B. For a superiority trial, the null hypothesis can be rejected if A > B or if A < B by a statistically significant amount. This results in two chances of making a type I error. The statistical test is referred to as a two-tailed test with each tail allocated an equal amount of the type I error (2.5%). The sum of these tails is equal to the overall type I error rate of 5%. The null hypothesis can be rejected if the test of A < B is statistically significant at the 2.5% level or the test of A > B is statistically significant at the 2.5% level. This is illustrated in Figure 1. 2

Figure 1: Density plot of superiority trial under the null hypothesis. To design a two group trial, the sample size per arm can be estimated [3] from the formula given in Figure 2. The allocation ratio (r) is such that the number of participants on treatment B is r times the number on treatment A, that is, n B = rn A. Note: n = n A +n B is minimised when r = 1. Table 3: Normal deviates for common percentiles. x Z 1 x 0.200 0.842 0.150 1.036 0.100 1.282 0.050 1.645 0.025 1.960 0.010 2.326 0.001 3.090 Table 3 gives common Normal deviates for different percentiles. For example, for β = 0.1, we would have x = 0.1 and Z 1 x = 1.282, while for α = 0.05, we would have x = 0.025 and Z 1 x = 1.96. These values are useful when calculating a sample size by-hand. Once the trial has been conducted, the data are collected and cleaned for 3

Population variance ( 2 ) Type II error ( ) Type I error ( ) n r Z Z rd Allocation ratio (r) Effect size (d) Figure 2: Formula to calculate the sample size for a superiority parallel group trial with Normally distributed endpoint. analysis, the population variance, σ 2, is usually considered unknown for the analysis, and a sample variance estimate, s 2, is used instead. This is estimated with n A (r+1) 2 degrees of freedom. The following formula can be used [3],[4] ( rna d 1 β = Probt t 1 α 2 2,n A (r +1) 2), 2 ) (r +1σ 2) (1) where Probt( ) is defined as the cumulative density function of a non-central t distribution, taken from the function in the package SAS (SAS Institute, Inc., Cary, NC, USA). This result gives the power for a given sample size. To estimate a sample size for a given power, we must iterate up the sample sizes until the requisite power is reached. The SampSize app uses the non-central t distribution in its calculations. To assist, Table 4 gives sample sizes using (1) for various standardised differences. Although the formula in Figure 2 is easier to calculate by-hand, it gives slightly lower sample sizes when compared with (1). For quick calculations when we have 90% power and a two-sided 5% type I error rate, the following result can be used: n A = 10.5σ2 (r +1) d 2 r (2) 4

Table 4: Normal deviates for common percentiles. δ s 1 2 3 4 0.05 8407 6306 5605 5255 0.10 2103 1577 1402 1314 0.15 935 702 624 585 0.20 527 395 351 329 0.25 338 253 225 211 0.30 235 176 157 147 0.35 173 130 115 108 0.40 133 100 89 83 0.45 105 79 70 66 0.50 86 64 57 53 0.55 71 53 47 44 0.60 60 45 40 37 0.65 51 38 34 32 0.70 44 33 30 28 0.75 39 29 26 24 0.80 34 26 23 21 0.85 31 23 20 19 0.90 27 21 18 17 0.95 25 19 17 15 1.00 23 17 15 14 or for r = 1: n A = 21σ2 d 2 (3) 4 Cactus Example In the CACTUS clinical trial [5], participants suffering from long-standing aphasia post stroke are to be randomised to one of three arms: 1. usual care, 2. self-managed computerised speech and language therapy in addition to usual care and 3. attention control in addition to usual care. The primary objective of the trial is to compare usual care with self-managed computerised therapy. The primary outcome of interest is the change in the number of words named correctly at 6 months. The minimal clinically meaningful difference is improvement in word retrieval of 10%. Based on a pilot study [5], the population standard deviation is assumed to be 17.38%. The dropout rate applied is 15% (95% confidence interval (CI), 5-32%). Assuming 90% power, a two-sided type I error rate of 0.05 and an allocation ratio of r = 1, we can go through the steps now to estimate the sample size. 5

Table 5: Key components required for sample size. Step Summary Objective Superiority: H 0 : µ A = µ B vs H 1 : µ A µ B Endpoint Improvement in word retrieval (normal) Error Type I error α = 0.05 Type II error β = 0.1, power 1 β = 0.9 Effect size d = 10% Population Variance σ = 17.38% Other r = 1 After identifying these key points summarised in Table 5, it is possible to plug the values into the formula in Figure 2. This gives n A = (1+1)(Z 1 0.1 +Z 1 0.05/2 ) 2 (17.38 2 ) 1 10 2 (4) Using the common percentiles from Table III, the sample size is 64 for each group. To apply the 15% dropout rate, we divide by the completion rate (100%- 15%= 85%). This suggests that 76 participants are needed in each arm of the trial. The app SampSize can also be used for the calculation. To use the app, you need to select options: Superiority, Parallel Group, Normal and Calculate Sample Size. You then get to the top screen as given Figure 3. Each component of the sample size calculation can be entered into the app. For ease, the app provides a list of suggested entries you can select from. For example (Figure 4), if you select Significance Level, the next screen gives the options 0.001, 0.01, 0.02, 0.025, 0.03, 0.04, 0.05, 0.075 and 0.10 (sic). Alternatively, you can select the Custom option where it is possible to enter a value for the significance level manually. SampSize gives a sample size of 65 per group. Applying the 15% dropout rate, a total of 154 participants are required to achieve 90% power. The difference in the two sample size estimates arises because SampSize uses a non-central t distribution to estimate the sample size. To use the sample size table, we first need to estimate the standardised difference, δ = d σ = 10 = 0.575 (5) 17.38 Unfortunately, when using the tables, the standardised differences increase in increments of 0.05. As a result, there is no entry in Table 4 for this standardised difference. Standardised differences of δ = 0.55 and δ = 0.60 give sample sizes of 71 and 60, respectively. A sample size of 71 is a conservative estimate, which we would need to amend using SampSize or other calculations. The 95% CI for the dropout rate suggests this value might be as low as 5% or as high as 32%. If we design the study and estimate the evaluable sample size to be 65 (as in the preceding text), accounting for a 15% dropout rate, we need 6

to recruit 77 patients per arm. However, once the trial has begun, we experience a much higher dropout rate of 32%, and our evaluable sample size decreases to 53. We can use the app to estimate the power of our study with this smaller sample size. As before, select a Superiority, Parallel Group trial with a Normal endpoint, but now rather than choosing to calculate a sample size, select to calculate power. With the same inputs as before but with a sample size of 53, the power is estimated to be 83%. Although this is still an acceptable power, it is much less than the proposed 90% power. Figure 3: Screenshot of SampSize app custom settings for a sample size calculation. This worked example illustrates how our power and sample size are sensitive to the dropout rate, and so, attempts to estimate this rate in the planning of the trial should be made. This might be based on the experiences of previous trials recruiting similar patients as with the worked example here or based on similar interventions. 7

Figure 4: Screenshot of SampSize app for CACTUS worked example. 5 Summary This paper has illustrated how to calculate sample sizes for superiority trials where the data are assumed to be Normally distributed. It builds on the steps identified in the first paper of the series for sample size estimation. The paper gives formulae for sample size estimation from quick easy-to-use results to more complicated calculations that require iteration to find a solution. To assist in the calculations, sample size tables and calculations using an app SampSize are described. The subsequent papers in the series applies these ideas to non-inferiority and equivalence [6]. 6 Acknowledgements This report is an independent research arising from a Research Methods Fellowship (RMFI-2013-04-011 Goodacre) supported by the National Institute for Health Research. The views expressed in this publication are those of the authors and not necessarily those of the NHS, the National Institute for Health Research, the Department of Health or the University of Sheffield. We would like to thank two anonymous reviewers, whose considered comments greatly improved the output of the paper. 8

References [1] Flight L, Julious SA. Practical guide to sample size calculations: an introduction. Pharmaceutical Statistics. [2] epigenesys. A University of Sheffield company. Available at https://www.epigenesys.org.uk/portfolio/sampsize//. (accessed 07.07.2015). [3] Julious SA. Sample Sizes for Clinical Trials. Chapman and Hall: Boca Raton, Florida, 2009. [4] Julious SA. Tutorial in Biostatistics: sample sizes for clinical trials with Normal data. Statistics in Medicine 2004; 23:19211986. [5] Palmer R, Enderby P, Cooper C, Latimer N, Julious S, Paterson G, Dimairo M, Dixon S, Mortley J, Hilton R, Delaney A, Hughes H. Computer therapy compared with usual care for people with long-standing aphasia poststroke: a pilot randomized controlled trial.stroke 2012; 43:19041911. [6] Flight L, Julious SA. Practical guide to sample size calculations: noninferiority and equivalence trials. Pharmaceutical Statistics. 9