Clinical Perspective. Interpreting Validity Indexes for Diagnostic Tests: An Illustration Using the Berg Balance Test

Size: px
Start display at page:

Download "Clinical Perspective. Interpreting Validity Indexes for Diagnostic Tests: An Illustration Using the Berg Balance Test"

Transcription

1 Clinical Perspective Interpreting Validity Indexes for Diagnostic Tests: An Illustration Using the Berg Balance Test Physical therapists routinely make diagnostic and prognostic decisions in the course of patient care. The purpose of this clinical perspective is to illustrate what we believe is the optimal method for interpreting the results of studies that describe the diagnostic or prognostic accuracy of examination procedures. To illustrate our points, we chose the Berg Balance Test as an exemplar measure. We combined the data from 2 previously published research reports designed to determine the validity of the Berg Balance Test for predicting risk of falls among elderly people. We calculated the most common validity indexes, including sensitivity, specificity, predictive values, and likelihood ratios for the combined data. Clinical scenarios were used to demonstrate how we believe these validity indexes should be used to guide clinical decisions. We believe therapists should use validity indexes to decrease the uncertainty associated with diagnostic and prognostic decisions. More studies of the accuracy of diagnostic and prognostic tests used by physical therapists are urgently needed. [Riddle DL, Stratford PW. Interpreting validity indexes for diagnostic tests: an illustration using the Berg Balance Test. Phys Ther. 1999;79: ] Key Words: Diagnosis; Tests and measurements, general. Daniel L Riddle Paul W Stratford Downloaded from Physical Therapy. Volume. Number. October

2 Physical therapists routinely perform diagnostic tests on their patients. For diagnostic test results to be most useful, we contend that validity estimates from studies of the diagnostic test in question should be used to guide clinical decisions. The purpose of this perspective is to describe a conceptual model proposed by other authors 1,2 for the application of validity indexes for diagnostic (or prognostic) tests to clinical practice. We use a clinical illustration to demonstrate how measures, which we refer to as validity indexes (ie, sensitivity, specificity, positive and negative predictive values, likelihood ratios), can be interpreted for individual patients. The illustration combines data from 2 studies on the use (validity) of the Berg Balance Test (BBT) for predicting risk of falls among elderly people aged 65 to 94 years. 3,4 The illustration is meant only to demonstrate how validity indexes can be useful for practice and not necessarily to assist clinicians in the examination of patients suspected of having balance disorders. Studies that can be used to determine whether meaningful clinical inferences can be made based on diagnostic tests are classified as criterion-related validity studies. 5 Criterion-related validity studies take 1 of 2 forms. Researchers can compare a clinical measure with a gold standard measure (ideally, a valid diagnostic test or a definitive measure of whether the condition of interest is truly present) obtained at about the same time as the measure being studied. In our illustration, the patient s report of falling is considered the gold standard measure. In other cases, a gold standard measure may be a diagnosis made at the time of surgery or via an invasive diagnostic procedure. Studies in which some form of gold standard is obtained at about the same time as the diagnostic test being studied are commonly called concurrent criterion-related validity studies. 5 Researchers can also compare a measure s prediction of a future event with what actually happens to a patient in the future. These studies are commonly termed predictive criterion-related validity studies. 5 Studies designed to estimate the risk of a future adverse event are often used by clinicians to make judgments about prognoses. For example, investigating whether the How can clinicians use diagnostic test BBT can be used to predict whether a person studies to guide will fall in the clinical decisions for individual patients? future is an illustration of a predictive criterion-related validity study. The gold standard for this type of study would be the subjects report of falls for a period of time following administration of the BBT. The Berg Balance Test The BBT was designed to be an easy-to-administer, safe, simple, and reasonably brief measure of balance for elderly people. The developers expressed the hope that the BBT would be used to monitor the status of a patient s balance and to assess disease course and response to treatment. 6 Patients are asked to complete 14 tasks, and each task is rated by an examiner on a 5-point scale ranging from 0 (cannot perform) to 4 (normal performance). Elements of the test are supposed to be representative of daily activities that require balance, including tasks such as sitting, standing, leaning over, and stepping. Some tasks are rated according to the quality of the performance of the task, whereas the time taken to complete the task is measured for other tasks. The developers of the BBT provided operational definitions for each task and the criteria for grading each task. Overall scores can range from 0 (severely impaired balance) to 56 (excellent balance). Data exist to support the reliability of BBT scores obtained from elderly subjects. 3,6,7 For example, Bogle Thorbahn and Newton 3 reported an intertester reliability (Spearman rho) value of.88 for 17 subjects aged 69 to 94 years. Evidence also exists to support the content validity, 6 construct validity, 7,8 and criterion-related validity 3,4,8 of test scores for inferring fall risk in elderly subjects tested in a variety of settings. Construct validity has been assessed using a variety of approaches. For example, construct validity was supported to the extent that BBT scores were shown to correlate reasonably well with other measures of balance (Pearson r.38.91) and measures of motor performance (Pearson r DL Riddle, PhD, PT, is Associate Professor, Department of Physical Therapy, Medical College of Virginia Campus, Virginia Commonwealth University, 1200 E Broad, Richmond, VA (USA) (driddle@hsc.vcu.edu). Address all correspondence to Dr Riddle. PW Stratford, PT, is Associate Professor, School of Rehabilitation Science, and Associate Member, Department of Clinical Epidemiology and Biostatistics, McMaster University, Hamilton, Ontario, Canada. Concept, writing, and data analysis were provided by Riddle and Stratford. Consultation (including review of manuscript before submitting) was provided by Cheryl Ford-Smith, Susan Cromwell, Dr Roberta Newton, and Dr Anne Shumway-Cook. This article was submitted December 7, 1998, and was accepted July 7, Riddle and Stratford Physical Therapy. Volume 79. Number 10. October 1999 Downloaded from

3 .62.94). 7,8 For example, the Pearson r correlation between the BBT and the balance subscale of the Tinetti Performance-Oriented Mobility Assessment 9 was The Pearson r correlation between the BBT and the Barthel Index mobility subscale 10 was The Illustration To illustrate how to interpret validity indexes, we have combined data from 2 studies 3,4 designed to determine whether BBT scores could identify elderly people (age range years) who are at risk for falling. Subjects in both studies were of similar ages and had similar BBT scores, and the proportions of male and female subjects were also similar (Tab. 1). In both studies, the subjects reported whether they had fallen and the number of falls in the 6 months prior to being admitted to the study. In addition, for both studies, the authors appeared to use essentially the same definition for what constituted a fall. Bogle Thorbahn and Newton 3 defined a fall as an unexpected contact of any part of the body with the ground. Shumway-Cook and colleagues 4 defined a fall as any event that led to an unplanned, unexpected contact with a supporting surface. The 2 studies differed in 2 potentially important ways. First, Shumway-Cook et al 4 excluded subjects with comorbidities that may have affected balance. Bogle Thorbahn et al 3 did not exclude these types of subjects. Subjects in the study by Shumway-Cook and colleagues reported no comorbidities, whereas 38% of the subjects in the study by Bogle Thorbahn and Newton reported having diagnoses of neurological or orthopedic conditions. Second, subjects in the study by Shumway-Cook et al were required to have fallen at least twice in the previous 6 months, whereas subjects in the study by Bogle Thorbahn et al had to have fallen only once or more in the previous 6 months. It is unclear how these differences affected the validity estimates reported by these authors, but we believe the studies were similar enough to allow us to combine the data for the illustration in this article. It is also unclear why the proportion of fallers (50%) in the study by Shumway-Cook et al was much higher than the proportion of fallers (17%) in the study by Bogle Thorbahn and Newton. Diagnostic Test Methodology We believe that the subjects studied (the sample) should represent those types of patients who will be measured during clinical practice. 11 In our illustration, the sample of subjects was elderly people (ages ranging from 65 to 94 years) living independently. Some patients will have the disorder of interest (using our illustration, some subjects reported falls), and some patients will not have the disorder of interest (some reported no falls). The test being studied (ie, the BBT) and the gold standard or criterion measure (ie, determination of whether the subject had Table 1. Characteristics of the Subjects Combined From Two Studies 3,4 Characteristic Study of Shumway- Cook and colleagues 4 (N 44) Age (y) X SD Range Sex (%) Male Female Berg Balance Test X SD Range Gold standard classification of fallers (%) Study of Bogle Thorbahn and Newton 3 (N 66) fallen in the past 6 months) are applied to all subjects, and the test s diagnostic accuracy (Tab. 2) is determined. 9 The results from diagnostic accuracy studies are often summarized in a format similar to that shown in Table In this table, the terms condition present and condition absent are used to identify people who truly have or do not have the condition of interest (the gold standard test is either positive or negative). The letters a, b, c, and d are used to reference cells in the table, and the sums a b, c d, a c, b d, and a b c d denote marginal values. The cell values and marginal values are combined in various ways to calculate validity indexes. Definitions of terms related to diagnostic testing and formulas for the many validity indexes also are presented in Table 2. Sensitivity and Specificity Sensitivity indicates how often a diagnostic test detects a disease or condition when it is present. Sensitivity essentially tells the clinician how good the test is at correctly identifying patients with the condition of interest. Specificity indicates how often a diagnostic test is negative in the absence of the disease or condition. Specificity essentially tells the clinician how good the test is at correctly identifying the absence of disease. 15 The closer the sensitivity or specificity is to 100%, the more sensitive or specific the test. The authors of both studies in our illustration reported the sensitivity and specificity of the BBT for determining current fall risk. Berg et al 8 contended that the best way Downloaded from Physical Therapy. Volume. Number. October 1999 Riddle and Stratford. 941

4 Table 2. Two Two Table, Formulas, and Definitions for Validity Indexes a Gold Standard Test Result Diagnostic Test Result (Condition Present) (Condition Absent) Total True Positive (a) False Positive (b) a b False Negative True Negative c d (c) (d) Total a c b d a b c d Sensitivity: Those people correctly identified by the test as having the condition of interest as a percentage of all those who truly have the condition of interest: [100% (a/[a c])]. Specificity: Those people correctly identified by the test as not having the condition of interest as a percentage of all those who truly do not have the condition of interest: [100% (d/[b d])]. False Positive Rate: Those people falsely identified by the test as having the condition of interest as a percentage of all patients without the condition of interest: [100% (b/b d)]. False Negative Rate: Those people falsely identified by the test as not having the condition of interest as a percentage of all patients with the condition of interest: [100% (c/[a c])]. Positive Predictive Value: Those people correctly identified by the test as having the condition of interest as a percentage of all those identified by the test as having the condition of interest: [100% (a/[a b])]. Negative Predictive Value: Those people correctly identified by the test as not having the condition of interest as a percentage of all those identified by the test as not having the condition of interest: [100% (d/[c d])]. Diagnostic Accuracy: The percentage of people who are correctly diagnosed: [100% (a d)/(a b c d)]. Prevalence: The percentage of people in a target population who truly have the condition of interest: [100% (a c)/(a b c d)]. Likelihood Ratio for a Positive Test: Is sensitivity divided by 1 specificity [{a/(a c)}/{b/(b d)}]. Likelihood Ratio for a Negative Test: Is 1 sensitivity divided by specificity [{c/(a c)}/{d/(b d)}]. Pretest Probability of the Disorder: The therapist s estimate of the patient s chance of having the disorder (condition of interest) prior to the therapist doing the test. It is usually estimated by the clinician based on prior knowledge and experience. Posttest Probability of the Disorder: The patient s chance of having the condition of interest after the results of the test are obtained. a All definitions agree with the Standards for Tests and Measurements in Physical Therapy Practice. 5 Definitions for sensitivity, specificity, false positive rate, false negative rate, positive predictive rate, and negative predictive rate are derived from the Standards for Tests and Measurements in Physical Therapy Practice. 5 Definitions for diagnostic accuracy, prevalence, likelihood ratio for a positive test, likelihood ratio for a negative test, pretest probability of the disorder, and posttest probability of the disorder are derived from Sackett and colleagues. 1,2 to interpret scores on the BBT is to use a single cutoff point of 45 to differentiate those at risk for falls (those with scores of 45) and those who are not at risk for falls (those with scores of 45). Using a cutoff point of 45, as recommended by Berg et al, the sensitivity for the data collected by Shumway-Cook and colleagues 4 was 55% and the specificity was 95%. For the data collected by Bogle Thorbahn and Newton, 3 the sensitivity was 82% and the specificity was 87%. When we combined the data from both studies, a cutoff point of 45 yielded a sensitivity of 64% and a specificity of 90% (Tab. 3). A sensitivity of 64% indicates that 64% of subjects who were true fallers had a positive BBT (a score of 45). That is, approximately a third of the subjects who were fallers were missed by the BBT. Although there are no agreed-on standards for judging sensitivity and specificity, we believe the sensitivity of 64% should generally be considered quite low because more than a third of the subjects were misclassified. A specificity of 90% indicates that 90% of subjects who were nonfallers had a negative BBT (a score of 45). That is, only 10% of the nonfallers were missed by the BBT. Specificity was much higher than sensitivity, indicating that the BBT does a better job of identifying subjects who are not fallers than subjects who are fallers. When we use diagnostic tests, we do not know who has the condition of interest and who does not have the condition of interest. That is, sensitivity and specificity have somewhat limited usefulness because they do not describe validity in the context of the test result. 1 Rather, they describe validity in the context of the gold standard, a value we do not know when we do diagnostic tests. Sensitivity, for example, does not take into account the false positive test results (Tab. 2) on a group of patients. Stated another way, sensitivity does not describe how often patients with positive tests have the disorder of interest. Sensitivity only describes the proportion of patients with the disorder of interest who have a positive test. Similarly, specificity does not take into account false negative test results (Tab. 2). Specificity does not describe how often patients with negative tests do not have the disorder of interest. Specificity only describes the proportion of patients without the disorder of interest who have a negative test. Diagnostic testing, in our view, is used because clinicians want to know the probability of the condition existing. Because clinicians make decisions based on diagnostic test results and not necessarily on results of tests that are considered gold standards, some authors 1 have contended that positive and negative predictive values (see 942. Riddle and Stratford Physical Therapy. Volume 79. Number 10. October 1999 Downloaded from

5 Table 3. Sensitivity and Specificity for Four Cutoff Points of the Berg Balance Test (BBT) BBT Cutoff Point 2 2 Tables for Four BBT Cutoff Points Gold Standard for Cutoff of 40 Gold Standard for Cutoff of 45 Gold Standard for Cutoff of 50 Gold Standard for Cutoff of 55 Fall No Fall Fall No Fall Fall No Fall Fall No Fall 15 a b c d a b c d a b c d56 32a b c d20 Sensitivity a/(a c) 45% 64% 85% 97% Specificity d/(b d) 96% 90% 73% 26% next section) are more important than sensitivity and specificity for clinical practice. Positive and Negative Predictive Values Before diagnostic testing, therapists usually have collected a variety of information (eg, medical history, some examination data) from the patient. Based on their knowledge, training, and experience, therapists can sometimes use these data, depending on what is known about various conditions, to estimate the probability the condition of interest is present. This is known as the pretest probability of the disorder. 1 For example, if a therapist found that an elderly patient had a history of dizziness and required assistance with most activities of daily living, the therapist might anticipate that the patient s risk of falling was quite high, say on the order of 60%. Because the therapist knew evidence existed to indicate that dizziness 16 and difficulty with home activities of daily living 17 increase fall risk, the therapist estimated the pretest probability for falls to be quite high. The pretest probability estimate of 60% is only an estimate and may contain some error. The therapist could then do a BBT to better estimate the patient s risk of falling. Positive and negative predictive values describe the probability of disease after the test is completed. The probability of the condition of interest after the test result is obtained is also known as the posttest probability of the disorder. 1 For many clinicians, the idea of estimating the probability of a disorder prior to doing a diagnostic test (pretest probability) may seem like a new or unusual concept. We believe that some clinicians, based on their experience and training, may use an ordinal-based scale estimate of pretest probability, such as the disease is highly likely, somewhat likely, or not very likely given the patient s signs and symptoms. In our view, however, using percentage estimates of pretest probability is not commonly done by most therapists. We suggest that therapists should make percentage estimates of the pretest probability of the disorder of interest. For example, if a clinician used an ordinal scale similar to the one just described, we contend that the clinician should convert it to a percentage estimate of pretest probability in the following way. If the pretest probability of the disorder were judged to be highly likely, this judgment could be converted to a 75% pretest probability, whereas a rating of somewhat likely could be converted to pretest probability of 50%. A rating of not very likely might be converted to a pretest probability of 25%. We believe that, as therapists become more comfortable with making percentage estimates of pretest probability, they will become more accurate, although we have no data to support this argument. By using percentage estimates for pretest probability, therapists can take full advantage of positive and negative predictive values (and likelihood ratios, to be discussed elsewhere in this article) reported in the literature. Several examples are discussed elsewhere in this article to illustrate how pretest probability can be estimated and how these estimates can influence the interpretation of the diagnostic test. Positive predictive value is the proportion of patients with a positive test who have the condition of interest. 1 Negative predictive value is the proportion of patients with a negative test who do not have the condition of interest. 1 The closer the positive predictive value is to 100%, the more likely the disease is present with a positive test finding. The closer the negative predictive value is to Downloaded from Physical Therapy. Volume. Number. October 1999 Riddle and Stratford. 943

6 Table 4. Validity Estimates for Several Different Cutoff Points of the Berg Balance Test Berg Balance Test Result Positive Predictive Value (95% CI a ) Negative Predictive Value (95% CI) Sensitivity (95% CI) Specificity (95% CI) Positive Likelihood Ratio (95% CI) Negative Likelihood Ratio (95% CI) 35 77% 67% 30% 96% (54 100) (58 76) (14 46) (92 100) ( ) ( ) 40 83% 67% 45% 96% (66 100) (57 77) (28 62) (92 100) ( ) ( ) 45 72% 85% 64% 90% (56 88) (77 93) (48 80) (83 97) ( ) ( ) 50 57% 92% 85% 73% (43 71) (85 99) (73 97) (63 83) ( ) ( ) 55 36% 95% 97% 26% (26 46) (86 100) (91 100) (16 36) ( ) ( ) 60 30% 100% 100% 1% 1.01 Undefined (21 39) (5) (91) (0 3) (1 1.04) a CI confidence interval. 100%, the more likely the disease is absent with a negative test finding. In our illustration, the combined data from both studies yielded a positive predictive value of 72% when using a cutoff point of 45 on the BBT (Tab. 4). A positive predictive value of 72% indicates that 72% of patients with a positive test (a BBT of 45) were classified as fallers (the gold standard) and 28% of the patients were misclassified as fallers based on the BBT, an error rate that we consider to be fairly high. A negative predictive value of 85% indicates that 85% of patients with a negative test (a BBT of 45) were classified as nonfallers (the gold standard). Our misclassification rate for nonfallers is less than for fallers (ie, we can be more confident about identifying nonfallers than fallers based on BBT test results). As with sensitivity and specificity, no standard exists for what constitutes an acceptable level of positive or negative predictive value. In addition, interpretations of predictive values, sensitivity, and specificity are not always straightforward. In the next section, we attempt to describe the critical issues that we believe should be considered when interpreting validity indexes. Issues Related to the Interpretation of Sensitivity, Specificity, and Predictive Values Some tests have a binary outcome (2 mutually exclusive categories such as present or absent ), but many other test results are reported on an ordinal scale (such as the manual muscle test) or a continuous scale (such as the BBT). When using sensitivity, specificity, and predictive values, the researcher is forced to dichotomize results for ordinal and continuous measures (such as the BBT) and, therefore, may lose information about the usefulness of the test. One example is the use of a single cutoff point of 45 for the BBT. We will show later how some researchers have dealt with the problem of only one cutoff point for continuous measures. The choice of the cutoff point influences the sensitivity, specificity, and positive and negative predictive values. This concept is illustrated in Table 4. For example, if the cutoff point for the BBT were set at 40, the sensitivity would be 45% and the specificity would be 96%. With a cutoff point of 50, the sensitivity is 85% and the specificity is 73%. Generally, the choice of cutoff point by the researcher will increase one validity index (eg, sensitivity) but will decrease the other validity index (eg, specificity). For example, when sensitivity rises (as seen when going from a cutoff point of 40 to a cutoff point of 50 on the BBT), specificity falls. The same concept holds for positive and negative predictive values. When the positive predictive value rises (as seen when going from a cutoff point of 50 to a cutoff point of 40 on the BBT), the negative predictive value falls (Tab. 4). The principal factor influencing the clinician s choice of a cutoff point is related to the consequence of misclassifying patients. Broadly speaking, there are 3 choices for a cutoff point: (1) maximize both sensitivity and specificity, (2) maximize sensitivity at the cost of minimizing specificity, and (3) maximize specificity at the cost of minimizing sensitivity. Maximizing sensitivity and specificity is appropriate when the consequences of false positives and false negatives are about equal. Maximizing 944. Riddle and Stratford Physical Therapy. Volume 79. Number 10. October 1999 Downloaded from

7 sensitivity at the cost of minimizing specificity is desirable when the consequence of a false negative (eg, falsely identifying a subject as a nonfaller) exceeds the consequence of a false positive (eg, falsely identifying the subject as a faller). Conversely, maximizing specificity at the cost of minimizing sensitivity is desirable when the consequence of a false positive exceeds the consequence of a false negative. In the case of the BBT, it would appear that sensitivity should be optimized to avoid classifying a faller as a nonfaller. Misclassifying fallers would appear to have serious consequences (eg, fractures). An important advantage associated with the use of sensitivity and specificity is that they are not influenced by prevalence. Prevalence is defined as the proportion of patients with the disorder of interest among all patients tested. 1 A therapist can use sensitivity and specificity estimates from a published report and apply these estimates to a patient as long as the patient is reasonably similar to the subjects in the study. Predictive values should guide clinical decisions (they estimate validity in the context of the test result), but unlike sensitivity and specificity, predictive values are prevalence dependent. 1 That is, as the proportion of those with the disease changes, predictive values also change. Predictive values, therefore, vary when the prevalence of the disorder of interest changes. As the prevalence increases, the positive predictive value increases and the negative predictive value decreases. When the prevalence decreases, the positive predictive value decreases and the negative predictive value increases. Because the chance that an individual patient will have a target disorder varies (ie, the pretest probability changes depending on the patient s signs and symptoms), the prevalence associated with a diagnostic accuracy study may not apply to a given patient. For example, in the study by Shumway-Cook et al, 4 there was a prevalence of fallers of 50%. If, for example, a clinician estimated the pretest probability of falling for a patient to be only 10%, the predictive values from the data of Shumway-Cook et al would not provide accurate estimates of positive or negative predictive values for the patient. The positive predictive value from the data of Shumway-Cook and colleagues would be spuriously high (because of the higher prevalence), and the negative predictive value would be spuriously low for the patient with a pretest probability of 10%. Unfortunately, predictive values are influenced by prevalence, whereas sensitivity and specificity are not. Sensitivity and specificity, however, are related to positive and negative predictive values in the following way. When specificity is high, the positive predictive value tends to be high, and when sensitivity is high, the negative predictive value tends to be high. That is, when sensitivity is high, a negative test generally indicates the disorder is not present (or, in our illustration, the person is not at risk of falling). When specificity is high, a positive test generally indicates the disorder is present (the person is at risk of falling). 2 Table 4 illustrates this concept. When specificity is high, for example, for a BBT cutoff point of 40 (96%), the positive predictive value will generally be high (83%). A clinician might hypothetically believe, for example, that based on medical history and examination data, a patient had a pretest probability of falling of approximately 40% and the patient might subsequently have a score of 37 on the BBT, a score considered positive using a cutoff point of 40 (Tab. 4). The positive predictive value would be 83%, an increase of 43 percentage points from the pretest probability. We contend that the clinician can be reasonably confident the patient is a faller. Similarly, when sensitivity is high (97% for a cutoff point of 55), the negative predictive value will also generally be high (95%). For example, a clinician might believe, based on a patient s medical history and examination data, that the patient had a pretest probability of falling of approximately 40% (or a pretest probability of not falling of 60%). The patient might subsequently have a score of 56 on the BBT, a score considered negative using a cutoff point of 55 (Tab. 4). The negative predictive value (posttest probability) in this hypothetical example would be 95%, and we argue that the clinician can be very confident the patient is not a faller. We noted earlier that predictive values are dependent on prevalence, and in our examples, the prevalence (pretest probability) for falls was estimated to be 40%, a reasonable approximation of the prevalence reported in our illustration using the BBT data. Had the pretest probabilities for the patient examples been appreciably lower or higher, the predictive values reported in the 2 examples above would not have been accurate estimates of posttest probability. In summary, sensitivity and specificity are not dependent on prevalence and are therefore seen as useful for clinical practice. 1 As a general guide, we believe clinicians should conclude the condition is likely to be present when a test is positive and the specificity for the test is high. Conversely, clinicians should conclude the condition is likely to be absent when a test is negative and the sensitivity for the test is high. 1,2 Positive and negative predictive values are, in part, prevalence dependent. As a result, we argue that predictive values are meaningful only when the prevalence reported in a study approximates the pretest probability of the disorder the clinician has estimated for the patient. To be most accurate, pretest probability estimates should be based on sound scientific data. Confidence Intervals for Validity Indexes Sensitivity, specificity, positive and negative predictive values, and likelihood ratios represent point estimates of Downloaded from Physical Therapy. Volume. Number. October 1999 Riddle and Stratford. 945

8 Table 5. Positive Likelihood Ratios for Several Different Intervals of Berg Balance Test Scores Berg Balance Test Result Gold Standard Test Result Positive Negative Number Proportion Number Proportion Positive Likelihood Ratio (95% CI a ) / / ( ) / / ( ) / / ( ) / / ( ) / / ( ) Total a CI confidence interval. population values. 15 Point estimates are estimations of the true value for the index of interest. To determine the accuracy of a point estimate, confidence intervals (CIs) are calculated. 15 Confidence intervals indicate how closely a study s point estimate of these values approximate the population values. 15 Confidence intervals essentially describe for clinicians how confident they can be about a point estimate. For example, if sensitivity was 80%, with a 95% CI of 70% to 90%, the true value for sensitivity in the population (with 95% certainty) lies between 70% and 90%. The width of a CI becomes narrower as the sample size increases, and it becomes wider as the sample size decreases. 15 In addition, the width is dependent on the variability of the measure with the population. 15 The degree of confidence we place on these validity estimates can be calculated. 1,18 In our view, studies that examine the validity of diagnostic tests should provide CI estimates. For example, the 95% CI for specificity reported by Bogle Thorbahn and Newton 3 ranged from 67% (not very specific) to 100% (perfect specificity). The 95% CI for specificity for the combined data from the studies of Bogle Thorbahn and Newton 3 and Shumway-Cook et al 4 ranged from 83% to 97% (both values, in our opinion, represent reasonably high specificity). Likelihood Ratios Positive and negative likelihood ratios are 2 additional validity indexes for diagnostic tests. Likelihood ratios have been proposed to be more efficient and more powerful than sensitivity, specificity, and predictive values. 15,19 Likelihood ratios essentially combine the benefits of both sensitivity and specificity into one index. 1 Likelihood ratios indicate by how much a given diagnostic test result will raise or lower the pretest probability of the target disorder. 20 Likelihood ratios are reported in a decimal number format rather than as percentages. A likelihood ratio of 1 means the posttest probability Likelihood ratios should not be confused with odds ratios. Odds ratios are an estimate of risk often expressed in case-control studies designed to investigate causation of a disease. (probability of the condition after the test results are obtained) for the target disorder is the same as the pretest probability (probability of the condition before the test was done). Likelihood ratios greater than 1 increase the chance the target disorder is present, whereas likelihood ratios less than 1 decrease the chance the target disorder is present. 20 Jaeschke and colleagues 20 proposed the following guide to interpreting likelihood ratios. Likelihood ratios greater than 10 or less than 0.1 generate large and often conclusive changes from pretest to posttest probability. Likelihood ratios between 5 and 10 or between 0.2 and 0.1 generate moderate changes from pretest to posttest probability. Likelihood ratios from 2 to 5 and from 0.5 to 0.2 result in small (but sometimes important) shifts in probability, and likelihood ratios from 0.5 to 2 result in small and rarely important changes in probability. Because likelihood ratios can be applied to score intervals for tests with continuous measures, we believe they are more useful than sensitivity, specificity, and predictive values, which are limited to data presented in a dichotomous format. For example, the positive likelihood ratio for the score interval of 40 to 44 (a test score considered positive based on recommendations of Berg and colleagues 8 ) is 2.8 (Tab. 5). This likelihood ratio indicates that a patient with a BBT score between 40 and 44 is 2.8 times more likely to be a faller than a nonfaller. The 95% CI ranges from 0.9 to 8.5. That is, the 95% CI overlaps 1 (no change in the probability of the disorder); therefore, a clinician cannot be very confident that a score between 40 and 44 increases the probability of identifying a patient at risk for falls. If a patient scores below 40 on the BBT, however, the likelihood ratio increases to 11.7 (95% CI ). A patient with a BBT score below 40 is at greater risk for falls as compared with patients with scores between 40 and 44. On average, patients with BBT scores less than 40 are almost 12 times more likely to be a faller than a nonfaller Riddle and Stratford Physical Therapy. Volume 79. Number 10. October 1999 Downloaded from

9 Applications of Likelihood Ratios to Clinical Practice Likelihood ratios can also be calculated for several different cutoff points of the BBT (Tab. 4). Scores below the cutoff are considered positive tests, and scores above the cutoff are considered to be negative tests. Because the scale is dichotomized when using cutoffs, both positive and negative likelihood ratios can be calculated. For example, given a BBT cutoff point of 40, the positive likelihood ratio is 11.7 (95% CI ). That is, a patient with a score of less than 40 is approximately 12 times more likely to be a faller than a nonfaller. The negative likelihood ratio is 0.6 (95% CI ). That is, a patient with a negative BBT score (score of 40) is 0.6 times as likely to be a faller as a nonfaller. When using a cutoff point of 40, for a negative score (score of 40), a patient is more likely to be a nonfaller than a faller. Based on the data summarized in Table 4, lower cutoffs will usually increase the magnitude of the positive likelihood ratio (a desirable trait), but they will also increase the magnitude of the negative likelihood ratio (an undesirable trait). Another advantage of the use of likelihood ratios is that, along with the use of a nomogram (Figure), a clinician can determine the probability of a disorder, given the result of the test (also called posttest probability ). 21 Because likelihood ratios do not vary when disorder prevalence varies, likelihood ratios can be generalized to other patients. To use the nomogram, the clinician must first estimate the pretest probability of the disorder. The pretest probability of the disorder (likelihood of the presence of the disorder prior to doing the test) is estimated, as mentioned earlier, by the clinician s own clinical training and experience with similar types of patients in the specific setting in which the patients are seen. 2 The constellation of signs and symptoms also influences the clinician s judgment of the pretest probability of the disorder. If we knew the likelihood ratios for each of the medical history items and signs and symptoms of patients, we could repeatedly recalculate the pretest and posttest probability of the disorder of interest and come up with a very accurate estimate of the final posttest probability. 20 Most of these data, unfortunately, are unavailable, so clinicians typically must rely on training, experience, and knowledge of the literature to estimate the pretest probability of the disorder. To use the nomogram, the clinician simply estimates the pretest probability of the disorder and identifies this value in the left-hand column of the nomogram (Figure). A straightedge is then anchored on the left column of the Figure at the pretest probability estimate and aligned on the middle column at the likelihood ratio. The right column indicates the posttest probability. To demonstrate how likelihood ratios and the nomogram can be used to guide clinical decision making, we Figure. Nomogram for interpreting diagnostic test results. Reprinted with permission from Fagan. 21 Copyright 1975, Massachusetts Medical Society. All rights reserved. will apply our concept and argument to 2 hypothetical situations. For the first example, assume your 67-year-old patient lived alone in her home and was independent and relatively active. Her only comorbidity was that she had a hip joint replacement 1 year prior to testing. The therapist suspected the pretest probability of the disor- Downloaded from Physical Therapy. Volume. Number. October 1999 Riddle and Stratford. 947

10 der (falls, in this case) would be relatively low, perhaps on the order of 20%. The patient then had a BBT done and a score of 50 (a negative test, using a cutoff point of 50) was obtained. The negative likelihood ratio for a cutoff point of 50 is 0.2 (Tab. 4). We align a ruler with the left column of the nomogram (Figure) at 20 (20% pretest probability) and with the middle column at a likelihood ratio of approximately 0.2. We find that the posttest probability of current fall risk for this patient is approximately 5%, an improvement of 15 percentage points from the pretest probability (the chance of the patient being a faller has gone from 20% down to 5%). Hypothetically, we substantially increased our level of certainty about the patient s current risk of falling based on the BBT score. Our second hypothetical example is about a 75-year-old man who was diagnosed with congestive heart failure approximately 5 years previously and requires assistance with some activities of daily living. He reports losing his balance occasionally and remembers falling once in the past few years. Based on the patient s medical history and functional status, the pretest probability for falls would be fairly high (ie, on the order of 50%). A BBT was done, and a score of 38 (a positive test, using a cutoff point of 40) was obtained. Using the data in Table 4, the positive likelihood ratio for a score of less than 40 is That is, this patient is 11.7 times more likely to be a faller than a nonfaller. Using the nomogram shown in the Figure, the posttest probability for current fall risk is approximately 92%, an increase of 42 percentage points above the pretest probability. If we believe our data are correct and our estimates are appropriate, we can theoretically be confident that we have identified a patient who has a very high probability of falling. We again appear to have substantially increased our level of certainty about the patient s risk of falling. Summary Validity indexes for diagnostic tests were reviewed, and terms used in studies designed to describe the validity of diagnostic tests were defined. Data from 2 studies examining the validity of measurements obtained with the BBT for inferring current fall risk were used as an illustration to demonstrate how clinicians could use diagnostic test studies to guide clinical decisions for individual patients. Unfortunately, there are only a small number of diagnostic test studies describing the validity of examination procedures commonly used by physical therapists. There is an urgent need to conduct more studies of the usefulness of diagnostic and prognostic tests in physical therapy. Acknowledgments We thank Dr Anne Shumway-Cook, Linda Thorbahn, and Dr Roberta Newton for their insights and for allowing us to use their data in this article. We also thank Cheryl Ford-Smith and Sue Cromwell for reviewing an earlier version of the manuscript. References 1 Sackett DL, Haynes RB, Guyatt GH, Tugwell P. Clinical Epidemiology: A Basic Science for Clinical Medicine. 2nd ed. Boston, Mass: Little, Brown and Co Inc; 1991: Sackett DL, Richardson WS, Rosenberg W, Haynes RB. Evidence-based Medicine: How to Practice and Teach EBM. New York, NY: Churchill Livingstone Inc; Bogle Thorbahn LD, Newton RA. Use of the Berg Balance Test to predict falls in elderly persons. Phys Ther. 1996;76: Shumway-Cook A, Baldwin M, Polissar NL, Gruber W. Predicting the probability for falls in community-dwelling older adults. Phys Ther. 1997;77: Task Force on Standards for Measurement in Physical Therapy. Standards for tests and measurements in physical therapy practice. Phys Ther. 1991;71: Berg KO, Wood-Dauphinée SL, Williams JI, Gayton D. Measuring balance in the elderly: preliminary development of an instrument. Physiotherapy Canada. 1989;41: Berg KO, Maki BE, Williams JI, et al. Clinical and laboratory measures of postural balance in an elderly population. Arch Phys Med Rehabil. 1992;73: Berg KO, Wood-Dauphinée SL, Williams JI, Maki B. Measuring balance in the elderly: validation of an instrument. Can J Public Health. 1992;83(suppl 2):S7 S11. 9 Tinetti ME. Performance-oriented assessment of mobility problems in elderly patients. J Am Geriatr Soc. 1986;34: Mahoney FL, Barthel DW. Functional evaluation: the Barthel index. Md State Med J. 1965;14: Department of Clinical Epidemiology and Biostatistics, McMaster University. How to read clinical journals, II: to learn about a diagnostic test. Can Med Assoc J. 1981:124: Department of Clinical Epidemiology and Biostatistics, McMaster University. Interpretation of diagnostic data, 2: how to do it with a simple table (part A). Can Med Assoc J. 1983:129: Department of Clinical Epidemiology and Biostatistics, McMaster University. Interpretation of diagnostic data, 2: how to do it with a simple table (part B). Can Med Assoc J. 1983:129: Department of Clinical Epidemiology and Biostatistics, McMaster University. Interpretation of diagnostic data, 2: how to do it with simple math. Can Med Assoc J. 1983:129: Sackett DL. A primer on the precision and accuracy of the clinical examination. JAMA. 1992;267: Luukinen H, Koski K, Kivela SL, Laippala P. Social status, life changes, housing conditions, health, functional abilities, and lifestyle as risk factors for recurrent falls among the home-dwelling elderly. Public Health. 1996;110: Tinetti ME, Speechley M, Ginter SF. Risk factors for falls among elderly persons living in the community. N Engl J Med. 1988;319: Colton T. Statistics in Medicine. Boston, Mass: Little, Brown and Co Inc; 1974: Crombie DL. Diagnostic process. J Coll Gen Prac. 1963;6: Jaeschke R, Guyatt GH, Sackett DL. Users guides to the medical literature, III: how to use an article about a diagnostic test, B: What are the results and will they help me in caring for my patients? JAMA. 1994;271: Fagan TJ. Nomogram for Bayes theorem [letter]. N Engl J Med. 1975;293: Riddle and Stratford Physical Therapy. Volume 79. Number 10. October 1999 Downloaded from

The recommended method for diagnosing sleep

The recommended method for diagnosing sleep reviews Measuring Agreement Between Diagnostic Devices* W. Ward Flemons, MD; and Michael R. Littner, MD, FCCP There is growing interest in using portable monitoring for investigating patients with suspected

More information

Interval Likelihood Ratios: Another Advantage for the Evidence-Based Diagnostician

Interval Likelihood Ratios: Another Advantage for the Evidence-Based Diagnostician EVIDENCE-BASED EMERGENCY MEDICINE/ SKILLS FOR EVIDENCE-BASED EMERGENCY CARE Interval Likelihood Ratios: Another Advantage for the Evidence-Based Diagnostician Michael D. Brown, MD Mathew J. Reeves, PhD

More information

Research Report. Predicting the Probability for Falls in Community-Dwelling Older Adults Using the Timed Up & Go Test

Research Report. Predicting the Probability for Falls in Community-Dwelling Older Adults Using the Timed Up & Go Test Research Report Predicting the Probability for Falls in Community-Dwelling Older Adults Using the Timed Up & Go Test Background and Purpose. This study examined the sensitivity and specificity of the Timed

More information

EBM Diagnosis. Denise Campbell-Scherer Stefanie R. Brown. Departments of Medicine and Pediatrics University of Miami Miller School of Medicine

EBM Diagnosis. Denise Campbell-Scherer Stefanie R. Brown. Departments of Medicine and Pediatrics University of Miami Miller School of Medicine EBM Diagnosis Denise Campbell-Scherer Stefanie R. Brown Departments of Medicine and Pediatrics University of Miami Miller School of Medicine Department of Family Medicine University of Alberta Canada Mission

More information

Please demonstrate each task and/or give instructions as written. When scoring, please record the lowest response category that applies for each item.

Please demonstrate each task and/or give instructions as written. When scoring, please record the lowest response category that applies for each item. Berg Balance Test Name Date Location Rater GENERAL INSTRUCTIONS Please demonstrate each task and/or give instructions as written. When scoring, please record the lowest response category that applies for

More information

CORE MEASURE: CORE MEASURE: BERG BALANCE SCALE (BBS)

CORE MEASURE: CORE MEASURE: BERG BALANCE SCALE (BBS) OVERVIEW NUMBER OF TEST ITEMS SCORING EQUIPMENT TIME (NEW CLINICIAN) TIME (EXPERIENCED CLINICIAN) COST o The BBS is a widely-used, clinician-rated scale used to assess sitting and standing, static and

More information

Essential Skills for Evidence-based Practice: Statistics for Therapy Questions

Essential Skills for Evidence-based Practice: Statistics for Therapy Questions Essential Skills for Evidence-based Practice: Statistics for Therapy Questions Jeanne Grace Corresponding author: J. Grace E-mail: Jeanne_Grace@urmc.rochester.edu Jeanne Grace RN PhD Emeritus Clinical

More information

Diagnosing Anaemia. Conjunctival pallor. Results

Diagnosing Anaemia. Conjunctival pallor. Results Diagnosing Anaemia Conjunctival pallor Results Useful information? Serum ferritin predicts iron-deficiency anaemia Serum ferritin results Comment Bandolier is always on the lookout for good papers which

More information

Overview The BBS is a widely-used, clinician-rated scale used to assess sitting and standing, static and dynamic balance.

Overview The BBS is a widely-used, clinician-rated scale used to assess sitting and standing, static and dynamic balance. Core Measure: Berg Balance Scale (BBS) Overview The BBS is a widely-used, clinician-rated scale used to assess sitting and standing, static and dynamic balance. Number of Test Items The BBS consists of

More information

The Role of Likelihood Ratio in Clinical Diagnosis: Applicability in the Setting of Spontaneous Bacterial Peritonitis

The Role of Likelihood Ratio in Clinical Diagnosis: Applicability in the Setting of Spontaneous Bacterial Peritonitis CLINICAL GASTROENTEROLOGY AND HEPATOLOGY 2005;3:85 89 EVIDENCE-BASED MEDICINE The Role of Likelihood Ratio in Clinical Diagnosis: Applicability in the Setting of Spontaneous Bacterial Peritonitis FERNANDO

More information

Susan W. Muir PT PhD. Post-Doctoral Fellow Division of Geriatric Medicine Schulich School of Medicine & Dentistry University of Western Ontario

Susan W. Muir PT PhD. Post-Doctoral Fellow Division of Geriatric Medicine Schulich School of Medicine & Dentistry University of Western Ontario Susan W. Muir PT PhD Post-Doctoral Fellow Division of Geriatric Medicine Schulich School of Medicine & Dentistry University of Western Ontario University of Toronto Rehabilitation Rounds June 14, 2012

More information

Rating Scale Analysis of the Berg Balance Scale

Rating Scale Analysis of the Berg Balance Scale 1128 Rating Scale Analysis of the Berg Balance Scale Diana L. Kornetti, MA, PT, Stacy L. Fritz, MSPT, Yi-Po Chiu, MHS, PT, Kathye E. Light, PhD, PT, Craig A. Velozo, PhD, OTR ABSTRACT. Kornetti DL, Fritz

More information

Diagnostic research in perspective: examples of retrieval, synthesis and analysis Bachmann, L.M.

Diagnostic research in perspective: examples of retrieval, synthesis and analysis Bachmann, L.M. UvA-DARE (Digital Academic Repository) Diagnostic research in perspective: examples of retrieval, synthesis and analysis Bachmann, L.M. Link to publication Citation for published version (APA): Bachmann,

More information

University of Groningen. Maintaining balance in elderly fallers Swanenburg, Jaap

University of Groningen. Maintaining balance in elderly fallers Swanenburg, Jaap University of Groningen Maintaining balance in elderly fallers Swanenburg, Jaap IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please

More information

Medical doctors perception of the number needed to treat (NNT)

Medical doctors perception of the number needed to treat (NNT) æoriginal PAPER Medical doctors perception of the number needed to treat () A survey of doctors recommendations for two therapies with different Peder Andreas Halvorsen 1, Ivar Sønbø Kristiansen 2, Olaf

More information

Assessments of Interrater Reliability and Internal Consistency of the Norwegian Version of the Berg Balance Scale

Assessments of Interrater Reliability and Internal Consistency of the Norwegian Version of the Berg Balance Scale 94 ORIGINAL ARTICLE Assessments of Interrater Reliability and Internal Consistency of the Norwegian Version of the Berg Balance Scale Karin E. Halsaa, PT, Therese Brovold, PT, Vibeke Graver, PhD, PT, Leiv

More information

Created in January 2005 Duration: approx. 20 minutes

Created in January 2005 Duration: approx. 20 minutes 1 1 The Timed Up and Go Test Created in January 2005 Duration: approx. 20 minutes 2 Credits 2005 Stein Gerontological Institute. All rights reserved. Principal medical contributors: Alan Katz, MD Francois

More information

THE FUNCTIONAL REACH TEST (FRT) is a valuable

THE FUNCTIONAL REACH TEST (FRT) is a valuable 538 Is the Functional Reach Test Useful for Identifying Falls Risk Among Individuals With Parkinson s Disease? Andrea L. Behrman, PhD, PT, Kathye E. Light, PhD, PT, Sheryl M. Flynn, PhD, PT, Mary T. Thigpen,

More information

Searching for Clinical Prediction Rules in MEDLINE

Searching for Clinical Prediction Rules in MEDLINE 391 Research Paper Searching for Clinical Prediction Rules in MEDLINE BETTE JEAN INGUI, MLS, MARY A. M. ROGERS, PHD, MS Abstract Objectives: Clinical prediction rules have been advocated as a possible

More information

Methods of Diagnosing Sleep Apnea. The Diagnosis of Sleep Apnea: Questionnaires and Home Studies

Methods of Diagnosing Sleep Apnea. The Diagnosis of Sleep Apnea: Questionnaires and Home Studies Sleep, 19(10):S243-S247 1996 American Sleep Disorders Association and Sleep Research Society Methods of Diagnosing Sleep Apnea J The Diagnosis of Sleep Apnea: Questionnaires and Home Studies W. Ward Flemons

More information

OCW Epidemiology and Biostatistics, 2010 Michael D. Kneeland, MD November 18, 2010 SCREENING. Learning Objectives for this session:

OCW Epidemiology and Biostatistics, 2010 Michael D. Kneeland, MD November 18, 2010 SCREENING. Learning Objectives for this session: OCW Epidemiology and Biostatistics, 2010 Michael D. Kneeland, MD November 18, 2010 SCREENING Learning Objectives for this session: 1) Know the objectives of a screening program 2) Define and calculate

More information

RACE611 CLINICAL EPIDEMIOLOGY AND EVIDENCE-BASED MEDICINE Prognostic study

RACE611 CLINICAL EPIDEMIOLOGY AND EVIDENCE-BASED MEDICINE Prognostic study RACE611 CLINICAL EPIDEMIOLOGY AND EVIDENCE-BASED MEDICINE Prognostic study Assoc.Prof.Dr.Atiporn Ingsathit Master of Science Program in Medical Epidemiology and Doctor of Philosophy Program in Clinical

More information

Perspective. Making Geriatric Assessment Work: Selecting Useful Measures. Key Words: Geriatric assessment, Physical functioning.

Perspective. Making Geriatric Assessment Work: Selecting Useful Measures. Key Words: Geriatric assessment, Physical functioning. Perspective Making Geriatric Assessment Work: Selecting Useful Measures Often the goal of physical therapy is to reduce morbidity and prevent or delay loss of independence. The purpose of this article

More information

Sensitivity and Specificity of the Minimal Chair Height Standing Ability Test: A Simple and Affordable Fall-Risk Screening Instrument

Sensitivity and Specificity of the Minimal Chair Height Standing Ability Test: A Simple and Affordable Fall-Risk Screening Instrument Sensitivity and Specificity of the Minimal Chair Height Standing Ability Test: A Simple and Affordable Fall-Risk Screening Instrument By: Nadia C. Reider, MSc ; Patti-Jean Naylor, PhD ; Catherine Gaul,

More information

SPECIAL COMMUNICATION

SPECIAL COMMUNICATION Evidence-Based Medicine, Part 2. An Introduction to Critical Appraisal of Articles on Therapy Roberto Cardarelli, DO, MPH Richard F. Virgilio, DO Lockwood Taylor, MPH This article provides an introductory

More information

Journal of Pediatric Sciences

Journal of Pediatric Sciences Journal of Pediatric Sciences Pediatric Residents Knowledge of Evidence Based Medicine: A Pilot Study Hasan Alshabanah, Bosco Paes, Rafat Mosalli Journal of Pediatric Sciences 2010;2:e6 How to cite this

More information

Research Report. Determinants of Balance Confidence in Community-Dwelling Elderly People

Research Report. Determinants of Balance Confidence in Community-Dwelling Elderly People Research Report Determinants of Balance Confidence in Community-Dwelling Elderly People Background and Purpose. The fear of falling can have detrimental effects on physical function in the elderly population,

More information

Interventions, Effects, and Outcomes in Occupational Therapy

Interventions, Effects, and Outcomes in Occupational Therapy Interventions, Effects, and Outcomes in Occupational Therapy ADULTS AND OLDER ADULTS Instructor s Manual Learning Activities Mary Law, PhD, FCAOT Professor and Associate Dean Rehabilitation Science McMaster

More information

Research Report. A Comparison of Five Low Back Disability Questionnaires: Reliability and Responsiveness

Research Report. A Comparison of Five Low Back Disability Questionnaires: Reliability and Responsiveness Research Report A Comparison of Five Low Back Disability Questionnaires: Reliability and Responsiveness APTA is a sponsor of the Decade, an international, multidisciplinary initiative to improve health-related

More information

This article is the second in a series in which I

This article is the second in a series in which I COMMON STATISTICAL ERRORS EVEN YOU CAN FIND* PART 2: ERRORS IN MULTIVARIATE ANALYSES AND IN INTERPRETING DIFFERENCES BETWEEN GROUPS Tom Lang, MA Tom Lang Communications This article is the second in a

More information

William C Miller, PhD, FCAOT Professor Occupational Science & Occupational Therapy University of British Columbia Vancouver, BC, Canada

William C Miller, PhD, FCAOT Professor Occupational Science & Occupational Therapy University of British Columbia Vancouver, BC, Canada William C Miller, PhD, FCAOT Professor Occupational Science & Occupational Therapy University of British Columbia Vancouver, BC, Canada THE L TEST MANUAL Version: November 2014 Table of Contents Introduction...

More information

Total testing process applied to therapeutic drug monitoring: impact on patients outcomes and economics

Total testing process applied to therapeutic drug monitoring: impact on patients outcomes and economics Clinical Chemistry 44:2 370 374 (1998) TDM Conference Total testing process applied to therapeutic drug monitoring: impact on patients outcomes and economics Gerald E. Schumacher* and Judith T. Barr The

More information

Gait dysfunction is a particularly prevalent and important

Gait dysfunction is a particularly prevalent and important Modified Emory Functional Ambulation Profile An Outcome Measure for the Rehabilitation of Poststroke Gait Dysfunction Heather R. Baer, MD; Steven L. Wolf, PhD, PT, FAPTA Background and Purpose The modified

More information

University of Wollongong. Research Online. Australian Health Services Research Institute

University of Wollongong. Research Online. Australian Health Services Research Institute University of Wollongong Research Online Australian Health Services Research Institute Faculty of Business 2011 Measurement of error Janet E. Sansoni University of Wollongong, jans@uow.edu.au Publication

More information

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Ball State University

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Ball State University PEER REVIEW HISTORY BMJ Open publishes all reviews undertaken for accepted manuscripts. Reviewers are asked to complete a checklist review form (see an example) and are provided with free text boxes to

More information

Discrimination Weighting on a Multiple Choice Exam

Discrimination Weighting on a Multiple Choice Exam Proceedings of the Iowa Academy of Science Volume 75 Annual Issue Article 44 1968 Discrimination Weighting on a Multiple Choice Exam Timothy J. Gannon Loras College Thomas Sannito Loras College Copyright

More information

An introduction to power and sample size estimation

An introduction to power and sample size estimation 453 STATISTICS An introduction to power and sample size estimation S R Jones, S Carley, M Harrison... Emerg Med J 2003;20:453 458 The importance of power and sample size estimation for study design and

More information

Using Number Needed to Treat to Interpret Treatment Effect

Using Number Needed to Treat to Interpret Treatment Effect Continuing Medical Education 20 Using Number Needed to Treat to Interpret Treatment Effect Der-Shin Ke Abstract- Evidence-based medicine (EBM) has rapidly emerged as a new paradigm in medicine worldwide.

More information

MCAS Equating Research Report: An Investigation of FCIP-1, FCIP-2, and Stocking and. Lord Equating Methods 1,2

MCAS Equating Research Report: An Investigation of FCIP-1, FCIP-2, and Stocking and. Lord Equating Methods 1,2 MCAS Equating Research Report: An Investigation of FCIP-1, FCIP-2, and Stocking and Lord Equating Methods 1,2 Lisa A. Keller, Ronald K. Hambleton, Pauline Parker, Jenna Copella University of Massachusetts

More information

EBP ASKING. Constructing a Good Clinical Question Using the PICO Format

EBP ASKING. Constructing a Good Clinical Question Using the PICO Format EBP ASKING Constructing a Good Clinical Question Using the PICO Format Objectives: To demonstrate understanding of a good clinical question. To distinguish between a background question, usually answerable

More information

External validation of abbreviated versions of the activities-specific balance confidence scale in Parkinson's disease

External validation of abbreviated versions of the activities-specific balance confidence scale in Parkinson's disease Washington University School of Medicine Digital Commons@Becker Physical Therapy Faculty Publications Program in Physical Therapy 2010 External validation of abbreviated versions of the activities-specific

More information

Consider the following hypothetical

Consider the following hypothetical Nurse Educator Nurse Educator Vol. 32, No. 1, pp. 16-20 Copyright! 2007 Wolters Kluwer Health Lippincott Williams & Wilkins How to Read, Interpret, and Understand Evidence-Based Literature Statistics Dorette

More information

Psychology, 2010, 1: doi: /psych Published Online August 2010 (

Psychology, 2010, 1: doi: /psych Published Online August 2010 ( Psychology, 2010, 1: 194-198 doi:10.4236/psych.2010.13026 Published Online August 2010 (http://www.scirp.org/journal/psych) Using Generalizability Theory to Evaluate the Applicability of a Serial Bayes

More information

Agenetic disorder serious, perhaps fatal without

Agenetic disorder serious, perhaps fatal without ACADEMIA AND CLINIC The First Positive: Computing Positive Predictive Value at the Extremes James E. Smith, PhD; Robert L. Winkler, PhD; and Dennis G. Fryback, PhD Computing the positive predictive value

More information

Chapter 11. Experimental Design: One-Way Independent Samples Design

Chapter 11. Experimental Design: One-Way Independent Samples Design 11-1 Chapter 11. Experimental Design: One-Way Independent Samples Design Advantages and Limitations Comparing Two Groups Comparing t Test to ANOVA Independent Samples t Test Independent Samples ANOVA Comparing

More information

FUNCTIONAL CONSISTENCY IN THE FACE OF TOPOGRAPHICAL CHANGE IN ARTICULATED THOUGHTS Kennon Kashima

FUNCTIONAL CONSISTENCY IN THE FACE OF TOPOGRAPHICAL CHANGE IN ARTICULATED THOUGHTS Kennon Kashima Journal of Rational-Emotive & Cognitive-Behavior Therapy Volume 7, Number 3, Fall 1989 FUNCTIONAL CONSISTENCY IN THE FACE OF TOPOGRAPHICAL CHANGE IN ARTICULATED THOUGHTS Kennon Kashima Goddard College

More information

SPECIAL COMMUNICATION

SPECIAL COMMUNICATION Evidence-Based Medicine, Part 5. An Introduction to Critical Appraisal of Articles on Prognosis Roberto Cardarelli, DO, MPH Joseph R. Oberdorfer, OMS IV This article provides an introductory step-by-step

More information

Data that can be classified as belonging to a distinct number of categories >>result in categorical responses. And this includes:

Data that can be classified as belonging to a distinct number of categories >>result in categorical responses. And this includes: This sheets starts from slide #83 to the end ofslide #4. If u read this sheet you don`t have to return back to the slides at all, they are included here. Categorical Data (Qualitative data): Data that

More information

Essential Skills for Evidence-based Practice Understanding and Using Systematic Reviews

Essential Skills for Evidence-based Practice Understanding and Using Systematic Reviews J Nurs Sci Vol.28 No.4 Oct - Dec 2010 Essential Skills for Evidence-based Practice Understanding and Using Systematic Reviews Jeanne Grace Corresponding author: J Grace E-mail: Jeanne_Grace@urmc.rochester.edu

More information

Evidence Based Practice (EBP) Five Step Process EBM. A Definition of EBP 10/13/2009. Fall

Evidence Based Practice (EBP) Five Step Process EBM. A Definition of EBP 10/13/2009. Fall What is EBP? Classic Definition of Evidence Based Medicine (EBM) By Aaron Eakman PTOT 413/513 OT Profession Fall 2009 the explicit, judicious and conscientious use of current best evidence from health

More information

The Regression-Discontinuity Design

The Regression-Discontinuity Design Page 1 of 10 Home» Design» Quasi-Experimental Design» The Regression-Discontinuity Design The regression-discontinuity design. What a terrible name! In everyday language both parts of the term have connotations

More information

The Problem With Sensitivity and Specificity

The Problem With Sensitivity and Specificity EVIDENCE-BASED EMERGENCY MEDICINE/EDITORIAL E. John, MD From the Department of Emergency Medicine, Albert Einstein College of Medicine, Bronx, NY. The Problem With Sensitivity and Specificity See related

More information

Neuro-Oncology Practice

Neuro-Oncology Practice Neuro-Oncology Practice Neuro-Oncology Practice 2(4), 162 166, 2015 doi:10.1093/nop/npv030 Advance Access date 7 September 2015 Diagnostic tests: how to estimate the positive predictive value Annette M.

More information

Continence, falls and the frailty syndrome. Anne Foley - BGS Bladders and Bowel Health 2012

Continence, falls and the frailty syndrome. Anne Foley - BGS Bladders and Bowel Health 2012 Continence, falls and the frailty syndrome Outline Frailty Geriatric syndromes and giants Aetiology What can be done? The future Frailty Frailty Frailty (noun): The state of being weak in health or body

More information

AOTA S EVIDENCE EXCHANGE CRITICALLY APPRAISED PAPER (CAP) GUIDELINES Annual AOTA Conference Poster Submissions Critically Appraised Papers (CAPs) are

AOTA S EVIDENCE EXCHANGE CRITICALLY APPRAISED PAPER (CAP) GUIDELINES Annual AOTA Conference Poster Submissions Critically Appraised Papers (CAPs) are AOTA S EVIDENCE EXCHANGE CRITICALLY APPRAISED PAPER (CAP) GUIDELINES Annual AOTA Conference Poster Submissions Critically Appraised Papers (CAPs) are at-a-glance summaries of the methods, findings and

More information

THERAPEUTIC REASONING

THERAPEUTIC REASONING THERAPEUTIC REASONING Christopher A. Klipstein (based on material originally prepared by Drs. Arthur Evans and John Perry) Objectives: 1) Learn how to answer the question: What do you do with the post

More information

Two-sample Categorical data: Measuring association

Two-sample Categorical data: Measuring association Two-sample Categorical data: Measuring association Patrick Breheny October 27 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 40 Introduction Study designs leading to contingency

More information

Validity, responsiveness and the minimal clinically important difference for the de Morton Mobility Index (DEMMI) in an older acute medical population

Validity, responsiveness and the minimal clinically important difference for the de Morton Mobility Index (DEMMI) in an older acute medical population RESEARCH ARTICLE Open Access Validity, responsiveness and the minimal clinically important difference for the de Morton Mobility Index (DEMMI) in an older acute medical population Natalie A de Morton 1,2*,

More information

Issues in the Development, Practice, Training, and Research of Integrative Therapies ABSTRACT

Issues in the Development, Practice, Training, and Research of Integrative Therapies ABSTRACT Issues in the Development, Practice, Training, and Research of Integrative Therapies 52 Commentary on The Case of Ms. Q: A Demonstration of Integrative Psychotherapy Guided by Core Clinical Hypotheses

More information

STUDIES OF THE ACCURACY OF DIAGNOSTIC TESTS: (Relevant JAMA Users Guide Numbers IIIA & B: references (5,6))

STUDIES OF THE ACCURACY OF DIAGNOSTIC TESTS: (Relevant JAMA Users Guide Numbers IIIA & B: references (5,6)) STUDIES OF THE ACCURACY OF DIAGNOSTIC TESTS: (Relevant JAMA Users Guide Numbers IIIA & B: references (5,6)) Introduction: The most valid study design for assessing the accuracy of diagnostic tests is a

More information

Evidence-Based Medicine: Diagnostic study

Evidence-Based Medicine: Diagnostic study Evidence-Based Medicine: Diagnostic study What is Evidence-Based Medicine (EBM)? Expertise in integrating 1. Best research evidence 2. Clinical Circumstance 3. Patient values in clinical decisions Haynes,

More information

Reliability, validity, and all that jazz

Reliability, validity, and all that jazz Reliability, validity, and all that jazz Dylan Wiliam King s College London Introduction No measuring instrument is perfect. The most obvious problems relate to reliability. If we use a thermometer to

More information

Evidence Based Medicine Prof P Rheeder Clinical Epidemiology. Module 2: Applying EBM to Diagnosis

Evidence Based Medicine Prof P Rheeder Clinical Epidemiology. Module 2: Applying EBM to Diagnosis Evidence Based Medicine Prof P Rheeder Clinical Epidemiology Module 2: Applying EBM to Diagnosis Content 1. Phases of diagnostic research 2. Developing a new test for lung cancer 3. Thresholds 4. Critical

More information

9. Interpret a Confidence level: "To say that we are 95% confident is shorthand for..

9. Interpret a Confidence level: To say that we are 95% confident is shorthand for.. Mrs. Daniel AP Stats Chapter 8 Guided Reading 8.1 Confidence Intervals: The Basics 1. A point estimator is a statistic that 2. The value of the point estimator statistic is called a and it is our "best

More information

Providing High Value Cost-Conscious Care:

Providing High Value Cost-Conscious Care: Providing High Value Cost-Conscious Care: Biostatistical Concepts You Need to Know 2012-2013 Presentation #5 0f 10 http://hvc.acponline.org/ Learning Objectives Understand that a working knowledge of basic

More information

Fabio La Porta, MD 1, Marco Franceschini, MD 2, Serena Caselli, PT 1, Sonia Susassi, PT 1, Paola Cavallini, PT 1 and Alan Tennant, PhD 3

Fabio La Porta, MD 1, Marco Franceschini, MD 2, Serena Caselli, PT 1, Sonia Susassi, PT 1, Paola Cavallini, PT 1 and Alan Tennant, PhD 3 J Rehabil Med 2011; 43: 445 453 ORIGINAL REPORT Unified Balance Scale: classic psychometric and clinical properties Fabio La Porta, MD 1, Marco Franceschini, MD 2, Serena Caselli, PT 1, Sonia Susassi,

More information

Bayes Theorem and diagnostic tests with application to patients with suspected angina

Bayes Theorem and diagnostic tests with application to patients with suspected angina 96 Tutorial December 2013 - Issue 2 Bayes Theorem and diagnostic tests with application to patients with suspected angina Andrew Owen PhD, FESC Department of Cardiology, Canterbury Christ Church University,

More information

Sensitivity, Specificity and Predictive Value [adapted from Altman and Bland BMJ.com]

Sensitivity, Specificity and Predictive Value [adapted from Altman and Bland BMJ.com] Sensitivity, Specificity and Predictive Value [adapted from Altman and Bland BMJ.com] The simplest diagnostic test is one where the results of an investigation, such as an x ray examination or biopsy,

More information

Statistics for Psychology

Statistics for Psychology Statistics for Psychology SIXTH EDITION CHAPTER 3 Some Key Ingredients for Inferential Statistics Some Key Ingredients for Inferential Statistics Psychologists conduct research to test a theoretical principle

More information

Research Report. Key Words: Aging, Falls, Health fair, Risk, Screening. Kirsten K Ness, James G Gurney, Gillian H Ice

Research Report. Key Words: Aging, Falls, Health fair, Risk, Screening. Kirsten K Ness, James G Gurney, Gillian H Ice Research Report Screening, Education, and Associated Behavioral Responses to Reduce Risk for Falls Among People Over Age 65 Years Attending a Community Health Fair Background and Purpose. Because of the

More information

A qualitative approach to Bayes theorem

A qualitative approach to Bayes theorem 10.1136/ebm-2011-0007 1 Section of General Internal Medicine, Boston University School of Medicine, Boston, Massachusetts, USA 2 Internal Medicine, The Ohio State University College of Medicine, Columbus,

More information

Examining the Psychometric Properties of The McQuaig Occupational Test

Examining the Psychometric Properties of The McQuaig Occupational Test Examining the Psychometric Properties of The McQuaig Occupational Test Prepared for: The McQuaig Institute of Executive Development Ltd., Toronto, Canada Prepared by: Henryk Krajewski, Ph.D., Senior Consultant,

More information

SPECIAL COMMUNICATION. Evidence-Based Medicine, Part 3. An Introduction to Critical Appraisal of Articles on Diagnosis

SPECIAL COMMUNICATION. Evidence-Based Medicine, Part 3. An Introduction to Critical Appraisal of Articles on Diagnosis Evidence-Based Medicine, Part 3. An Introduction to Critical Appraisal of Articles on Diagnosis Damon A. Schranz, DO Michael A. Dunn, OMS III, MBA This article provides an introductory step-by-step process

More information

About Reading Scientific Studies

About Reading Scientific Studies About Reading Scientific Studies TABLE OF CONTENTS About Reading Scientific Studies... 1 Why are these skills important?... 1 Create a Checklist... 1 Introduction... 1 Abstract... 1 Background... 2 Methods...

More information

Clinical Reasoning: Use of Diagnostic Testing

Clinical Reasoning: Use of Diagnostic Testing Clinical Reasoning: Use of Diagnostic Testing Viju John, MD OCTOBER 21, 2016 Objectives 1. Determine pre and post-test probability. 2. Understand the concepts of threshold to test and a threshold to treat.

More information

REPRODUCIBILITY AND RESPONSIVENESS OF EVALUATIVE OUTCOME MEASURES

REPRODUCIBILITY AND RESPONSIVENESS OF EVALUATIVE OUTCOME MEASURES International Journal of Technology Assessment in Health Care, 17:4 (2001), 479 487. Copyright c 2001 Cambridge University Press. Printed in the U.S.A. REPRODUCIBILITY AND RESPONSIVENESS OF EVALUATIVE

More information

The Predictive Validity of the Test of Infant Motor Performance on School Age Motor Developmental Delay

The Predictive Validity of the Test of Infant Motor Performance on School Age Motor Developmental Delay Pacific University CommonKnowledge PT Critically Appraised Topics School of Physical Therapy 2012 The Predictive Validity of the Test of Infant Motor Performance on School Age Motor Developmental Delay

More information

Clinical Utility of Likelihood Ratios

Clinical Utility of Likelihood Ratios CONCEPTS Clinical Utility of Likelihood Ratios From the Departments of Emergency Medicine, Medicine, Epidemiology, and Social Medicine, Albert Einstein College of Medicine, Bronx, NY. Received for publication

More information

Predicting the outcome of acute stroke: prospective evaluation of five multivariate models

Predicting the outcome of acute stroke: prospective evaluation of five multivariate models Journal of Neurology, Neurosurgery, and Psychiatry 1992;55:347-351 Department of Health Care of the Elderly, University Hospital, Nottingham J R F Gladman Department of Medicine, Ipswich Hospital D M J

More information

OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010

OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010 OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010 SAMPLING AND CONFIDENCE INTERVALS Learning objectives for this session:

More information

Title: Validation of the Comprehensive Feeding Practices Questionnaire with parents of 10-to-12-year-olds

Title: Validation of the Comprehensive Feeding Practices Questionnaire with parents of 10-to-12-year-olds Author's response to reviews Title: Validation of the Comprehensive Feeding Practices Questionnaire with parents of 10-to-12-year-olds Authors: Elisabeth L Melbye (elisabeth.l.melbye@uis.no) Torvald Øgaard

More information

A Case Study: Two-sample categorical data

A Case Study: Two-sample categorical data A Case Study: Two-sample categorical data Patrick Breheny January 31 Patrick Breheny BST 701: Bayesian Modeling in Biostatistics 1/43 Introduction Model specification Continuous vs. mixture priors Choice

More information

To open a CMA file > Download and Save file Start CMA Open file from within CMA

To open a CMA file > Download and Save file Start CMA Open file from within CMA Example name Effect size Analysis type Level Tamiflu Symptom relief Mean difference (Hours to relief) Basic Basic Reference Cochrane Figure 4 Synopsis We have a series of studies that evaluated the effect

More information

When an internist evaluates a patient in an ambulatory. The Relation of Conjunctival Pallor to the Presence of Anemia

When an internist evaluates a patient in an ambulatory. The Relation of Conjunctival Pallor to the Presence of Anemia The Relation of Conjunctival Pallor to the Presence of Anemia Tarang N. Sheth, BArtsSc, Niteesh K. Choudhry, MD, Matt Bowes, BSc, Allan S. Detsky, MD, PhD OBJECTIVE: To determine the value of conjunctival

More information

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review Results & Statistics: Description and Correlation The description and presentation of results involves a number of topics. These include scales of measurement, descriptive statistics used to summarize

More information

Basic Concepts in Research and DATA Analysis

Basic Concepts in Research and DATA Analysis Basic Concepts in Research and DATA Analysis 1 Introduction: A Common Language for Researchers...2 Steps to Follow When Conducting Research...2 The Research Question...3 The Hypothesis...3 Defining the

More information

An evidence-based medicine approach to the treatment of endometriosis-associated chronic pelvic pain: placebo-controlled studies Howard F M

An evidence-based medicine approach to the treatment of endometriosis-associated chronic pelvic pain: placebo-controlled studies Howard F M An evidence-based medicine approach to the treatment of endometriosis-associated chronic pelvic pain: placebo-controlled studies Howard F M Authors' objectives To assess the efficacy of treatment of endometriosis-associated

More information

Kathryn D. Mitchell, PT, DPT, NCS, MSCS; Han Chen, MD, MPH; Sheri P. Silfies, PT, PhD

Kathryn D. Mitchell, PT, DPT, NCS, MSCS; Han Chen, MD, MPH; Sheri P. Silfies, PT, PhD Test-Retest Reliability, Validity, and Minimal Detectable Change of the Balance Evaluation Systems Test to Assess Balance in Persons with Multiple Sclerosis Kathryn D. Mitchell, PT, DPT, NCS, MSCS; Han

More information

Why do Psychologists Perform Research?

Why do Psychologists Perform Research? PSY 102 1 PSY 102 Understanding and Thinking Critically About Psychological Research Thinking critically about research means knowing the right questions to ask to assess the validity or accuracy of a

More information

Reliability and Validity

Reliability and Validity Reliability and Today s Objectives Understand the difference between reliability and validity Understand how to develop valid indicators of a concept Reliability and Reliability How accurate or consistent

More information

Margaret Schenkman, PT, PhD, FAPTA University of Colorado, Denver Colorado

Margaret Schenkman, PT, PhD, FAPTA University of Colorado, Denver Colorado Margaret Schenkman, PT, PhD, FAPTA University of Colorado, Denver Colorado Present a framework for clinical reasoning with emphasis on Patient centered care Application of enablement and disablement frameworks

More information

Brunel balance assessment (BBA)

Brunel balance assessment (BBA) Brunel balance assessment (BBA) Tyson, S Title Authors Type URL Brunel balance assessment (BBA) Tyson, S Published Date 2004 Monograph This version is available at: http://usir.salford.ac.uk/4886/ USIR

More information

Cleveland Clinic Mellen Center for Multiple Sclerosis. Mellen Center Approaches: Falls and Fall Prevention in MS. Q: What is a fall?

Cleveland Clinic Mellen Center for Multiple Sclerosis. Mellen Center Approaches: Falls and Fall Prevention in MS. Q: What is a fall? Mellen Center Approaches: Falls and Fall Prevention in MS Q: What is a fall? A: A fall can be defined as an unplanned change in position resulting in the individual resting on the ground or a lower level.

More information

Psychology Research Process

Psychology Research Process Psychology Research Process Logical Processes Induction Observation/Association/Using Correlation Trying to assess, through observation of a large group/sample, what is associated with what? Examples:

More information

Effect of Balance Training on Balance and Confidence in Older Adults

Effect of Balance Training on Balance and Confidence in Older Adults International Journal of Sport Studies. Vol., 4 (6), 681-685, 2014 Available online at http: www.ijssjournal.com ISSN 2251-7502 2014; Science Research Publications Effect of Balance Training on Balance

More information

Jeffrey N. Katz. THE NORTH AMERICAN SPINE SOCIETY (NASS) LUMBAR SPINE OUTCOME ASSESSMENT INSTRUMENT General Description. Administration.

Jeffrey N. Katz. THE NORTH AMERICAN SPINE SOCIETY (NASS) LUMBAR SPINE OUTCOME ASSESSMENT INSTRUMENT General Description. Administration. Arthritis & Rheumatism (Arthritis Care & Research) Vol. 49, No. 5S, October 15, 2003, pp S43 S49 DOI 10.1002/art.11399 2003, American College of Rheumatology MEASURES OF FUNCTION Measures of Adult Back

More information

Student Performance Q&A:

Student Performance Q&A: Student Performance Q&A: 2009 AP Statistics Free-Response Questions The following comments on the 2009 free-response questions for AP Statistics were written by the Chief Reader, Christine Franklin of

More information

MCQ Course in Pediatrics Al Yamamah Hospital June Dr M A Maleque Molla, FRCP, FRCPCH

MCQ Course in Pediatrics Al Yamamah Hospital June Dr M A Maleque Molla, FRCP, FRCPCH MCQ Course in Pediatrics Al Yamamah Hospital 10-11 June Dr M A Maleque Molla, FRCP, FRCPCH Q1. Following statements are true in the steps of evidence based medicine except ; a) Convert the need for information

More information

ITEM ANALYSIS OF MID-TRIMESTER TEST PAPER AND ITS IMPLICATIONS

ITEM ANALYSIS OF MID-TRIMESTER TEST PAPER AND ITS IMPLICATIONS ITEM ANALYSIS OF MID-TRIMESTER TEST PAPER AND ITS IMPLICATIONS 1 SARITA DESHPANDE, 2 RAVINDRA KUMAR PRAJAPATI 1 Professor of Education, College of Humanities and Education, Fiji National University, Natabua,

More information