Author's response to reviews

Size: px
Start display at page:

Download "Author's response to reviews"

Transcription

1 Author's response to reviews Title: Comparison of two Bayesian methods to detect mode effects between paper-based and computerized adaptive assessments: A preliminary Monte Carlo study Authors: Barth B. Riley (barthriley@comcast.net) Adam C. Carle (adam.carle.cchmc@gmail.com) Version: 2 Date: 3 May 2012 Author's response to reviews: see over

2 April 24th, 2012 Ralph D'Agostino, Vern Farewell, Joel Greenhouse, and Louise Ryan Editors BMC Medical Research Methodology RE: MS Dear Editors; Thank you for the opportunity to resubmit our manuscript, which was manuscript originally entitled Comparison of two Bayesian Methods to Detect Differential Item Functioning between Paper-Based and Computerized Adaptive Assessments: A Monte Carlo Study. With this letter, we submit a revised version that takes into account all of the comments of each reviewer. We appreciate the reviewers thoughtful recommendations and believe that the manuscript has been strengthened as a result of this process. We note that the revised version is lengthy due to the recommendations by the reviewers for additional text and reorganization of sections to improve clarity. We have also added supplementary online material (example data files, which are referred to in the text as Additional Files 2-4). Consequently, we have attempted to shorten the manuscript as much as possible (e.g., removing Figure 1 of the original version, and deleting unnecessary verbiage). Table 4 from the original version of the manuscript has been split into two tables to in order to allow each table to be presented in portrait format. This manuscript has not been previously published nor is it under consideration in the same or similar form in any other journal. Again, we are truly grateful to the reviewers for their helpful comments and to you for the opportunity to address these comments and provide a revised manuscript. We look forward to hearing from you. Sincerely, Barth Riley, Ph.D. 1 Department of Health Systems Science, M/C 802, University of Illinois at Chicago 845 S. Damen Ave., Chicago, IL 60612, USA barthr@uic.edu

3 Riley and Carle, Page 2 Reviewer 1 Minor Essential Revisions 1. Bayesian method. I think the authors should explain the reasoning behind each step in the three step Bayesian method. For instance, I am not sure why step 1 is necessary. It is very standard to obtain item parameters without estimates of underlying ability. We have made some revisions to the text on pages 8-9 to provide a rationale for each step. For step 1, we added the sentence: "This is to ensure that item parameters estimated in subsequent steps are on a common metric." Step 2 now reads as: "Using θ i obtained in Step 1, estimate the posterior distributions of mode-specific item parameters for subsequent comparison in step 3" (bold italicized text added). The first sentence describing step 3 has been changed to: " Estimate DIF for each item common across modes by assessing the difference in the posterior distributions of the item parameters (i.e., between β j CAT and β j P&P )." 2. Design. I am sorry but I can't find how many simulated test-takers were included in each replicate. Perhaps I am missing this information. If absent this detail should be added to the manuscript. On p. 11 (end of paragraph 0) the following was modified: Person measures (θ i ) for 500 simulees were generated using an N(0, 1) standard normal distribution. (Bold italicized text was added). On p. 12 (second paragraph) we added the following: Generation of CAT Item Response Data: Prior to performing CAT simulations, response data for all 100 items in the simulated item banks described above were generated for a total of 3,000 simulees in each iteration. This sample size permitted examination of the effect of CAT item usage on DIF detection rates. 3. Conclusion. Page 24 para 2: The authors write "The present study lends support to the use of a Bayesian framework for assessing DIF between CAT and conventional modes of administering an instrument." As I look at Table 2, false discovery looks well controlled, but power seems really low. What I take away from this is that neither of the two approaches are particularly good at detecting DIF in CAT vs. P&P given the study design. Am I missing the point? If yes, please spell it out for me. If no, perhaps an additional dimension to add to the simulation study is the number of simulated test takers, in order that the power get up to a level at which a person might make a decision about how to go about designing a study to assess mode DIF effects in CAT. We acknowledge the reviewer's point and have deleted the sentence in question. While it is difficult to predict the number of test takers required since there is no direct correspondence between sample size and item usage, on p. 16 (paragraph 1) we added the following:

4 Riley and Carle, Page 3 For RZ, an item usage of 369 and 422 were associated with power to detect DIF of 80 percent for the 1PL and 2PL conditions, respectively. For CrI, 80 percent power was associated with CAT item usage of 305 and 341 for 1PL and 2PL conditions, respectively. Discretionary Revisions 4. Title: Consider include "CAT Mode Effects" of some such in the title. The title has been changed as follows: " Comparison of two Bayesian Methods to Detect Mode Effects between Paper-Based and Computerized Adaptive Assessments: A Preliminary Monte Carlo Study" 5. page 7 para 1: Consider using "Power" instead of "Power rates" The term "power rates" has been replaced with "power." 6. page 7 para 2: Consider inserting an "s" in "representing the number of time[s]" The modification has been made. 7. Page 11 para 1: Are 10 replications per condition a sufficient number of replications? We acknowledged the small number of iterations as a limitation of the study findings in the Discussion section. As stated in the Abstract and at the end of the Introduction, this is a preliminary study. The decision to use 10 iterations per combination of conditions was due to the time-intensive nature of the Markov chain Monte Carlo simulations. Other studies (e.g., Glickman et al. 2009, see references) have employed this number of iterations. 8. page 12 para 0: Could the authors comment on what kinds of health constructs are represented by an item bank that has difficulty parameters uniformly distributed in a range [-3,+3] and population severity/ability parameters forming a distribution N(0,1)? Specifically, is it reasonable to propose that such an item difficulty and severity/ability distribution reflects the kinds of item banks researchers are likely to encounter in health research settings? We realize that the simulation results may not generalize to all applications in which CAT and paper-based health assessments are compared. We have added a sentence to the Discussion acknowledging the importance of testing the procedures using various distributions of parameters (p. 26, paragraph 2): Further research is needed to examine the robustness of the method under varying prior assumptions concerning the distribution of item and person parameters and when data fail to conform to these prior assumptions. It was nevertheless important for our purposes to assess the ability of the two procedures to detect DIF in items at varying difficulty levels, which we believed was best achieved through use of a uniform distribution of item difficulty values. We also wanted to

5 Riley and Carle, Page 4 minimize floor and ceiling effects in both items and simulees for which parameters could not be estimated, which may result from skewed, low/high average or widely varying distributions of person/item parameters. Further, it is not uncommon practice to set distributional properties of certain parameters in order to identify the model, e.g., setting the standard deviation of person measures to Page 12 para 1: The authors report RMSE: is this reasonably good fit? We are not aware of specific criteria or cutoffs for "good" RMSE in this context. However, we have added the following (p. 13, paragraph 1): RMSEs and correlations between item parameters and their estimates were consistent parameter recovery results presented elsewhere [38-40]. 10. Page 11-12: Perhaps somewhere in here the authors could describe the sample size used for estimating item bank parameter estimates (using Mplus). See response to #2 above. 12. Page 13 para 1: Is it reasonable to envision a researcher seeking to examine DIF due to mode effects (P&P vs. CAT) when mu_cat is 1 and mu_p&p is 0? I think this would be an extremely suspect design and probably a condition not worth considering. Perhaps I am being unimaginative. Regardless, the paper might be stronger if the authors would comment on this point. Given that mean differences in group measures have been shown to affect DIF detection, (see references in the manuscript) we wanted to include this as a condition in our simulation study. We have included in the Discussion (p. 26 paragraph 1) the following sentence: These results highlight the need for careful sampling of respondents who complete each form of the instrument and assessment of trait-level differences prior to assessment of mode effects. 13. Page 16 para 2: Consider using "specificity" instead of "specificity rates". And elsewhere throughout the manuscript, consider deprecating "rate" for percentage, percent, or without additional qualification. The modification was made. 14. Results: It is reasonable to present results to three places of precision given a sample size of 160 replications? We have rounded all values in Table 2 to two decimal points and have made similar changes throughout the text. Consequently, we also revised the text on p. 14 (last paragraph) which now includes the sentence: "CAT to full-instrument correlations were.97 across all conditions. MSEs were 0.26 for the 1PL and for the 2PL conditions. "

6 Riley and Carle, Page Page 22, para 0: This is a comment/suggestion and not a point that needs addressing in this manuscript (unless the authors agree with the suggestion). The authors write "Research is clearly needed to determine the causes of elevated false positive rate for easy-to-endorse items." This reminds me of the "constant bias" problem that observed variable conditioning approaches to DIF detection (Millsap and Everson, 1993 Applied Psychol Meas 17: ; Camilli, in Differential item functioning Holland et al., Eds. [Lawrence Erlbaum Associates, Publishers, Hillsdale, NJ, 1993] pp ). I suspect this is due to step 1 of the procedure, which estimates a latent ability trait estimate assuming no DIF and uses this to estimate item parameters in a second step. I think what you need is a method that allows for simultaneously estimating item parameters, and possible differences across P&P and CAT administration group in selected items, and while controlling for that DIF effect on underlying ability estimates and underlying ability in a single step. This could be done with an Mplus and a MIMIC model approach, but the authors would have to translate SEM parameters (measurement slopes, thresholds, and direct effects) into IRT parameters (c.f. Macintosh and Hashim, 2003 Applied psychological measurement 27:372-9). The reviewer's point is well taken. We have added a sentence following the sentence on p. 22 that makes this and one other recommendation for further research: Two possible avenues of research in this area include: (1) further examination of different priors for item parameters and their effect on DIF detection for easy-toendorse items, and (2) an iterative process of identifying DIF items and then removing or appropriately weighting them in the estimation of person measures. Regarding the suggestion of using a MIMIC model in MPlus, our experience with this approach has been problematic, since it may be difficult to converge on a solution when there is a considerable amount of missing data as is the case in CAT. 16. Page 22 para 2: One more explanatory sentence in this paragraph ("As might be expected...") would be a welcome addition. The revised paragraph is as follows: As might be expected, DIF magnitude (i.e., the difference between CAT and P&P item parameters for a given item) was significantly and positively related to power. However, the same was not true for the percentage of items with DIF in the item bank. The latter result suggests that the power to detect a single DIF item is not significantly affected by the presence of other DIF items in the bank which may "contaminate" the person measures. (Bold italicized text was added). 17. Page 22 para 2: The authors report that a number of factors were related to power, and are explained usually by their association with frequency of use in CAT (e.g., difficulty, discrimination). Perhaps the authors have not adequately modeled the dependency of power to detect DIF on CAT item frequency. That is, the relationship may be non-linear with respect to CAT

7 Riley and Carle, Page 6 item frequency. In this case the explanatory models for power would be misspecified. Given the length and complexity of the paper, we decided to focus only on main effects in the prediction of true and false DIF detection. It is quite possible that inclusion of interaction effects may be warranted and may explicate the relationship of CAT item usage, item difficulty and discrimination with respect to power. We may see, for instance, that the relationship of CAT item usage to true DIF detection may vary with item difficulty and/or discrimination. This perhaps would reflect the non-linear effect the reviewer is suggesting. However, since the relationship of item usage to power was statistically significant and model AUCs were high (.95 and.93) for both RZ and CrI, the models in their present form are quite strong. 18. Reproducibility and Replication. I think the authors should consider including one or two CSV of their generated data as a supplementary file. Researchers interested in developing other approaches to DIF detection in CAT might benefit from comparing their results to ones published in this manuscript. We are certainly open to include additional material via a weblink associated with the paper. While we can provide sample data used in different phases of the project (parameter estimation, CAT simulation, DIF estimation), it would be helpful to know which data would be most useful. Reviewer 2 Major Compulsory Revisions Introduction 1. There has been quite a bit of literature comparing P&P and computerized administration (although not CAT) of self-report measures. For example, a recent meta-analysis [1] of 65 papers comparing computerized and P&P administration of patient reported outcomes (PRO) found that the methods are equivalent. The authors should consider this research and determine whether it is relevant to their study. There are two points to consider when examining the Gwaltney et al (2008) investigation in the context of the present study. First, Gwaltney et al. (2008) did not examine mode DIF but rather mode effects at the person level. Second, unlike computer-based assessment, CAT relies heavily on item parameters for item selection and standard errors to determine when to stop the assessment. DIF, even if it does not result in differential test functioning, can adversely affect CAT efficiency and possibly cause some measures to be biased. We therefore maintain that the issue of mode effects remains relevant when comparing conventional and CAT-administered assessments. 2. A related issue is the generalizability of this work. It is not clear how often one will need to compare P&P and CAT administration to assess DIF. ideally, one would determine the existence of DIF by administering the

8 Riley and Carle, Page 7 entire item bank via both computer and paper and then assess DIF. This procedure would allow one to use traditional methods to assess DIF, which are well developed and far easier to implement. A critical question is whether comparison of paper-administered and computeradministered banks of items would generalize to the CAT vs. P&P comparison of interest here. A unique feature of CAT is that items are selected sequentially based on previouslyadministered items and estimates of trait level. The use of item parameters that is not appropriate for the CAT context can affect the efficiency and precision of the CAT. Variation in item administration order may also lead to differences in how respondents answer certain items which would not be reflected in a fixed-order computer-based assessment. Methods 3. The authors need to include a data section so that they can describe: i. How the data was developed ii. The assumptions of the underlying data. iii. The item banks constructed from the data. iv. Any other relevant information that will help the reader understands the dataset and replicate it. We have inserted the heading Data Generation Procedures at the top of p 11. This section extends to the middle of p. 13. In this section we describe the distributional assumptions underlying item and person parameters, the CAT procedures used in the CAT simulation. We also provide a flowchart summarizing the overall process of the simulations. In addition, we plan to provide example datasets as online material. 4. The authors state that their, primary interest was identification of DIF in psychological and health-outcome measures. To do this, the authors generated item banks that included 100 items and simulated CAT administration using 30 items for each pseudo-respondent. This does not seem to be consistent with CATs used to assess health outcomes and probably does not reflect those assessing psychological outcomes. Item banks developed through the PROMIS initiative, for example, have as few as 9 items, and most have between 30 and 40 items. Further, in practice, most CATs in healthcare are not a fixed length. Rather, most are designed to terminate after a minimum criterion is exceeded. PROMIS CATs, for example, often terminate after 5 items are administered. As stated above, we recognize that the selection of the number and distribution of items and pseudo respondents may not reflect actual application. First, we have modified the sentence above as follows: Second, we focused on instruments fitting a one- (1PL) or two-parameter (2PL) IRT model [10-12], which are commonly applied to health outcome measures. Second, we have modified the sentence on p. 25 concerning the study limitations: Additional limitations of the present study include the small number of replications per experimental condition, the use of a fixed-length CAT and fixed item bank size.

9 Riley and Carle, Page 8 5. It is believed that the paper would benefit from reorganizing the section. For example, virtually all the information in the Methods section should be subsumed under the Analysis subsection. Within this section, the analyses should be presented in a coherent order; possibly the sequence in which the analyses were conducted. Each of these subsections would then include parts the authors have already written, such as Generation of the Validation (Paper-and-Pencil) Item Parameters and Response Data. We have made a number of changes to this section in order to more clearly convey the study procedures and their order. We have done this by including additional section headings and sentences or paragraphs describing the organizing of the subsequent section. For instance, we have inserted the following (top of p. 8) as an introduction to the Methods section (p. 9): The methods employed in this study will be presented in three main sections. First, we describe the development and underlying assumptions of two Bayesian methods for detecting item-level mode effects. Second, we describe the simulation study, including its design and data generation procedures. The third section outlines the analysis of the simulated data. Results 1. The authors should report the findings using the revised format of the Analysis section or with some alterations that facilitate reader understanding. We have made revisions throughout the Results section to convey the order of the analyses outlined in the Analysis section. We have, for instance, added text to link results to the research questions, as was done in the Analysis section. Discussion 1. The authors provide a thoughtful presentation about the pros and cons of both procedures; however, it is not until the Summary section that they finally state their findings that the modified robust Z statistic is preferred. This should be stated earlier, perhaps in the 2nd paragraph of the Discussion section. Further, it is not clear whether this finding is consistent across 1- and 2- parameter models. In the first paragraph of the Discussion we wrote: The CrI method resulted in slightly higher power, but this was offset by a higher false positive rate relative to RZ. For both 1PL and 2PL conditions, power was higher for CrI than RZ, but control of Type I error was better for RZ compared to CrI. Thus, power and Type I error control results were consistent across these two conditions. 2. The authors do not discuss the limitations of their methodology. Will these findings be replicated using real data? Will the inclusion of items with nonuniform DIF affect the results? If not, why? The authors do report in the

10 Riley and Carle, Page 9 Conclusion section that the procedure can be adapted to assess non-uniform DIF too, but this is not a conclusion. We have removed any statement in the manuscript concerning the potential utility of the studied procedures in detecting non-uniform mode DIF. Minor Essential Revisions Abstract (all of these are minor) 1. The methods sections needs to be modified. I understand that there are 16 conditions, but it requires the reader some work to sort this out. To provide greater clarity as to how the factors contributed to the total of 16 conditions, we modified the sentence in the abstract as shown below: A simulation study was conducted under the following conditions: (1) data-generating model (one- vs. two-parameter IRT model); (2) moderate vs. large DIF sizes; (3) percentage of DIF items (10% vs. 30%), and (4) mean difference in θ estimates across modes of 0 vs. 1 logits. 2. It is not clear whether the results are consistent across 1- and 2-parameter models. We modified the first sentence of the Results paragraph of the Abstract as follows: Results: Both methods evidenced good to excellent false positive control, with RZ providing better control of false positives and with slightly higher power for CrI, irrespective of measurement model. 3. The data the authors used is relevant to the findings, thus, it seems a brief description is in order. As data for the study were simulated, we have provided a detailed description of the data-generation procedures under Data Generation (beginning of p. 11). We will also provide example datasets as online material. 4. In the Discussion section, the authors state that the modified robust Z statistic is preferred. This should be stated in the abstract. We state in the Abstract the relative performance differences between the two methods (see Results and Conclusion paragraphs). Introduction 1. The authors discuss the purpose of the study in the final paragraph of the Introduction, but restate it, using more detail, in the Methods section (page 14). This seems redundant. The Introduction should be reorganized so that the research questions can be listed once. We deleted the research questions originally listed on page 14 and revised the last section of the Introduction to clearly state the research questions and place them in a single section.

11 Riley and Carle, Page 10 Methods 1. The authors should list the software and how it was used at the end of the Methods section. We have moved all references to software under the subheading Software at the end of the Methods section (p. 16). 2. The authors should list the software and how it was used at the end of the Methods section. We have added a Software section at the end of the Methods. 3. Figure 1 does not optimally depict the steps of the analysis because the boxes in the 1pl and 2pl columns are virtually identical. However, the authors may wish to use that framework to organize the sections of the analysis section (they would then also need to describe the DIF steps in more detail). We agree with the reviewer that the sequence of steps for the 1PL and 2PL conditions are virtually identical. Consequently, the inclusion of a flowchart may not add significantly to the reader's understanding of the processes being described. We have therefore decided to remove this figure and instead made changes to the organization of the text to improve clarity. 4. The authors clearly state that non-uniform DIF was not used in this study. But they do not indicate why. It seems like they should include it, or explain why they did not. Perhaps, it is also a limitation of the study. See response to Reviewer 2, Major Compulsory Revisions, Discussion, point 2. Results 1. It is not clear whether the authors should report information about the data they created. We have included the section heading Data Generation Procedures and reorganize this section in order to clearly describe how data were simulated at each step of the study. We believe it is important to describe these procedures to facilitate replication of the study. However, we are also open to placing this information in an Appendix if the reviewer feels this would be more appropriate. 2. Page 15 (bottom) the authors use the word medium, but it seems median is appropriate. It is not clear what the authors are demonstrating with this statistic because they do not include a rationale for its inclusion in the Analysis section. The word should be median instead of "medium." We have made this correction.

12 Riley and Carle, Page The paper would benefit from a more thorough treatment of the results. For example, on page 16, they state the results of a comparison of the area under the curve for the RZ and #CrI are significant, but they do not indicate what it means. On p. 17 (paragraph 1) we have added the following sentence concerning the ROC analysis comparing RZ and CrI: This indicates that RZ values are significantly stronger predictor of the presence of mode DIF compared to CrI values. 4. The authors report ordinary least squares regressions for the 1st time in the Results section, but they should report all analyses conducted in the Methods section. We have moved the description of the analysis procedures used to create Figure 2 from the Results to the Analysis section in the methods. Both the description of the analytic procedures and the results of the ordinary least squares regression are placed under the heading Relationship of Item Difficulty to Power and Type I Error: Discussion 1. The authors should indicate if these findings are replicable when CATs do not have a fixed length. On p. 21 (paragraph 2), we added the following (see bold italicized text): Thus, power to detect DIF in items that are very easy or difficult to endorse is lower than that for items of average difficulty. This would likely explain why absolute item difficulty was a significant predictor of power even after controlling for CAT item usage. These findings may in part reflect the use of a fixed-length CAT during the simulation. In the case of a variable-length CAT, more items would likely be administered to simulees at the extremes of the trait continuum in order to achieve sufficient measurement precision, including items that are very easy or difficult to endorse. Conversely, we would expect fewer items to be administered to simulees who are in the center of the trait distribution under if a variable-length CAT were used. Miscellaneous 1. The authors provide 2 supplements but do not refer to them in the paper. We have made reference to Additional File 1 [Appendix A: Generating and Estimated Item Parameters for the Two Simulated Item Banks (Validation data) Based on the Oneand Two-Parameter IRT Models, Respectively] on p. 11, paragraph 4, and Additional File 5 (Appendix B - WinBUGS code) on p. 15 paragraph 2. We have also included references in the text to example data files (Additional Files 2-4).

Author s response to reviews

Author s response to reviews Author s response to reviews Title: The validity of a professional competence tool for physiotherapy students in simulationbased clinical education: a Rasch analysis Authors: Belinda Judd (belinda.judd@sydney.edu.au)

More information

Title: Identifying work ability promoting factors for home care aides and assistant nurses

Title: Identifying work ability promoting factors for home care aides and assistant nurses Author's response to reviews Title: Identifying work ability promoting factors for home care aides and assistant nurses Authors: Agneta Larsson (agneta.larsson@ltu.se) Lena Karlqvist (lena.karlqvist@ltu.se)

More information

Title: Socioeconomic conditions and number of pain sites in women

Title: Socioeconomic conditions and number of pain sites in women Author's response to reviews Title: Socioeconomic conditions and number of pain sites in women Authors: Finn E Skjeldestad (fisk@fhi.no) Toril Rannestad (Toril.Rannestad@hist.no) Version: 2 Date: 17 January

More information

Mantel-Haenszel Procedures for Detecting Differential Item Functioning

Mantel-Haenszel Procedures for Detecting Differential Item Functioning A Comparison of Logistic Regression and Mantel-Haenszel Procedures for Detecting Differential Item Functioning H. Jane Rogers, Teachers College, Columbia University Hariharan Swaminathan, University of

More information

Title:Mixed-strain Housing for Female C57BL/6, DBA/2, and BALB/c Mice: Validating a Split-plot Design that promotes Refinement and Reduction

Title:Mixed-strain Housing for Female C57BL/6, DBA/2, and BALB/c Mice: Validating a Split-plot Design that promotes Refinement and Reduction Author's response to reviews Title:Mixed-strain Housing for Female C57BL/6, DBA/2, and BALB/c Mice: Validating a Split-plot Design that promotes Refinement and Reduction Authors: Michael Walker Mr (mwalk04@uoguelph.ca)

More information

Title:Video-confidence: a qualitative exploration of videoconferencing for psychiatric emergencies

Title:Video-confidence: a qualitative exploration of videoconferencing for psychiatric emergencies Author's response to reviews Title:Video-confidence: a qualitative exploration of videoconferencing for psychiatric emergencies Authors: Marianne V Trondsen (marianne.trondsen@telemed.no) Stein Roald Bolle

More information

Title:Emergency ambulance service involvement with residential care homes in the support of older people with dementia: an observational study

Title:Emergency ambulance service involvement with residential care homes in the support of older people with dementia: an observational study Author's response to reviews Title:Emergency ambulance service involvement with residential care homes in the support of older people with dementia: an observational study Authors: Sarah Amador (s.amador@herts.ac.uk)

More information

Title:Prediction of poor outcomes six months following total knee arthroplasty in patients awaiting surgery

Title:Prediction of poor outcomes six months following total knee arthroplasty in patients awaiting surgery Author's response to reviews Title:Prediction of poor outcomes six months following total knee arthroplasty in patients awaiting surgery Authors: Eugen Lungu (eugen.lungu@umontreal.ca) François Desmeules

More information

Research and Evaluation Methodology Program, School of Human Development and Organizational Studies in Education, University of Florida

Research and Evaluation Methodology Program, School of Human Development and Organizational Studies in Education, University of Florida Vol. 2 (1), pp. 22-39, Jan, 2015 http://www.ijate.net e-issn: 2148-7456 IJATE A Comparison of Logistic Regression Models for Dif Detection in Polytomous Items: The Effect of Small Sample Sizes and Non-Normality

More information

Title: A robustness study of parametric and non-parametric tests in Model-Based Multifactor Dimensionality Reduction for epistasis detection

Title: A robustness study of parametric and non-parametric tests in Model-Based Multifactor Dimensionality Reduction for epistasis detection Author's response to reviews Title: A robustness study of parametric and non-parametric tests in Model-Based Multifactor Dimensionality Reduction for epistasis detection Authors: Jestinah M Mahachie John

More information

Title: Home Exposure to Arabian Incense (Bakhour) and Asthma Symptoms in Children: A Community Survey in Two Regions in Oman

Title: Home Exposure to Arabian Incense (Bakhour) and Asthma Symptoms in Children: A Community Survey in Two Regions in Oman Author's response to reviews Title: Home Exposure to Arabian Incense (Bakhour) and Asthma Symptoms in Children: A Community Survey in Two Regions in Oman Authors: Omar A Al-Rawas (orawas@squ.edu.om) Abdullah

More information

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 39 Evaluation of Comparability of Scores and Passing Decisions for Different Item Pools of Computerized Adaptive Examinations

More information

Title:Modern contraceptive use among sexually active men in Uganda: Does discussion with a health worker matter?

Title:Modern contraceptive use among sexually active men in Uganda: Does discussion with a health worker matter? Author's response to reviews Title:Modern contraceptive use among sexually active men in Uganda: Does discussion with a health worker matter? Authors: Allen Kabagenyi Ms. (allenka79@yahoo.com) Patricia

More information

Guidelines for reviewers

Guidelines for reviewers Guidelines for reviewers Registered Reports are a form of empirical article in which the methods and proposed analyses are pre-registered and reviewed prior to research being conducted. This format of

More information

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Ball State University

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Ball State University PEER REVIEW HISTORY BMJ Open publishes all reviews undertaken for accepted manuscripts. Reviewers are asked to complete a checklist review form (see an example) and are provided with free text boxes to

More information

Author's response to reviews

Author's response to reviews Author's response to reviews Title: Diabetes duration and health-related quality of life in individuals with onset of diabetes in the age group 15-34 years - a Swedish population-based study using EQ-5D

More information

Author's response to reviews

Author's response to reviews Author's response to reviews Title: Physiotherapy interventions in scientific physiotherapy publications focusing on interventions for children with cerebral palsy: A qualitative phenomenographic approach.

More information

A Comparison of Several Goodness-of-Fit Statistics

A Comparison of Several Goodness-of-Fit Statistics A Comparison of Several Goodness-of-Fit Statistics Robert L. McKinley The University of Toledo Craig N. Mills Educational Testing Service A study was conducted to evaluate four goodnessof-fit procedures

More information

Differential Item Functioning

Differential Item Functioning Differential Item Functioning Lecture #11 ICPSR Item Response Theory Workshop Lecture #11: 1of 62 Lecture Overview Detection of Differential Item Functioning (DIF) Distinguish Bias from DIF Test vs. Item

More information

Adaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida

Adaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida Adaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida and Oleksandr S. Chernyshenko University of Canterbury Presented at the New CAT Models

More information

A Comparison of Pseudo-Bayesian and Joint Maximum Likelihood Procedures for Estimating Item Parameters in the Three-Parameter IRT Model

A Comparison of Pseudo-Bayesian and Joint Maximum Likelihood Procedures for Estimating Item Parameters in the Three-Parameter IRT Model A Comparison of Pseudo-Bayesian and Joint Maximum Likelihood Procedures for Estimating Item Parameters in the Three-Parameter IRT Model Gary Skaggs Fairfax County, Virginia Public Schools José Stevenson

More information

GMAC. Scaling Item Difficulty Estimates from Nonequivalent Groups

GMAC. Scaling Item Difficulty Estimates from Nonequivalent Groups GMAC Scaling Item Difficulty Estimates from Nonequivalent Groups Fanmin Guo, Lawrence Rudner, and Eileen Talento-Miller GMAC Research Reports RR-09-03 April 3, 2009 Abstract By placing item statistics

More information

Contents. What is item analysis in general? Psy 427 Cal State Northridge Andrew Ainsworth, PhD

Contents. What is item analysis in general? Psy 427 Cal State Northridge Andrew Ainsworth, PhD Psy 427 Cal State Northridge Andrew Ainsworth, PhD Contents Item Analysis in General Classical Test Theory Item Response Theory Basics Item Response Functions Item Information Functions Invariance IRT

More information

Title: Estimation of the burden of varicella in Europe before the introduction of universal childhood immunization

Title: Estimation of the burden of varicella in Europe before the introduction of universal childhood immunization Author s response to reviews Title: Estimation of the burden of varicella in Europe before the introduction of universal childhood immunization Authors: Margarita Riera-Montes (margarita.riera@p-95.com)

More information

Publishing Your Study: Tips for Young Investigators. Learning Objectives 7/9/2013. Eric B. Bass, MD, MPH

Publishing Your Study: Tips for Young Investigators. Learning Objectives 7/9/2013. Eric B. Bass, MD, MPH Publishing Your Study: Tips for Young Investigators Eric B. Bass, MD, MPH Learning Objectives To apply a logical approach to organizing & presenting your work in a manuscript To recognize the importance

More information

Statistical reports Regression, 2010

Statistical reports Regression, 2010 Statistical reports Regression, 2010 Niels Richard Hansen June 10, 2010 This document gives some guidelines on how to write a report on a statistical analysis. The document is organized into sections that

More information

Title: Human breast cancer associated fibroblasts exhibit subtype specific gene expression profiles

Title: Human breast cancer associated fibroblasts exhibit subtype specific gene expression profiles Author's response to reviews Title: Human breast cancer associated fibroblasts exhibit subtype specific gene expression profiles Authors: Julia Tchou (julia.tchou@uphs.upenn.edu) Andrew V Kossenkov (akossenkov@wistar.org)

More information

Please revise your paper to respond to all of the comments by the reviewers. Their reports are available at the end of this letter, below.

Please revise your paper to respond to all of the comments by the reviewers. Their reports are available at the end of this letter, below. Dear editor and dear reviewers Thank you very much for the additional comments and suggestions. We have modified the manuscript according to the comments below. We have also updated the literature search

More information

FATIGUE. A brief guide to the PROMIS Fatigue instruments:

FATIGUE. A brief guide to the PROMIS Fatigue instruments: FATIGUE A brief guide to the PROMIS Fatigue instruments: ADULT ADULT CANCER PEDIATRIC PARENT PROXY PROMIS Ca Bank v1.0 Fatigue PROMIS Pediatric Bank v2.0 Fatigue PROMIS Pediatric Bank v1.0 Fatigue* PROMIS

More information

English Editing Samples

English Editing Samples English Editing Samples Contents Advanced Edit... 2 Premium Edit... 3 Customized cover letter... 4 Feedback on original writing... 6 Editing certificate... 9 Journal response letter editing... 10 Advanced

More information

Impact of Differential Item Functioning on Subsequent Statistical Conclusions Based on Observed Test Score Data. Zhen Li & Bruno D.

Impact of Differential Item Functioning on Subsequent Statistical Conclusions Based on Observed Test Score Data. Zhen Li & Bruno D. Psicológica (2009), 30, 343-370. SECCIÓN METODOLÓGICA Impact of Differential Item Functioning on Subsequent Statistical Conclusions Based on Observed Test Score Data Zhen Li & Bruno D. Zumbo 1 University

More information

Title: Healthy snacks at the checkout counter: A lab and field study on the impact of shelf arrangement and assortment structure on consumer choices

Title: Healthy snacks at the checkout counter: A lab and field study on the impact of shelf arrangement and assortment structure on consumer choices Author's response to reviews Title: Healthy snacks at the checkout counter: A lab and field study on the impact of shelf arrangement and assortment structure on consumer choices Authors: Ellen van Kleef

More information

Title:Environmental footprints of Mediterranean versus Western Dietary Patterns: Beyond the health benefits of the Mediterranean Diet

Title:Environmental footprints of Mediterranean versus Western Dietary Patterns: Beyond the health benefits of the Mediterranean Diet Author's response to reviews Title:Environmental footprints of Mediterranean versus Western Dietary Patterns: Beyond the health benefits of the Mediterranean Diet Authors: Sara Sáez Almendros (sara.saez.almendros@gmail.com)

More information

Comparability Study of Online and Paper and Pencil Tests Using Modified Internally and Externally Matched Criteria

Comparability Study of Online and Paper and Pencil Tests Using Modified Internally and Externally Matched Criteria Comparability Study of Online and Paper and Pencil Tests Using Modified Internally and Externally Matched Criteria Thakur Karkee Measurement Incorporated Dong-In Kim CTB/McGraw-Hill Kevin Fatica CTB/McGraw-Hill

More information

ANXIETY A brief guide to the PROMIS Anxiety instruments:

ANXIETY A brief guide to the PROMIS Anxiety instruments: ANXIETY A brief guide to the PROMIS Anxiety instruments: ADULT PEDIATRIC PARENT PROXY PROMIS Pediatric Bank v1.0 Anxiety PROMIS Pediatric Short Form v1.0 - Anxiety 8a PROMIS Item Bank v1.0 Anxiety PROMIS

More information

Title: Reliability and validity of the adolescent stress questionnaire in a sample of European adolescents - the HELENA study

Title: Reliability and validity of the adolescent stress questionnaire in a sample of European adolescents - the HELENA study Author's response to reviews Title: Reliability and validity of the adolescent stress questionnaire in a sample of European adolescents - the HELENA study Authors: Tineke De Vriendt (tineke.devriendt@ugent.be)

More information

Centre for Education Research and Policy

Centre for Education Research and Policy THE EFFECT OF SAMPLE SIZE ON ITEM PARAMETER ESTIMATION FOR THE PARTIAL CREDIT MODEL ABSTRACT Item Response Theory (IRT) models have been widely used to analyse test data and develop IRT-based tests. An

More information

IAASB Main Agenda (September 2005) Page Agenda Item. Analysis of ISA 330 and Mapping Document

IAASB Main Agenda (September 2005) Page Agenda Item. Analysis of ISA 330 and Mapping Document IAASB Main Agenda (September 2005 Page 2005 1869 Agenda Item 4-D.1 Analysis of ISA 330 and Mapping Document 1. Exhibit 1 sets out statements in ISA 330 that use the present tense to describe auditor actions,

More information

PAIN INTERFERENCE. ADULT ADULT CANCER PEDIATRIC PARENT PROXY PROMIS-Ca Bank v1.1 Pain Interference PROMIS-Ca Bank v1.0 Pain Interference*

PAIN INTERFERENCE. ADULT ADULT CANCER PEDIATRIC PARENT PROXY PROMIS-Ca Bank v1.1 Pain Interference PROMIS-Ca Bank v1.0 Pain Interference* PROMIS Item Bank v1.1 Pain Interference PROMIS Item Bank v1.0 Pain Interference* PROMIS Short Form v1.0 Pain Interference 4a PROMIS Short Form v1.0 Pain Interference 6a PROMIS Short Form v1.0 Pain Interference

More information

Title:Continuity of GP care is associated with lower use of complementary and alternative medical providers A population-based cross-sectional survey

Title:Continuity of GP care is associated with lower use of complementary and alternative medical providers A population-based cross-sectional survey Author's response to reviews Title:Continuity of GP care is associated with lower use of complementary and alternative medical providers A population-based cross-sectional survey Authors: Anne Helen Hansen

More information

RESPONSE TO DECISION LETTER

RESPONSE TO DECISION LETTER RESPONSE TO DECISION LETTER Dear Editor-in-chief, We are grateful to the editors and reviewers for their time and constructive comments on our manuscript. We have implemented their comments and suggestions

More information

Bruno D. Zumbo, Ph.D. University of Northern British Columbia

Bruno D. Zumbo, Ph.D. University of Northern British Columbia Bruno Zumbo 1 The Effect of DIF and Impact on Classical Test Statistics: Undetected DIF and Impact, and the Reliability and Interpretability of Scores from a Language Proficiency Test Bruno D. Zumbo, Ph.D.

More information

accuracy (see, e.g., Mislevy & Stocking, 1989; Qualls & Ansley, 1985; Yen, 1987). A general finding of this research is that MML and Bayesian

accuracy (see, e.g., Mislevy & Stocking, 1989; Qualls & Ansley, 1985; Yen, 1987). A general finding of this research is that MML and Bayesian Recovery of Marginal Maximum Likelihood Estimates in the Two-Parameter Logistic Response Model: An Evaluation of MULTILOG Clement A. Stone University of Pittsburgh Marginal maximum likelihood (MML) estimation

More information

USE OF DIFFERENTIAL ITEM FUNCTIONING (DIF) ANALYSIS FOR BIAS ANALYSIS IN TEST CONSTRUCTION

USE OF DIFFERENTIAL ITEM FUNCTIONING (DIF) ANALYSIS FOR BIAS ANALYSIS IN TEST CONSTRUCTION USE OF DIFFERENTIAL ITEM FUNCTIONING (DIF) ANALYSIS FOR BIAS ANALYSIS IN TEST CONSTRUCTION Iweka Fidelis (Ph.D) Department of Educational Psychology, Guidance and Counselling, University of Port Harcourt,

More information

Computerized Mastery Testing

Computerized Mastery Testing Computerized Mastery Testing With Nonequivalent Testlets Kathleen Sheehan and Charles Lewis Educational Testing Service A procedure for determining the effect of testlet nonequivalence on the operating

More information

Title: Systematic review of lung function and COPD with peripheral blood DNA methylation in population based studies

Title: Systematic review of lung function and COPD with peripheral blood DNA methylation in population based studies Author s response to reviews Title: Systematic review of lung function and COPD with peripheral blood DNA methylation in population based studies Authors: Matthew Machin (matthew.machin12@imperial.ac.uk)

More information

Title: Structural equation modeling of parasympathetic and sympathetic response to traffic air pollution in a repeated measures study

Title: Structural equation modeling of parasympathetic and sympathetic response to traffic air pollution in a repeated measures study Author's response to reviews Title: Structural equation modeling of parasympathetic and sympathetic response to traffic air pollution in a repeated measures study Authors: Emmanuel S Baja (ebaja@hsph.harvard.edu)

More information

Author's response to reviews

Author's response to reviews Author's response to reviews Title:Mental health problems in the 10th grade and non-completion of upper secondary school: the mediating role of grades in a population-based longitudinal study Authors:

More information

Author's response to reviews

Author's response to reviews Author's response to reviews Title: A multicentre, double-blind, randomised, controlled, parallel-group study of the effectiveness of a pharmacist-acquired medication history in an emergency department

More information

VARIED THRUSH MANUSCRIPT REVIEW HISTORY REVIEWS (ROUND 2) Editor Decision Letter

VARIED THRUSH MANUSCRIPT REVIEW HISTORY REVIEWS (ROUND 2) Editor Decision Letter 1 VARIED THRUSH MANUSCRIPT REVIEW HISTORY REVIEWS (ROUND 2) Editor Decision Letter Thank you for submitting your revision to the Journal of Consumer Research. The manuscript and the revision notes were

More information

INTRODUCTION TO ASSESSMENT OPTIONS

INTRODUCTION TO ASSESSMENT OPTIONS DEPRESSION A brief guide to the PROMIS Depression instruments: ADULT ADULT CANCER PEDIATRIC PARENT PROXY PROMIS-Ca Bank v1.0 Depression PROMIS Pediatric Item Bank v2.0 Depressive Symptoms PROMIS Pediatric

More information

Title: Co-morbidities, complications and causes of death among people with femoral neck fracture - A three-year follow-up study.

Title: Co-morbidities, complications and causes of death among people with femoral neck fracture - A three-year follow-up study. Author s response to reviews Title: Co-morbidities, complications and causes of death among people with femoral neck fracture - A three-year follow-up study. Authors: Monica Berggren (monica.langstrom@umu.se)

More information

Using the Distractor Categories of Multiple-Choice Items to Improve IRT Linking

Using the Distractor Categories of Multiple-Choice Items to Improve IRT Linking Using the Distractor Categories of Multiple-Choice Items to Improve IRT Linking Jee Seon Kim University of Wisconsin, Madison Paper presented at 2006 NCME Annual Meeting San Francisco, CA Correspondence

More information

Title: Validation of the Comprehensive Feeding Practices Questionnaire with parents of 10-to-12-year-olds

Title: Validation of the Comprehensive Feeding Practices Questionnaire with parents of 10-to-12-year-olds Author's response to reviews Title: Validation of the Comprehensive Feeding Practices Questionnaire with parents of 10-to-12-year-olds Authors: Elisabeth L Melbye (elisabeth.l.melbye@uis.no) Torvald Øgaard

More information

Dear Dr. Villanueva,

Dear Dr. Villanueva, 22-12-2017 Dear Dr. Villanueva, We would like to thank you for your interest in our paper and the opportunity to resubmit our manuscript Living network meta-analysis for reducing research waste: an empirical

More information

Bayesian Tailored Testing and the Influence

Bayesian Tailored Testing and the Influence Bayesian Tailored Testing and the Influence of Item Bank Characteristics Carl J. Jensema Gallaudet College Owen s (1969) Bayesian tailored testing method is introduced along with a brief review of its

More information

Meta-Analysis and Publication Bias: How Well Does the FAT-PET-PEESE Procedure Work?

Meta-Analysis and Publication Bias: How Well Does the FAT-PET-PEESE Procedure Work? Meta-Analysis and Publication Bias: How Well Does the FAT-PET-PEESE Procedure Work? Nazila Alinaghi W. Robert Reed Department of Economics and Finance, University of Canterbury Abstract: This study uses

More information

Author's response to reviews

Author's response to reviews Author's response to reviews Title:Differences upon admission and in hospital course of children hospitalized pneumonia: a retrospective cohort study Authors: Raquel Simbalista (r.simbalista@terra.com.br)

More information

11 September Tower Old Broad Street London EC2N 1HQ United Kingdom t + 44 (0) f + 44 (0)

11 September Tower Old Broad Street London EC2N 1HQ United Kingdom t + 44 (0) f + 44 (0) 11 September 2007 Jim Sylph International Auditing and Assurance Standards Board 545 5th Avenue, 14th Floor New York, New York 10017 USA Edcomments@ifac.org Tower 42 25 Old Broad Street London EC2N 1HQ

More information

Ordinal Data Modeling

Ordinal Data Modeling Valen E. Johnson James H. Albert Ordinal Data Modeling With 73 illustrations I ". Springer Contents Preface v 1 Review of Classical and Bayesian Inference 1 1.1 Learning about a binomial proportion 1 1.1.1

More information

Using the Testlet Model to Mitigate Test Speededness Effects. James A. Wollack Youngsuk Suh Daniel M. Bolt. University of Wisconsin Madison

Using the Testlet Model to Mitigate Test Speededness Effects. James A. Wollack Youngsuk Suh Daniel M. Bolt. University of Wisconsin Madison Using the Testlet Model to Mitigate Test Speededness Effects James A. Wollack Youngsuk Suh Daniel M. Bolt University of Wisconsin Madison April 12, 2007 Paper presented at the annual meeting of the National

More information

You must answer question 1.

You must answer question 1. Research Methods and Statistics Specialty Area Exam October 28, 2015 Part I: Statistics Committee: Richard Williams (Chair), Elizabeth McClintock, Sarah Mustillo You must answer question 1. 1. Suppose

More information

An Introduction to Missing Data in the Context of Differential Item Functioning

An Introduction to Missing Data in the Context of Differential Item Functioning A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to Practical Assessment, Research & Evaluation. Permission is granted to distribute

More information

Multilevel IRT for group-level diagnosis. Chanho Park Daniel M. Bolt. University of Wisconsin-Madison

Multilevel IRT for group-level diagnosis. Chanho Park Daniel M. Bolt. University of Wisconsin-Madison Group-Level Diagnosis 1 N.B. Please do not cite or distribute. Multilevel IRT for group-level diagnosis Chanho Park Daniel M. Bolt University of Wisconsin-Madison Paper presented at the annual meeting

More information

Selection of Linking Items

Selection of Linking Items Selection of Linking Items Subset of items that maximally reflect the scale information function Denote the scale information as Linear programming solver (in R, lp_solve 5.5) min(y) Subject to θ, θs,

More information

Title: The size of the population potentially in need of palliative care in Germany - An estimation based on death registration data

Title: The size of the population potentially in need of palliative care in Germany - An estimation based on death registration data Author s response to reviews Title: The size of the population potentially in need of palliative care in Germany - An estimation based on death registration data Authors: Nadine Scholten (nadine.scholten@uk-koeln.de)

More information

SUPPLEMENTAL MATERIAL

SUPPLEMENTAL MATERIAL 1 SUPPLEMENTAL MATERIAL Response time and signal detection time distributions SM Fig. 1. Correct response time (thick solid green curve) and error response time densities (dashed red curve), averaged across

More information

Bayesian and Frequentist Approaches

Bayesian and Frequentist Approaches Bayesian and Frequentist Approaches G. Jogesh Babu Penn State University http://sites.stat.psu.edu/ babu http://astrostatistics.psu.edu All models are wrong But some are useful George E. P. Box (son-in-law

More information

Author's response to reviews

Author's response to reviews Author's response to reviews Title: Validation of Doloplus-2 among nonverbal nursing home patients - An evaluation of Doloplus-2 in a clinical setting An evaluating of Doloplus-2 in a clinical setting

More information

Author's response to reviews

Author's response to reviews Author's response to reviews Title: Gender differences in Greek centenarians. A cross-sectional nation-wide study, examining multiple socio-demographic and personality factors and health locus of control.

More information

A Comparison of Robust and Nonparametric Estimators Under the Simple Linear Regression Model

A Comparison of Robust and Nonparametric Estimators Under the Simple Linear Regression Model Nevitt & Tam A Comparison of Robust and Nonparametric Estimators Under the Simple Linear Regression Model Jonathan Nevitt, University of Maryland, College Park Hak P. Tam, National Taiwan Normal University

More information

An Empirical Study on Causal Relationships between Perceived Enjoyment and Perceived Ease of Use

An Empirical Study on Causal Relationships between Perceived Enjoyment and Perceived Ease of Use An Empirical Study on Causal Relationships between Perceived Enjoyment and Perceived Ease of Use Heshan Sun Syracuse University hesun@syr.edu Ping Zhang Syracuse University pzhang@syr.edu ABSTRACT Causality

More information

Parameter Estimation with Mixture Item Response Theory Models: A Monte Carlo Comparison of Maximum Likelihood and Bayesian Methods

Parameter Estimation with Mixture Item Response Theory Models: A Monte Carlo Comparison of Maximum Likelihood and Bayesian Methods Journal of Modern Applied Statistical Methods Volume 11 Issue 1 Article 14 5-1-2012 Parameter Estimation with Mixture Item Response Theory Models: A Monte Carlo Comparison of Maximum Likelihood and Bayesian

More information

SLEEP DISTURBANCE ABOUT SLEEP DISTURBANCE INTRODUCTION TO ASSESSMENT OPTIONS. 6/27/2018 PROMIS Sleep Disturbance Page 1

SLEEP DISTURBANCE ABOUT SLEEP DISTURBANCE INTRODUCTION TO ASSESSMENT OPTIONS. 6/27/2018 PROMIS Sleep Disturbance Page 1 SLEEP DISTURBANCE A brief guide to the PROMIS Sleep Disturbance instruments: ADULT PROMIS Item Bank v1.0 Sleep Disturbance PROMIS Short Form v1.0 Sleep Disturbance 4a PROMIS Short Form v1.0 Sleep Disturbance

More information

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n. University of Groningen Latent instrumental variables Ebbes, P. IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

Title: What 'outliers' tell us about missed opportunities for TB control: a cross-sectional study of patients in Mumbai, India

Title: What 'outliers' tell us about missed opportunities for TB control: a cross-sectional study of patients in Mumbai, India Author's response to reviews Title: What 'outliers' tell us about missed opportunities for TB control: a cross-sectional study of patients in Authors: Anagha Pradhan (anp1002004@yahoo.com) Karina Kielmann

More information

ANXIETY. A brief guide to the PROMIS Anxiety instruments:

ANXIETY. A brief guide to the PROMIS Anxiety instruments: ANXIETY A brief guide to the PROMIS Anxiety instruments: ADULT ADULT CANCER PEDIATRIC PARENT PROXY PROMIS Bank v1.0 Anxiety PROMIS Short Form v1.0 Anxiety 4a PROMIS Short Form v1.0 Anxiety 6a PROMIS Short

More information

An Empirical Assessment of Bivariate Methods for Meta-analysis of Test Accuracy

An Empirical Assessment of Bivariate Methods for Meta-analysis of Test Accuracy Number XX An Empirical Assessment of Bivariate Methods for Meta-analysis of Test Accuracy Prepared for: Agency for Healthcare Research and Quality U.S. Department of Health and Human Services 54 Gaither

More information

Standards for reporting Plain Language Summaries (PLS) for Cochrane Diagnostic Test Accuracy Reviews (interim guidance adapted from Methodological

Standards for reporting Plain Language Summaries (PLS) for Cochrane Diagnostic Test Accuracy Reviews (interim guidance adapted from Methodological s for reporting Plain Language Summaries (PLS) for Cochrane Diagnostic Test Accuracy Reviews (interim guidance adapted from Methodological Expectations of Cochrane Intervention Reviews (MECIR) guidance

More information

A Modified CATSIB Procedure for Detecting Differential Item Function. on Computer-Based Tests. Johnson Ching-hong Li 1. Mark J. Gierl 1.

A Modified CATSIB Procedure for Detecting Differential Item Function. on Computer-Based Tests. Johnson Ching-hong Li 1. Mark J. Gierl 1. Running Head: A MODIFIED CATSIB PROCEDURE FOR DETECTING DIF ITEMS 1 A Modified CATSIB Procedure for Detecting Differential Item Function on Computer-Based Tests Johnson Ching-hong Li 1 Mark J. Gierl 1

More information

TITLE. Author name Author address (school address) INTRODUCTION

TITLE. Author name Author address (school address) INTRODUCTION SCIENTIFIC WRITING Dr. Craig 5/5/12 Skillful scientific writing involves using precise, concise language. Use primarily the first person and active voice (e.g. I conducted the trials, not the trials were

More information

Conflict of interest in randomised controlled surgical trials: Systematic review, qualitative and quantitative analysis

Conflict of interest in randomised controlled surgical trials: Systematic review, qualitative and quantitative analysis Reviewer Assessment Open Access P. Probst, K. Grummich, U. Klaiber, P. Knebel, A.Ulrich, M. W. Büchler, and M. K. Diener* Conflict of interest in randomised controlled surgical trials: Systematic review,

More information

How to Write a Summary

How to Write a Summary How to Write a Summary This PowerPoint is the property of Humber College s Writing Centre and cannot be reproduced without the consent of the Writing Centre - 2015 WRITING TUTORS What is a Summary? Summary

More information

PEER REVIEW HISTORY ARTICLE DETAILS

PEER REVIEW HISTORY ARTICLE DETAILS PEER REVIEW HISTORY BMJ Open publishes all reviews undertaken for accepted manuscripts. Reviewers are asked to complete a checklist review form (http://bmjopen.bmj.com/site/about/resources/checklist.pdf)

More information

Title: Risser Patient Satisfaction Scale: A Validation study in Greek Cancer Patients

Title: Risser Patient Satisfaction Scale: A Validation study in Greek Cancer Patients Author's response to reviews Title: Risser Patient Satisfaction Scale: A Validation study in Greek Cancer Patients Authors: Andreas I Charalambous (andreas.charalambous@cut.ac.cy) Theodoula Adamakidou

More information

Body: Re Should the threshold for definition of impaired fasting glucose be lowered?

Body: Re Should the threshold for definition of impaired fasting glucose be lowered? From: dme_editorial@wiley.com To: inouek@m.u-tokyo.ac.jp CC: dme_editorial@wiley.com Subject: Diabetic Medicine DME-2009-00292 Body: Re Should the threshold for definition of impaired fasting glucose be

More information

Standards for the reporting of new Cochrane Intervention Reviews

Standards for the reporting of new Cochrane Intervention Reviews Methodological Expectations of Cochrane Intervention Reviews (MECIR) Standards for the reporting of new Cochrane Intervention Reviews 24 September 2012 Preface The standards below summarize proposed attributes

More information

Title:Bounding the Per-Protocol Effect in Randomized Trials: An Application to Colorectal Cancer Screening

Title:Bounding the Per-Protocol Effect in Randomized Trials: An Application to Colorectal Cancer Screening Author's response to reviews Title:Bounding the Per-Protocol Effect in Randomized Trials: An Application to Colorectal Cancer Screening Authors: Sonja A Swanson (sswanson@hsph.harvard.edu) Oyvind Holme

More information

Title: ADHD in girls and boys - gender differences in co-existing symptoms and executive function measures

Title: ADHD in girls and boys - gender differences in co-existing symptoms and executive function measures Author's response to reviews Title: ADHD in girls and boys - gender differences in co-existing symptoms and executive function measures Authors: Erik W Skogli (erik.skogli@sykehuset-innlandet.no) Martin

More information

Title:The self-reported health of U.S. flight attendants compared to the general population

Title:The self-reported health of U.S. flight attendants compared to the general population Author's response to reviews Title:The self-reported health of U.S. flight attendants compared to the general population Authors: Eileen McNeely (emcneely@hsph.harvard.edu) Version:4Date:30 January 2014

More information

Manuscript ID BMJ R1 entitled "Education and coronary heart disease: a Mendelian randomization study"

Manuscript ID BMJ R1 entitled Education and coronary heart disease: a Mendelian randomization study BMJ - Decision on Manuscript ID BMJ.2017.03 7504.R1 Body: 11-May-2017 Dear Dr. Tillmann Manuscript ID BMJ.2017.037504.R1 entitled "Education and coronary heart disease: a Mendelian randomization study"

More information

RE: IESBA s Exposure Draft Proposed Revisions Pertaining to Safeguards in the Code Phase 1

RE: IESBA s Exposure Draft Proposed Revisions Pertaining to Safeguards in the Code Phase 1 31 March 2016 Mr. Ken Siong Technical Director International Ethics Standards Board for Accountants (IESBA) International Federation of Accountants (IFAC) 529 Fifth Avenue, 6 th Floor New York, New York

More information

Title: Survival endpoints in colorectal cancer. The effect of second primary other cancer on disease free survival.

Title: Survival endpoints in colorectal cancer. The effect of second primary other cancer on disease free survival. Author's response to reviews Title: Survival endpoints in colorectal cancer. The effect of second primary other cancer on disease free survival. Authors: Helgi Birgisson (helgi.birgisson@surgsci.uu.se)

More information

Jason L. Meyers. Ahmet Turhan. Steven J. Fitzpatrick. Pearson. Paper presented at the annual meeting of the

Jason L. Meyers. Ahmet Turhan. Steven J. Fitzpatrick. Pearson. Paper presented at the annual meeting of the Performance of Ability Estimation Methods for Writing Assessments under Conditio ns of Multidime nsionality Jason L. Meyers Ahmet Turhan Steven J. Fitzpatrick Pearson Paper presented at the annual meeting

More information

Review of Veterinary Epidemiologic Research by Dohoo, Martin, and Stryhn

Review of Veterinary Epidemiologic Research by Dohoo, Martin, and Stryhn The Stata Journal (2004) 4, Number 1, pp. 89 92 Review of Veterinary Epidemiologic Research by Dohoo, Martin, and Stryhn Laurent Audigé AO Foundation laurent.audige@aofoundation.org Abstract. The new book

More information

Sensitivity of DFIT Tests of Measurement Invariance for Likert Data

Sensitivity of DFIT Tests of Measurement Invariance for Likert Data Meade, A. W. & Lautenschlager, G. J. (2005, April). Sensitivity of DFIT Tests of Measurement Invariance for Likert Data. Paper presented at the 20 th Annual Conference of the Society for Industrial and

More information

Alternative Methods for Assessing the Fit of Structural Equation Models in Developmental Research

Alternative Methods for Assessing the Fit of Structural Equation Models in Developmental Research Alternative Methods for Assessing the Fit of Structural Equation Models in Developmental Research Michael T. Willoughby, B.S. & Patrick J. Curran, Ph.D. Duke University Abstract Structural Equation Modeling

More information

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison

Empowered by Psychometrics The Fundamentals of Psychometrics. Jim Wollack University of Wisconsin Madison Empowered by Psychometrics The Fundamentals of Psychometrics Jim Wollack University of Wisconsin Madison Psycho-what? Psychometrics is the field of study concerned with the measurement of mental and psychological

More information

THE APPLICATION OF ORDINAL LOGISTIC HEIRARCHICAL LINEAR MODELING IN ITEM RESPONSE THEORY FOR THE PURPOSES OF DIFFERENTIAL ITEM FUNCTIONING DETECTION

THE APPLICATION OF ORDINAL LOGISTIC HEIRARCHICAL LINEAR MODELING IN ITEM RESPONSE THEORY FOR THE PURPOSES OF DIFFERENTIAL ITEM FUNCTIONING DETECTION THE APPLICATION OF ORDINAL LOGISTIC HEIRARCHICAL LINEAR MODELING IN ITEM RESPONSE THEORY FOR THE PURPOSES OF DIFFERENTIAL ITEM FUNCTIONING DETECTION Timothy Olsen HLM II Dr. Gagne ABSTRACT Recent advances

More information

Termination Criteria in Computerized Adaptive Tests: Variable-Length CATs Are Not Biased. Ben Babcock and David J. Weiss University of Minnesota

Termination Criteria in Computerized Adaptive Tests: Variable-Length CATs Are Not Biased. Ben Babcock and David J. Weiss University of Minnesota Termination Criteria in Computerized Adaptive Tests: Variable-Length CATs Are Not Biased Ben Babcock and David J. Weiss University of Minnesota Presented at the Realities of CAT Paper Session, June 2,

More information