Blinded, independent, central image review in oncology trials: how the study endpoint impacts the adjudication rate

Poster No.: C-0200
Congress: ECR 2014
Type: Scientific Exhibit
Authors: O. Bohnsack, M. Lesch, A. Urbank; Berlin/DE
Keywords: Oncology, Computer applications, CT, MR, Computer Applications-Detection, diagnosis, Cancer, Image verification
DOI: 10.1594/ecr2014/C-0200

Aims and objectives

The Adjudication Challenge

A persistent concern with blinded, independent, central review of medical images is the adjudication rate between reviewers. Imaging in oncology clinical trials is gaining rather than losing importance. However, whenever images are read by more than one reviewer, adjudication will occur, and it seems that the adjudication rate will never be 0%. We therefore set out to answer two questions:

- What is an acceptable adjudication rate?
- How does its mere existence affect trial endpoints?

Based on an analysis of review data from oncology studies with different indications and endpoints, we determined:

- the adjudication rates
- their different definitions and meanings
- how they are derived
- their value and use in these clinical trials
- their impact on data validity for study teams

While the occurrence of adjudication does not necessarily imply that one of the reviewers made a mistake in assessing a patient's radiographic images, it does reveal a discrepancy in opinion, lesion selection, tumor burden evaluation, lesion measurement, or qualitative assessment. Each of these leads to a discrepancy in the review results, which in turn affects the study results.

We focus on review design and adjudication based on RECIST evaluation. RECIST is meant to rely on just three assessment aspects: 1) quantitative: measuring the diameters of target lesions; 2) qualitative: assessing non-target lesions, which are not measured; 3) identifying new lesions. These three aspects are the source of unwanted discrepancies.

One would assume that many years of radiology experience, ideally including experience as an independent reviewer, is the driving factor for a low adjudication rate. Our analysis shows that the reviewers' experience and CV-based qualification do not seem to have a major influence on how often these experts disagree. Pairing tightly monitored reviewers, however, can have such an impact. But does an artificially decreased adjudication rate mean that the rate is better or that the results are more correct? And what are the ideas and options for reducing the adjudication rate in future oncology trials?

Fig. 1: Tough choices for an adjudicator

Methods and materials

The Adjudication Method

The design of adjudication can vary by study, indication, and study design. For standard oncology studies with response evaluation according to RECIST, or according to IWG criteria for lymphoma, Perceptive Informatics uses the following model (see Figure 2): when the two primary readers disagree, their readings are given to a third radiologist for adjudication. This adjudicator reviews the data from the two primary radiologists' assessments, decides which of the two is "more right", and thereby determines the final radiologic outcome for the case. That outcome must be one of the two primary reviewers' assessments; the adjudicator is not allowed to bring in a third opinion.

Scenario in Figure 2: adjudication is triggered by a difference in timepoint response assessment between reviewer green and reviewer orange. In this example, the two reviewers select different measurable target lesions from the total lesion burden at baseline. At timepoint 2 a single lesion increases, and only one of the reviewers had selected it as a target. Reviewer orange correctly calculates progression, whereas reviewer green correctly calculates partial response. Both reviewers' assessments are correct. This is one example of how differently chosen lesions can trigger adjudication without any reviewer error.

Table 1 summarizes the Perceptive Informatics Imaging recommendations for adjudication triggers, based on the clinical endpoints defined by the currently applicable FDA guidelines from 2007. We recommend choosing the triggers in accordance with the defined imaging-related study endpoints. Note that we do not recommend timepoint-by-timepoint adjudication, in order to simplify the review design and to reduce the adjudication rate.
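The Figure 2 scenario can be reproduced with a minimal Python sketch. It assumes the RECIST 1.1 target-lesion thresholds (partial response at a decrease of at least 30% from the baseline sum of diameters; progression at an increase of at least 20% and at least 5 mm over the smallest sum on study) and ignores non-target and new lesions. The lesion sizes, reviewer selections, and function names are hypothetical and are not taken from the poster.

```python
def target_response(baseline_sum, nadir_sum, current_sum):
    """Target-lesion response using RECIST 1.1 thresholds (simplified)."""
    if current_sum == 0:
        return "CR"  # simplification: all target lesions have disappeared
    increase = current_sum - nadir_sum
    # PD: at least 20% and at least 5 mm above the smallest sum on study
    if increase >= 5 and increase >= 0.20 * nadir_sum:
        return "PD"
    # PR: at least 30% below the baseline sum
    if (baseline_sum - current_sum) / baseline_sum >= 0.30:
        return "PR"
    return "SD"


# Hypothetical longest diameters in mm: (baseline, timepoint 2)
lesions_mm = {"A": (40, 20), "B": (30, 20), "C": (20, 45)}

# Hypothetical target-lesion selection made by each primary reviewer
targets = {"green": ["A", "B"], "orange": ["B", "C"]}


def read(reviewer):
    """Timepoint 2 response for one reviewer's own target lesions."""
    base = sum(lesions_mm[name][0] for name in targets[reviewer])
    tp2 = sum(lesions_mm[name][1] for name in targets[reviewer])
    nadir = base  # baseline is the only earlier timepoint in this example
    return target_response(base, nadir, tp2)


def adjudicate(resp_a, resp_b, adjudicator_choice):
    """Final outcome; the adjudicator may only endorse one primary read."""
    if resp_a == resp_b:
        return resp_a  # agreement, no adjudication needed
    if adjudicator_choice not in (resp_a, resp_b):
        raise ValueError("adjudicator may not introduce a third opinion")
    return adjudicator_choice


green, orange = read("green"), read("orange")
print(green, orange, "adjudication triggered:", green != orange)
```

With these invented values, reviewer green's targets (A, B) shrink from 70 mm to 40 mm, giving partial response, while reviewer orange's targets (B, C) grow from 50 mm to 65 mm, giving progression. Adjudication is triggered although neither reviewer made an error.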

Fig. 2: Standard oncology adjudication model


Table 1: Adjudication based on "imaging" related study endpoints (aligned with the FDA guidelines, 2007)

Results

The Adjudication Truth

Adjudication rates can be derived in different ways, with different meanings and therefore different impact. The four exemplary scenarios in Table 2 show how various discrepancy counts, computed with or without the baseline, significantly change the published adjudication rate. Baseline timepoints should not be included in the adjudication rate calculation, since no response assessment is made at baseline, only the selection of lesions. Even those values, however, are not yet "the real truth": to compute the truly discrepant timepoints, one would have to use only each patient's actual timepoints.

Table 2: Discrepancy computations, with and without baseline
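How much the choice of denominator changes the reported rate can be seen in a short Python sketch. The reads below are invented for illustration and do not reproduce the scenarios in Table 2; the function and key names are likewise assumptions.

```python
# Hypothetical reads: patient -> (reader 1, reader 2) response per follow-up
# timepoint. Each patient also has one baseline timepoint, which carries no
# response assessment and therefore can never be discrepant.
reads = {
    "P1": [("PR", "PR"), ("PR", "PD")],
    "P2": [("SD", "SD")],
    "P3": [("SD", "PR"), ("PD", "PD")],
}


def adjudication_rates(reads):
    """Return the same discrepancies expressed over three denominators."""
    followups = sum(len(tps) for tps in reads.values())
    baselines = len(reads)  # one baseline per patient
    discrepant = sum(r1 != r2 for tps in reads.values() for r1, r2 in tps)
    patients_adjudicated = sum(
        any(r1 != r2 for r1, r2 in tps) for tps in reads.values()
    )
    return {
        "per_timepoint_incl_baseline": discrepant / (followups + baselines),
        "per_timepoint_excl_baseline": discrepant / followups,
        "per_patient": patients_adjudicated / len(reads),
    }


for name, rate in adjudication_rates(reads).items():
    print(f"{name}: {rate:.1%}")
```

In this toy data set the same two discrepant timepoints yield 25.0% when baselines are counted, 40.0% when they are excluded, and 66.7% when counted per patient, which is why a published rate is only interpretable together with its definition.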

Conclusion

The Adjudication Message

Based on Table 1, we analysed the Perceptive internal database and summarize our findings in Table 3. The five most common cancer types in oncology studies were chosen to determine whether the indication could affect the adjudication rate. An average of 2-3 studies per indication was used to calculate an overall percentage per indication in relation to the study imaging endpoints. To allow discussion of a standard adjudication rate across all indications, an average adjudication rate was also computed. The outcome is presented separately for the two main imaging-related endpoints, to visualize the impact of the study endpoint on the adjudication rate:

1. Timepoint-by-timepoint response assessment
2. Progression vs. non-progression

The adjudication rate is reported separately for the total number of patients, for discrepant timepoint assessments including the baseline, and for discrepant timepoint assessments excluding the baseline.

The database analysis shows that the adjudication rate correlates with both the indication and the study endpoint. A global average adjudication rate may not reflect the truth, since the rate clearly depends on the indication itself. In particular, a higher adjudication rate is observed in ovarian cancer and breast cancer than in lung, prostate, renal, and colorectal cancer. Each indication has its own challenges: evaluating lymph nodes in breast cancer, distinguishing a benign cyst from a malignancy in ovarian cancer, or deciding whether lesions can be considered measurable.

Study teams commonly and reasonably question imaging-derived data and its validity, seeking meaning, clarification, and understanding.

"The higher the adjudication rate, the more questionable the credibility of my data?" The rate is expected to be higher for a response rate study than for a progression-free survival study.

"Is data from central review analysis more powerful than local sites' assessments?" Relying purely on the investigators' analysis bears the risk that other factors play a decisive role in whether to keep a patient on study or to change treatment. It is very difficult to expect pure objectivity and purely image-based treatment decisions. The rigour needed for robust data analysis with standardized, reproducible results per patient is nearly impossible to achieve in daily hospital routine. Misinterpretations, differing lesion selection, different approaches, or plain errors will simply not be captured in as much detail as in a central review with unbiased, blinded reviewers who follow strict Charter-defined rules.

"How can the adjudication rate be reduced?" We see different ways to reduce the adjudication rate. Overall, the "preventive" approach starts with the selection of experienced readers. Nevertheless, this does not reduce the need for thorough reviewer training and reviewer oversight during the course of the study. The training should be representative of the imaging and review scenarios expected in the trial. A very good way to address or resolve differences in interpretive "style" is evaluation in consensus and individual sessions, which is fundamental to ensuring reviewer agreement and promotes a uniform assessment approach. It either raises or lowers the bar between highly conservative and less conservative reading. If you want your adjudication rate to be 0%, then do not use double reads; choose single reads instead.

The Unspoken Truth

"The adjudication rate will never be 0% as long as evaluation is in human hands!"

Table 3: Adjudication based on imaging related endpoints per indication
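The conclusion's expectation that a response rate study sees more adjudication than a progression-free survival study can be illustrated with a minimal Python sketch. The two trigger rules below are assumptions made for illustration: a timepoint-by-timepoint trigger fires on any differing response, and a progression trigger fires only when the two readers disagree on whether or when progression first occurs. They are not the actual trigger definitions of Table 1, and the response sequences are invented.

```python
def first_progression(responses):
    """Index of the first timepoint assessed as PD, or None if there is none."""
    for index, response in enumerate(responses):
        if response == "PD":
            return index
    return None


def trigger_timepoint_by_timepoint(reader_1, reader_2):
    """Assumed rule: adjudicate if any timepoint response differs."""
    return any(a != b for a, b in zip(reader_1, reader_2))


def trigger_progression(reader_1, reader_2):
    """Assumed rule: adjudicate only if progression status or timing differs."""
    return first_progression(reader_1) != first_progression(reader_2)


# Hypothetical follow-up responses for one patient (same timepoints per reader)
green = ["SD", "PR", "PD"]
orange = ["PR", "PR", "PD"]

print("timepoint-by-timepoint:", trigger_timepoint_by_timepoint(green, orange))
print("progression vs. non-progression:", trigger_progression(green, orange))
```

Here the readers differ only at the first follow-up (SD vs. PR) and agree on the progression timepoint, so the case would be adjudicated under the timepoint-by-timepoint rule but not under the progression rule.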

Personal information

Oliver Bohnsack; Perceptive Informatics, Inc., A PAREXEL Company
Manuela Lesch; Perceptive Informatics, Inc., A PAREXEL Company
Anja Urbank; Perceptive Informatics, Inc., A PAREXEL Company

Perceptive Informatics, Imaging, Berlin/DE
oliver.bohnsack@parexel.com


References

1. US Department of Health and Human Services, Food and Drug Administration. Clinical Trial Endpoints for the Approval of Cancer Drugs and Biologics. FDA, Rockville, MD, 2007.
2. Borradaile K, Ford R, O'Neal M, Byrne K. "Discordance Between BIRCR Readers." Applied Clinical Trials Online, Supplement, November 2010.
3. Ford R, Schwartz L, Dancey J, Dodd LE, Eisenhauer EA, Gwyther S, Rubinstein L, Sargent D, Shankar L, Therasse P, Verweij J. "Lessons learned from independent central review." Eur J Cancer. 2009 Jan;45(2):268-74. doi: 10.1016/j.ejca.2008.10.031.
4. Therasse P, Arbuck SG, Eisenhauer EA, et al. "New Guidelines to Evaluate the Response to Treatment in Solid Tumors (RECIST Guidelines)." Journal of the National Cancer Institute 92 (3) 205-216 (2000).
5. Eisenhauer EA, Therasse P, Bogaerts J, et al. "New Response Evaluation Criteria in Solid Tumours: Revised RECIST Guideline (Version 1.1)." European Journal of Cancer 45 (3) 228-247 (2009).