Accurate Automated Volumetry of Cartilage of the Knee using Convolutional Neural Networks: Data from the Osteoarthritis Initiative 1

Similar documents
Knee Menisci Segmentation using Convolutional Neural Networks: Data from the Osteoarthritis Initiative 1

Osteoarthritis and Cartilage 18 (2010)

Accuracy and Precision of Quantitative Assessment of Cartilage Morphology by Magnetic Resonance Imaging at 3.0T

kmri_qcart_eckstein_descrip 8/7/2017 1

International Cartilage Repair Society

A Method to Monitor Local Changes in MR Signal Intensity in Articular Cartilage: A Potential Marker for Cartilage Degeneration in Osteoarthritis

arxiv: v1 [cs.cv] 9 Sep 2017

arxiv: v1 [cs.cv] 17 Aug 2017

arxiv: v1 [cs.cv] 13 Jul 2018

Segmentation of Cell Membrane and Nucleus by Improving Pix2pix

arxiv: v2 [cs.cv] 8 Mar 2018

Automatic Prostate Cancer Classification using Deep Learning. Ida Arvidsson Centre for Mathematical Sciences, Lund University, Sweden

Osteoarthritis and Cartilage 18 (2010) 1275e1283

arxiv: v1 [cs.cv] 3 Apr 2018

Y-Net: Joint Segmentation and Classification for Diagnosis of Breast Biopsy Images

Automatic Detection of Knee Joints and Quantification of Knee Osteoarthritis Severity using Convolutional Neural Networks

POC Brain Tumor Segmentation. vlife Use Case

An Articulated Statistical Shape Model of the Human Knee

HHS Public Access Author manuscript Med Image Comput Comput Assist Interv. Author manuscript; available in PMC 2018 January 04.

Tibial coverage, meniscus position, size and damage in knees discordant for joint space. narrowing data from the Osteoarthritis Initiative

Skin cancer reorganization and classification with deep neural network

Osteoarthritis and Cartilage 19 (2011) 689e699

arxiv: v1 [cs.cv] 24 Jul 2018

W. Wirth¹, F. Eckstein¹, M. Hudelmaier¹, L.S. Lohmander², R.Frobell²

arxiv: v1 [cs.cv] 12 Jun 2018

Direct comparison of fixed flexion, radiography and MRI in knee osteoarthritis: responsiveness data from the Osteoarthritis Initiative

International Cartilage Repair Society

MRI of Cartilage. D. BENDAHAN (PhD)

Comparative Study of K-means, Gaussian Mixture Model, Fuzzy C-means algorithms for Brain Tumor Segmentation

Automatic Quality Assessment of Cardiac MRI

Highly Accurate Brain Stroke Diagnostic System and Generative Lesion Model. Junghwan Cho, Ph.D. CAIDE Systems, Inc. Deep Learning R&D Team

arxiv: v2 [cs.cv] 19 Dec 2017

Automatic Atlas-based Three-label Cartilage Segmentation from MR Knee Images

Imorphics OA Knee MRI Measurements

Osteoarthritis and Cartilage 18 (2010) 1402e1407

arxiv: v1 [cs.cv] 21 Mar 2018

Differentiating Tumor and Edema in Brain Magnetic Resonance Images Using a Convolutional Neural Network

Quantification of Biomechanical Imaging Biomarkers. Sudhakar Tummala

In vivo diffusion tensor imaging (DTI) of articular cartilage as a biomarker for osteoarthritis

GTC 2018, San Jose, USA

arxiv: v3 [cs.cv] 26 May 2018

CSE Introduction to High-Perfomance Deep Learning ImageNet & VGG. Jihyung Kil

Brain tissue and white matter lesion volume analysis in diabetes mellitus type 2

Convolutional Neural Networks for Estimating Left Ventricular Volume

Early Diagnosis of Autism Disease by Multi-channel CNNs

D. Doré 1, C. Ding 1,2, J.P. Pelletier 3, J. Martel-Pelletier 3, F. Cicuttini 2, G. Jones 1.

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING 1

Review Article Quantitative Cartilage Imaging in Knee Osteoarthritis

International Cartilage Repair Society

arxiv: v1 [cs.cv] 9 Oct 2018

NIH Public Access Author Manuscript Proc SPIE. Author manuscript; available in PMC 2014 February 07.

Final Report: Automated Semantic Segmentation of Volumetric Cardiovascular Features and Disease Assessment

White Rose Research Online URL for this paper: Version: Accepted Version

dgemric Effectively Predicts Cartilage Damage Associated with Femoroacetabular Impingement

CLINICAL OUTCOME & MORPHOMETRY OVER 2 & 5 YEARS -

The Relationship Between Cartilage Loss on Magnetic Resonance Imaging and Radiographic Progression in Men and Women With Knee Osteoarthritis

arxiv: v2 [cs.cv] 7 Jun 2018

Magnetic Resonance Contrast Prediction Using Deep Learning

Effective Feature Extraction Based Automatic Knee Osteoarthritis Detection and Classification using Neural Network

Automated Volumetric Cardiac Ultrasound Analysis

CLINICAL OUTCOME & MORPHOMETRY OVER 2 & 5 YEARS - data from a treatment RCT on acute ACL injury

Quantitative magnetic resonance imaging of osteoarthritis

Osteoarthritis and Cartilage 18 (2010) 1393e1401

K nowledge on the deformational behaviour of articular

RECENT ADVANCES IN CLINICAL MR OF ARTICULAR CARTILAGE

arxiv: v1 [cs.lg] 4 Feb 2019

Investigating the loading behaviour of intact and meniscectomy knee joints and the impact on surgical decisions

A fully automated system for quantification of knee bone marrow lesions using MRI and the osteoarthritis initiative cohort

Comparison of Improved Semi-Automated Segmentation Technique with Manual Segmentation: Data from the Osteoarthritis Initiative

High-fidelity Imaging Response Detection in Multiple Sclerosis

Bone marrow lesions in knee osteoarthritis: MR-assessment by manual segmentation and computer-assisted tresholding

Detection of Mild Cognitive Impairment using Image Differences and Clinical Features

This presentation is the intellectual property of the author. Contact them for permission to reprint and/or distribute.

This presentation is the intellectual property of the author. Contact them at for permission to reprint and/or distribute.

Medial Knee Osteoarthritis Precedes Medial Meniscal Posterior Root Tear with an Event of Painful Popping

Methods of MR Fat Quantification and their Pros and Cons

Accepted Manuscript. Anthony Paproki, Craig Engstrom, Shekhar S. Chandra, Ales Neubert, Jurgen Fripp, Stuart Crozier

Classification of Alzheimer s disease subjects from MRI using the principle of consensus segmentation

International Cartilage Repair Society

SCALENE ASIA PACIFIC SDN BHD ROTATIONAL FIELD QUANTUM NUCLEAR MAGNETIC RESONANCE (RFQMR) IN TREATMENT OF OSTEOARTHRITIS OF THE KNEE JOINT

Medivir announces positive topline results from phase IIa osteoarthritis study, showing disease-modifying benefit of MIV-711 on joint structure

Image Classification with TensorFlow: Radiomics 1p/19q Chromosome Status Classification Using Deep Learning

Statistical Classification and level Set Methods in Medical Image Analysis Quantitative Evaluation of Articular Cartilage in Knee MRI

Relationship between knee pain and the presence, location, size and phenotype of femorotibial denuded areas of subchondral bone as visualized by MRI

Knee Cartilage Extraction and Bone-Cartilage Interface Analysis from 3D MRI Data Sets

arxiv: v1 [cs.cv] 1 Oct 2018

ACTIVE LEARNING OF THE GROUND TRUTH FOR RETINAL IMAGE SEGMENTATION

Figure 1: MRI Scanning [2]

Thanks. As something to help remember. Thanks 07/02/14. Arthroscopy TKR V Uni. Arthroscopy TKR V Uni. Role of arthroscopy to choose operation

A deep learning model integrating FCNNs and CRFs for brain tumor segmentation

Segmentation of White Matter Lesions from Volumetric MR Images

A rapid, novel method of volumetric assessment of MRI-detected subchondral bone marrow lesions in knee osteoarthritis

Weight loss in obese people has structure-modifying effects on medial but not on lateral knee articular cartilage

the new accurate analysis and classification system of knee joint osteoarthritis

arxiv: v1 [cs.cv] 30 May 2018

Xingzhong (Jason) Jin

An Edge-Device for Accurate Seizure Detection in the IoT

Research Article Relationship between Pain and Medial Meniscal Extrusion in Knee Osteoarthritis

The latest developments - Automated Breast Volume Scanning. Dr. med. M. Golatta

Transcription:

Zuse Institute Berlin Takustr. 7 14195 Berlin Germany ALEXANDER TACK, STEFAN ZACHOW Accurate Automated Volumetry of Cartilage of the Knee using Convolutional Neural Networks: Data from the Osteoarthritis Initiative 1 1 Preprint submitted to the IEEE International Symposium on Biomedical Imaging (ISBI) conference, 2019. Copyright of the final article was transferred to the IEEE. Readers are alterted to their obligations with respect to copyrighted material. 978-1-5386-5541- 2/18/$31.00 c 2019 IEEE ZIB Report 19-05 (January 2019)

Zuse Institute Berlin Takustr. 7 14195 Berlin Germany Telephone: +49 30-84185-0 Telefax: +49 30-84185-125 E-mail: bibliothek@zib.de URL: http://www.zib.de ZIB-Report (Print) ISSN 1438-0064 ZIB-Report (Internet) ISSN 2192-7782

Accurate Automated Volumetry of Cartilage of the Knee using Convolutional Neural Networks: Data from the Osteoarthritis Initiative Alexander Tack 1 and Stefan Zachow 1 Zuse Institute Berlin, Berlin, Germany, tack@zib.de Abstract. Volumetry of the cartilage of the knee, as needed for the assessment of knee osteoarthritis (KOA), is typically performed in a tedious and subjective process. We present an automated segmentationbased method for the quantification of cartilage volume by employing 3D Convolutional Neural Networks (CNNs). CNNs were trained in a supervised manner using magnetic resonance imaging data as well as cartilage volumetry readings given by clinical experts for 1378 subjects. It was shown that 3D CNNs can be employed for cartilage volumetry with an accuracy similar to expert volumetry readings. In future, accurate automated cartilage volumetry might support both, diagnosis of KOA as well as assessment of KOA progression via longitudinal analysis. Keywords: Deep learning, imaging biomarker, radiomics, cartilage morphometry, volume assessment 1 Introduction Knee osteoarthritis (KOA) is a degenerative disease which causes pain and decreased mobility. It affects over 40 million Europeans and the costs are estimated at 0.5% of the gross national product [1]. Articular cartilage is one of

the most relevant tissues involved in the disease process. It is recommended to employ Magnetic Resonance Imaging (MRI) for assessment of cartilage degeneration [2]. Several biomarkers derived from MRI data such as cartilage volume, cartilage thickness, and denuded cartilage area are used to assess the risk for KOA development and KOA progression [3 6]. In particular, quantification of longitudinal loss of cartilage volume shows potential for this assessment. Thus, it is desired to apply cartilage volumetry to the data of large epidemiologic studies such as the Osteoarthritis Initiative (OAI) [7], the Study of Health In Pomerania [8], or a recently performed BMBF study [9]. This, however, requires precise 3D segmentations for at least two time points. Manual cartilage segmentation is time-consuming, tedious, and results in large inter- and intra-observer variations, rendering a longitudinal analysis challenging. Moreover, Graichen et al. [10] reported absolute differences between cartilage volume analysis based on MRI and the patients actual in vivo cartilage volume of 10.5% for the lateral tibial cartilage (LTC) and 11.5% for the medial tibial cartilage (MTC). The systematic difference was 3.6% and 3.1%, respectively [10]. For supervised training of automated segmentation methods, a dataset containing segmentation masks of the articular cartilage is accessible via the OAI database. This dataset was already used in previous approaches, e.g. for training k-nearest neighbour classification [11], 3D CNNs segmenting multiple small MRI subvolumes [12, 13], and 3D CNNs segmenting one larger MRI subvolume [14]. However, the given segmentation masks were generated using a semi-automated segmentation tool. For this reason, they are not sufficiently precise to train an algorithm and to evaluate its clinical value. In this study we trained 3D CNNs employing reliable volumetry data from the OAI. Since no segmentation masks were provided together with these data, we generated our own. We utilized the OAI reference segmentations to compare

our method to other approaches (cf. [11 14]). Finally, we showed that 3D CNNs can be trained via accurate segmentation masks and volumetric readings such that the agreement between automated cartilage volumetry and clinical experts volume readings is significantly improved compared to existing approaches. 2 Materials and Methods Sagittal Double Echo Steady State MRI data of 1378 subjects from the OAI were utilized for our study. We employed the method of Ambellan et al. [13] for fully automated pre-segmentation of the MRI data (i.e. the tibial bone and the tibial cartilage). 3D CNNs were trained using these automatically computed masks and the effect of adding experts cartilage volume readings as an additional criterion to the loss function was evaluated for the purpose of establishing a fully automated and accurate method to determine the volume of cartilage. Finally, the segmentation accuracy was quantified on OAI reference segmentations. Our work yields two major contributions: i) Training 3D CNNs using automatically computed segmentation masks. The results outperform established methods for articular cartilage segmentation. ii) Adding an additional loss term considering the volume difference between automated volumetry and volumetric measures given by experts. The results of our automated method is as accurate as the experts volumetry. 2.1 MRI datasets Our methods were assessed on MRI data from the OAI: Dataset Chondrometrics: Measures of tibial cartilage volume for 1378 subjects provided by Chondrometrics (Ainring, Germany).

Dataset Imorphics: Segmentations of tibial cartilage for 88 subjects (2 time points: baseline and 12-months follow-up) provided by Imorphics (Manchester, UK). These two datasets do not share any common subjects. Dataset Chondrometrics consists of a larger sample size than Dataset Imorphics and high trust can be put into the volumetry readings performed by the clinical experts of Chondrometrics. Unfortunately, no segmentation masks were supplied by the OAI database for the subjects of this dataset. 2.2 Articular cartilage volumetry employing 3D CNNs 3D CNN architecture 3D U-Nets with an architecture similar to the 2D U-Nets as proposed by Ronneberger et al. [15] were employed (for more details see e.g. [12]). To consider memory consumption, regions of interest (ROIs) were computed such that the data was reduced to a dimension of 160 64 128 (anterior-posterior, superiorinferior, lateral-medial cf. Fig. 1). We utilized the fact that tibial cartilage is located on the tibial plateau. Hence, a subvolume of the MRI as well as one of the corresponding segmentation mask were extracted at the superior margin of the segmented tibial bone for supervised training. The employed 3D CNNs had five convolutional layers with two 3 3 3 convolutions per layer and an increasing number of kernels (32, 64, 128, 256, 512). The 3 3 3 convolutions were activated by ReLUs, except for the last 1 1 1 convolution in which a sigmoid function was utilized. In each layer the first convolution was followed by 10% Dropout.

Fig. 1. Illustration of a 3D ROI (wire box). Volume rendering of segmentation masks of tibial bone (gray), medial tibial cartilage (orange), and lateral tibial cartilage (blue). Training and Evaluation Approaches The quality of CNNs in the context of semantic image segmentation is often measured as how well a segmentation result matches a gold standard, as for instance being given by human experts. In this study, two different approaches for such an evaluation were investigated. In a first Approach A we evaluated the potential of 3D CNNs trained on automatic pre-segmentations. The Dice Similarity Coefficient (DSC) was employed as a two-class loss function (class 1: MTC; class 2: LTC). We hypothesize that these CNNs are capable to learn the semantic information provided by the training data and to generalize well for the data of Dataset Imorphics. However, this training data was not created by clinical experts using appropriate tools. Thus, albeit an adequate agreement to the cartilage volume measures by Chondrometrics is desired, this cannot be expected employing Approach A. For this reason, we further evaluated a second Approach B, in which the volumetry information provided by Chondrometrics is additionally taken into account during training. Hence, in Approach B, the

(a) Appr. A: MTC (b) Appr. A: LTC (c) Appr. B: MTC (d) Appr. B: LTC Fig. 2. Bland-Altman plots for Approach A (left) and Approach B (right). Comparison of our automated MTC and LTC volume measures vs. those provided by Chondrometrics. Volume Difference (VD) was considered for MTC and LTC, respectively. The VD is defined as VD = 100 C Y Y, (1) with C being the gold standard measures of MTC and LTC volume as provided by Chondrometrics and Y being the respective automated segmentation. The total loss function L in Approach B was given as the sum of DSC and VD: L = DSC + αvd, (2) where VD was weighted during training by the factor α R + in order to equalize the contribution of the two summands (after each epoch α was updated). Both approaches were implemented using TensorFlow and Keras and trained using an NVIDIA Tesla P100 GPU. 3 Experiments and results 3D CNNs were trained on Dataset Chondrometrics using the two different loss functions as described in 2.2. For this purpose the dataset was randomly divided into two groups and both approaches were trained in a two-fold cross-validation setting employing the respective dataset only. As a means of data augmentation additional subvolumes were extracted at three random positions around the

cartilage (normal distribution: µ = 0 voxel, σ = 20). This procedure resulted in 2756 training images for each group of the cross-validation split. As shown in Table I for the data of Dataset Imorphics, Approach A yielded a segmentation accuracy of 88.02 ± 4.62 for MTC and 91.27 ± 2.33 for LTC at baseline and 87.43 ± 4.02 and 90.78 ± 2.42 at 12-months follow-up (12m). Moreover, Approach A resulted in a mean volume difference to the values of Chondrometrics of 22.7% ± 13.5 for MTC and 34.0% ± 22.4 for LTC (Fig. 2). Approach B yielded higher volume agreement on Dataset Chondrometrics with 0.3% ± 8.75 for MTC and 6.2% ± 9.4 for LTC (Fig. 2), but in Dataset Imorphics the segmentation accuracy decreased slightly to 82.85 ± 5.53 for MTC and 86.11 ± 4.37 for LTC at baseline and 82.27 ± 5.80 and 85.83 ± 4.30 at 12m. In comparison, the method proposed by Dam et al. over-estimated the cartilage volume on average by 4% for MTC and 14% for LTC compared to the measures by Chondrometrics [11]. We confirmed the over-segmentation via the method of Ambellan et al., where the Imorphics data acted as gold standard for training and no volumetric loss was used. Applying the method of Ambellan et al. to the data of Dataset Chondrometrics, the volume error was 19.0% and 27.6% for MTC and LTC, respectively. Compared to Dam et al. and Ambellan et al., Approach B achieved a similar segmentation accuracy w.r.t. the DSC on Dataset Imorphics but yielded volumes clearly closer to the ones provided by the clinical experts of Chondrometrics. Complete volumetric analysis of the tibial cartilage took on average approx. 5s per knee on a workstation with NVIDIA GTX 1080 Ti GPU. 4 Discussion and conclusion Employing Approach A, the segmentation accuracy outperformed previous methods as measured with the DSC for data of Dataset Imorphics. We hypoth-

Table I. Results of different methods for the automated segmentation of MTC and LTC in Dataset Imorphics (baseline and 12-months follow-up time point). baseline MTC LTC Dam et al. [11] 81.20 ± 5.50 86.60 ± 3.40 Raj et al. [14] 80.66 85.65 Tack et al. [12] 85.13 ± 10.5 90.23 ± 4.64 Ambellan et al. [13] 86.10 ± 5.33 90.40 ± 2.42 Approach B 82.85 ± 5.53 86.11 ± 4.37 Approach A 88.02 ± 4.62 91.27 ± 2.33 12m MTC LTC Dam et al. [11] Raj et al. [14] Tack et al. [12] 85.86 ± 5.03 90.20 ± 2.64 Ambellan et al. [13] 85.80 ± 5.00 89.10 ± 2.41 Approach B 82.27 ± 5.80 85.83 ± 4.30 Approach A 87.43 ± 4.02 90.78 ± 2.42 (a) Approach A (b) Approach B (c) Imorphics Fig. 3. Coronal view of the medial compartment of OAI patient Id #9892765: Results of Approach A (a), Approach B (b), and reference segmentation of Imorphics (c). esize that the increase in accuracy was due to both, the full 3D character of our CNN as well as the additional amount of training data. Although the segmentation accuracy on Dataset Imorphics illustrates the validity of 3D CNNs for the task of segmentation, over-estimation of the cartilage volume compared to Chondrometrics is severe, hence, limiting an application for automated cartilage analysis.

Approach B yielded a significantly higher agreement in volumetry on Dataset Chondrometrics, with a slight decrease of the segmentation accuracy for the data of Dataset Imorphics. As shown in Fig. 3 the specificity is slightly increased in areas of degenerated cartilage, which is hard to differentiate from healthy tissue. In future, we will apply our accurate volumetry method for the entire OAI database, to provide segmentation masks and volumetry data, that can be used as a gold standard for longitudinal assessment of OA progression as well as for the development of novel methods. References 1. P.G. Conaghan, M. Kloppenburg, G. Schett, J.W.J. Bijlsma, et al. Osteoarthritis research priorities: a report from a EULAR ad hoc expert committee. Annals of the Rheumatic Diseases, pages 1 4, 2014. 2. P.G. Conaghan, D.J. Hunter, J.F. Maillefert, W.M. Reichmann, and E. Losina. Summary and recommendations of the OARSI FDA osteoarthritis assessment of structural change working group. Osteoarthritis and Cartilage, 19(5):606 610, 2011. 3. F. Eckstein, D. Burstein, and T.M. Link. Quantitative MRI of cartilage and bone: Degenerative changes in osteoarthritis. NMR in Biomedicine, 19(7):822 854, 2006. 4. F. Eckstein, F. Cicuttini, J.-P. Raynauld, J.C. Waterton, and C. Peterfy. Magnetic Resonance Imaging (MRI) of articular cartilage in knee osteoarthritis (OA): Morphological assessment. Osteoarthritis and Cartilage, 14:46 75, 2006. 5. F. Eckstein, S. Maschek, W. Wirth, M. Hudelmaier, et al. One year change of knee cartilage morphology in the first release of participants from the Osteoarthritis Initiative Progression Subcohort-assosiation with sex, Body Mass Index, symptoms, and radiographic OA status. Annals of the Rheumatic Diseases, 68(5):1 16, 2008.

6. J. Hohe, G. Ateshian, M. Reiser, K.-H. Englmeier, and F. Eckstein. Surface size, curvature analysis, and assessment of knee joint incongruity with MRI in vivo. Magnetic Resonance in Medicine, 47(3):554 561, 2002. 7. OAI. https://oai.nih.gov, 2019. 8. U. John, E. Hensel, J. Lüdemann, M. Piek, et al. Study of Health In Pomerania (SHIP): a health examination survey in an east German region: objectives and design. Sozial-und Präventivmedizin, 46(3):186 194, 2001. 9. OVERLOAD-PrevOP: Verständnis und Prävention der Progression der primären Osteoarthrose. https://gesundheitsforschung-bmbf.de/de/overload-prevop- Praevention-Progression-primaere-Osteoarthrose.php, 2019. 10. H. Graichen, R. v. Eisenhart-Rothe, T. Vogl, K.-H. Englmeier, and F. Eckstein. Quantitative assessment of cartilage status in osteoarthritis by quantitative magnetic resonance imaging: Technical validation for use in analysis of cartilage volume and further morphologic parameters. Arthritis & Rheumatism, 50(3):811 816, 2004. 11. E.B. Dam, M. Lillholm, J. Marques, and M. Nielsen. Automatic segmentation of high-and low-field knee MRIs using knee image quantification with data from the Osteoarthritis Initiative. Journal of Medical imaging, 2(2):024001, 2015. 12. A. Tack, A. Mukhopadhyay, and S. Zachow. Knee menisci segmentation using convolutional neural networks: Data from the Osteoarthritis Initiative. Osteoarthritis and Cartilage, 26(5):680 688, 2018. 13. F. Ambellan, A. Tack, M. Ehlke, and S. Zachow. Automated segmentation of knee bone and cartilage combining statistical shape knowledge and convolutional neural networks: Data from the Osteoarthritis Initiative. In International Conference on Medical Imaging with Deep Learning, 2018. 14. A. Raj, S. Vishwanathan, B. Ajani, K. Krishnan, and H. Agarwal. Automatic knee cartilage segmentation using fully volumetric convolutional neural networks for evaluation of osteoarthritis. In IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018), pages 851 854, 2018.

15. O. Ronneberger, P. Fischer, and T. Brox. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention, pages 234 241, 2015.