arxiv: v1 [cs.cv] 17 Nov 2017

Similar documents
arxiv: v1 [cs.cv] 9 Sep 2017

arxiv: v1 [cs.cv] 13 Jul 2018

arxiv: v1 [cs.cv] 17 Aug 2017

arxiv: v1 [cs.cv] 30 Dec 2018

Handling Unbalanced Data in Deep Image Segmentation

arxiv: v1 [cs.cv] 28 Feb 2018

Segmentation of Cell Membrane and Nucleus by Improving Pix2pix

Comparative Study of K-means, Gaussian Mixture Model, Fuzzy C-means algorithms for Brain Tumor Segmentation

Research Article Multiscale CNNs for Brain Tumor Segmentation and Diagnosis

A Novel Method using Convolutional Neural Network for Segmenting Brain Tumor in MRI Images

Focus, Segment and Erase: An Efficient Network for Multi-Label Brain Tumor Segmentation

POC Brain Tumor Segmentation. vlife Use Case

Differentiating Tumor and Edema in Brain Magnetic Resonance Images Using a Convolutional Neural Network

A CONVOLUTION NEURAL NETWORK ALGORITHM FOR BRAIN TUMOR IMAGE SEGMENTATION

arxiv: v2 [cs.cv] 7 Jun 2018

Clustering of MRI Images of Brain for the Detection of Brain Tumor Using Pixel Density Self Organizing Map (SOM)

arxiv: v2 [cs.cv] 8 Mar 2018

arxiv: v2 [cs.cv] 19 Dec 2017

Y-Net: Joint Segmentation and Classification for Diagnosis of Breast Biopsy Images

Deep learning approaches to medical applications

A deep learning model integrating FCNNs and CRFs for brain tumor segmentation

HHS Public Access Author manuscript Med Image Comput Comput Assist Interv. Author manuscript; available in PMC 2018 January 04.

arxiv: v1 [cs.cv] 1 Oct 2018

arxiv: v1 [cs.cv] 3 Apr 2018

Skin cancer reorganization and classification with deep neural network

Segmentation of Tumor Region from Brain Mri Images Using Fuzzy C-Means Clustering And Seeded Region Growing

Automatic brain tumor segmentation with a fast Mumford-Shah algorithm

Brain Tumor Segmentation Based on SFCM using Back Propagation Neural Network

A deep learning model integrating FCNNs and CRFs for brain tumor segmentation

Final Report: Automated Semantic Segmentation of Volumetric Cardiovascular Features and Disease Assessment

Convolutional Neural Networks for Estimating Left Ventricular Volume

Neural Network for Detecting Head Impacts from Kinematic Data. Michael Fanton, Nicholas Gaudio, Alissa Ling CS 229 Project Report

A Reliable Method for Brain Tumor Detection Using Cnn Technique

arxiv: v1 [cs.cv] 9 Oct 2018

CSE Introduction to High-Perfomance Deep Learning ImageNet & VGG. Jihyung Kil

2017 International MICCAI BraTS Challenge

Efficient Deep Model Selection

arxiv: v1 [cs.cv] 2 Aug 2017

arxiv: v1 [stat.ml] 23 Jan 2017

Improved Intelligent Classification Technique Based On Support Vector Machines

arxiv: v3 [cs.cv] 24 Jan 2017

Automatic Prostate Cancer Classification using Deep Learning. Ida Arvidsson Centre for Mathematical Sciences, Lund University, Sweden

arxiv: v1 [cs.cv] 24 Jul 2018

Recurrent Fully Convolutional Neural Networks for Multi-slice MRI Cardiac Segmentation

arxiv: v1 [cs.cv] 30 May 2018

Hierarchical Convolutional Features for Visual Tracking

Multi-attention Guided Activation Propagation in CNNs

arxiv: v3 [cs.cv] 3 Jun 2017

arxiv: v1 [cs.ai] 28 Nov 2017

Proceedings of the UGC Sponsored National Conference on Advanced Networking and Applications, 27 th March 2015

MRI Image Processing Operations for Brain Tumor Detection

Development of Novel Approach for Classification and Detection of Brain Tumor

DIAGNOSTIC CLASSIFICATION OF LUNG NODULES USING 3D NEURAL NETWORKS

Computational modeling of visual attention and saliency in the Smart Playroom

Keywords MRI segmentation, Brain tumor detection, Tumor segmentation, Tumor classification, Medical Imaging, ANN

MICCAI 2018 VISDMML Tutorial Image Analysis Meets Deep Learning and Visualization in Cardiology and Cardiac Surgery

Using Deep Convolutional Networks for Gesture Recognition in American Sign Language

arxiv: v2 [cs.cv] 22 Mar 2018

The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS)

Satoru Hiwa, 1 Kenya Hanawa, 2 Ryota Tamura, 2 Keisuke Hachisuka, 3 and Tomoyuki Hiroyasu Introduction

PathGAN: Visual Scanpath Prediction with Generative Adversarial Networks

AUTOMATIC DIABETIC RETINOPATHY DETECTION USING GABOR FILTER WITH LOCAL ENTROPY THRESHOLDING

arxiv: v1 [cs.cv] 12 Jun 2018

Lung Nodule Segmentation Using 3D Convolutional Neural Networks

arxiv: v2 [cs.cv] 3 Jun 2018

3D Deep Learning for Multi-modal Imaging-Guided Survival Time Prediction of Brain Tumor Patients

AN EFFICIENT DIGITAL SUPPORT SYSTEM FOR DIAGNOSING BRAIN TUMOR

arxiv: v1 [cs.cv] 2 Jul 2018

arxiv: v2 [cs.lg] 1 Jun 2018

Highly Accurate Brain Stroke Diagnostic System and Generative Lesion Model. Junghwan Cho, Ph.D. CAIDE Systems, Inc. Deep Learning R&D Team

Deep Supervision for Pancreatic Cyst Segmentation in Abdominal CT Scans

Proceedings of MICCAI-BRATS 2016 Multimodal Brain Tumor Image Segmentation Benchmark: Change Detection

AUTOMATIC MEASUREMENT ON CT IMAGES FOR PATELLA DISLOCATION DIAGNOSIS

Combined Radiology and Pathology Classification of Brain Tumors

Design of Palm Acupuncture Points Indicator

ANALYSIS AND DETECTION OF BRAIN TUMOUR USING IMAGE PROCESSING TECHNIQUES

Fully Convolutional Network-based Multi-Task Learning for Rectum and Rectal Cancer Segmentation

Early Diagnosis of Autism Disease by Multi-channel CNNs

arxiv: v3 [cs.cv] 12 Nov 2018

Exploratory Study on Direct Prediction of Diabetes using Deep Residual Networks

International Journal of Engineering Trends and Applications (IJETA) Volume 4 Issue 2, Mar-Apr 2017

arxiv: v1 [cs.cv] 21 Mar 2018

Cancer Cells Detection using OTSU Threshold Algorithm

Brain Tumour Detection of MR Image Using Naïve Beyer classifier and Support Vector Machine

Automatic Hemorrhage Classification System Based On Svm Classifier

Brain Tumour Diagnostic Support Based on Medical Image Segmentation

Synthesizing Missing PET from MRI with Cycle-consistent Generative Adversarial Networks for Alzheimer s Disease Diagnosis

Comparison of Two Approaches for Direct Food Calorie Estimation

A convolutional neural network to classify American Sign Language fingerspelling from depth and colour images

ITERATIVELY TRAINING CLASSIFIERS FOR CIRCULATING TUMOR CELL DETECTION

Quantifying Radiographic Knee Osteoarthritis Severity using Deep Convolutional Neural Networks

Research Article. Automated grading of diabetic retinopathy stages in fundus images using SVM classifer

BRAIN TUMOR SEGMENTATION USING K- MEAN CLUSTERIN AND ITS STAGES IDENTIFICATION

arxiv: v1 [cs.cv] 21 Jul 2017

arxiv: v3 [cs.cv] 26 May 2018

End-To-End Alzheimer s Disease Diagnosis and Biomarker Identification

Image-Based Estimation of Real Food Size for Accurate Food Calorie Estimation

Detection of Brain Tumor Using FFBP Neural Networks

As entered modern society, the interactions. A Study on Deep Learning Based Sauvegrain Method for Measurement of Puberty Bone Age

Transcription:

Segmenting Brain Tumors with Symmetry Hejia Zhang Princeton University Xia Zhu Intel Labs Theodore L. Willke Intel Labs arxiv:1711.06636v1 [cs.cv] 17 Nov 2017 Abstract We explore encoding brain symmetry into a neural network for a brain tumor segmentation task. A healthy human brain is symmetric at a high level of abstraction, and the high-level asymmetric parts are more likely to be tumor regions. Paying more attention to asymmetries has the potential to boost the performance in brain tumor segmentation. We propose a method to encode brain symmetry into existing neural networks and apply the method to a state-of-the-art neural network for medical imaging segmentation. We evaluate our symmetry-encoded network on the dataset from a brain tumor segmentation challenge and verify that the new model extracts information in the training images more efficiently than the original model. 1 Introduction Brain tumor segmentation using Magnetic Resonance Imaging (MRI) plays an important role in the diagnosis and treatment of brain tumors. Traditionally, this is done manually. However, the manual segmentation is time-consuming, prone to human error, and requires significant expertise [1]. Thus, there is an increasing interest in developing automatic brain tumor segmentation methods. Some methods [2, 3] extract features with medical domain knowledge and use traditional classifiers such as random forest and support vector machines. Recently, deep learning methods [4, 5, 6, 7, 8, 9] show promising results in this task and offer better performance in general. We observe that many of these deep learning methods can be used in other biomedical image segmentation tasks or even semantic segmentations as well. This flexibility can be an advantage but, on the other hand, it implies that these methods make little to no use of the specific properties of the brain or brain tumors. Specific properties could include the structure of normal brain tissue, the texture of brain tumors, and so on. Encoding these properties into the deep learning methods has the potential to extract features that are more relevant to the brain tumor segmentation task. Here we focus on encoding structural property about the brain. One of the universal characteristics of brain structure is the bilateral symmetry of the brain. More precisely, the left hemisphere and right hemisphere of a healthy brain are symmetric at a high level of abstraction, and the presence of any high-level asymmetry usually implies abnormality. Thus, in the brain tumor segmentation task, we would like to pay more attention to asymmetric portions, which are more likely to be tumor regions. In this work, we propose a method to encode the brain symmetry into neural networks for brain tumor segmentation. We apply this method to encode brain symmetry into one of the state-of-the-art neural networks for medical imaging segmentation. The new model and the original model is compared on a popular brain tumor segmentation challenge dataset. 2 Symmetry Encoded Neural Network Now we explore how to encode symmetry into neural networks. The basic idea is to compute the difference between the original brain image and the flipped brain image at a high level and concentrate This work was done while Hejia Zhang was an intern at Intel Labs 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA.

4!! flip copy!+!!! 4 4!! 4!!!+!!! 4 2!+ 2!+ 4!+ 8! Flipped input image Input image Output Conv 3x3, BN+ReLu Max pool 2x2 Up-conv 2x2 (Crop and) concat Conv 1x1 Diff copy 4!+ 8! (a) (b) 4!+4! Figure 1: (a) U-net used as the baseline. (b) Symmetry-encoded U-net. The flipped image branch and the original image branch share the same filter weights. k is the number of feature maps. k 4 8 16 32 Num. of Param. in U-Net 30904 121820 483940 1929332 Num. of Param. in Symmetry-encoded U-Net 35636 140488 558128 2225152 Ratio (Symm/U-Net) 1.15 1.15 1.15 1.15 Table 1: Number of parameters in the standard U-Net and the Symmetry-encoded U-Net. on the difference when extracting features. In a neural network, we implement this idea by subtracting the original and flipped image at some feature map layer. The difference is concatenated with the feature maps of the original image at that level to preserve the information of the original image as well. We can apply this idea to encode symmetry in existing neural networks for this task. However, as the brain symmetry is at a high level of abstraction, we would like to perform the subtraction after several layers of convolution and pooling. Thus, this idea is most appropriate for neural networks with consecutive convolution and pooling layers to extract features at different abstraction levels. Here we use U-Net [4] as an example. It is one of the state-of-the-art neural networks for medical imaging segmentation and has convolution and pooling layers for feature extraction. The architecture of a standard U-Net is shown in Fig. 1(a). U-Net is a fully convolutional network that can be trained end-to-end. The downward path consists of a series of convolution and pooling layers that extract features of different resolutions. The upward path contains convolution and up-convolution layers that increase the resolution of outputs. The skip shortcuts transfer information from the downward path to the upward path to help define more precise boundaries in the segmentation. The number of feature maps is doubled at each layer in the downward path and halved at each layer in the upward path as in [4]. Both the input and output have 4 channels (details discussed in 3). We encode symmetry into U-Net by using two branches that share the same filter weights in the downward path. The two branches take in the original and the flipped brain image, respectively. The difference between feature maps from these two branches is computed after several layers and concatenated with the feature maps of the original image. This symmetry-encoded U-Net is illustrated in Fig. 1(b). Note that the filter weights of two branches are shared, so the number of parameters in this network is only 1.15X of the number of parameters in the standard U-Net (shown in Tab. 1). Also, note that our model is not the same as data augmentation with flipped images. Our model sees the original and flipped images at the same time and explicitly utilizes their differences in high-level features maps, whereas the data augmentation method sees the original and flipped images, respectively, and treats them as independent training images. 2

3 Experiments We use the Multimodal Brain Tumor Segmentation Challenge (BraTS) [10] to determine if our symmetry-encoded model (Fig. 1(b)) has an improved performance compared to the standard U-Net (Fig. 1(a)) for the brain tumor segmentation task. Dataset: We use the training set of the BraTS2017 dataset [10]. It contains multimodal MRI scans of 285 subjects with brain tumors from clinics. The four modes are T1, post-contrast T1 (T1Gd), T2, and T2 fluid attenuated inversion recovery (FLAIR) [11]. Brain image from each patient of each mode is a 3D matrix of size 240 240 155. All the images are anatomically registered to the same brain template. Images are segmented manually by experts into four classes: the background or healthy part (class 0), the Gd-enhancing tumor (ET, class 1), the peritumoral edema (ED, class 2), and the necrotic and non-enhancing tumor (NCR/NET, class 3). The 285 subjects are split into two groups based on the severity of brain tumors. The more severe group called high-grade gliomas (HGG) [12] has 210 subjects, and the other called lower grade gliomas (LGG) has 75 subjects. We use the HGG group in this paper. Preprocessing: a) As our network takes 2D images, we convert each 3D brain image as 155 2D slices of size 240 240. b) We observe that on average, 99.1% of the pixels are annotated as class 0 (healthy tissue), so we throw away any image with class 0 only. We also crop the image size to 156 192 to focus on brain itself. To ensure the flipped image is aligned with the original image, the cropped image has its axis of symmetry at approximately the middle of the image. c) Each image is scaled to [0, 1] in intensity and matched to a template using histogram matching [13]. The template is the average image across all images. After the preprocessing steps, each image is of size 156 192 4 where we put each mode in a different channel. Experiment Details: We randomly partition the 210 subjects into two parts, with 80% for training and 20% for validation. The experiment is repeated 5 times with different partitioning and the averaged result is reported. We use the weighted cross-entropy loss for training. As 95.1% of the pixels are still class 0 on average even after preprocessing, the classes are still highly imbalanced. Thus, we apply median frequency balancing [14] to weight the cross-entropy loss. The idea is to penalize more when a small class gets errors. The loss is defined as C C α c t pi log(y pi ) c=1 p c i=1, where C is number of classes, p is pixel, t pi and y pi are the true binary label and the output from softmax of class i for pixel p, and α c is the class weight of class c defined as α c = median freq across all classes freq of class c. The learning algorithm is Adam with Nesterov momentum (Nadam) [15] with learning rate 0.001. The evaluation metric is the average Dice score [16, 17] across 4 classes. The Dice score is good at evaluating the quality of a segmentation result when the classes are imbalanced. For each class, the Dice score is defined as 2 T P T + P, where T and P are true and predicted binary labels, and is the area. We tried k = 4, 8, 16, 32, where k is number of feature maps of the first layer of neural network as shown in Fig. 1. We evaluate Dice scores on validation data whenever 8000 training images are processed. When the average Dice score across different tumor classes does not increase for five consecutive evaluations on validation data, an experiment is marked as converged. The number of epochs it took to converge and the average Dice scores of different tumor classes with different k on the validation set are shown in Fig. 2. An example segmentation output from the symmetry-encoded U-Net is shown in Fig. 3. We see that the symmetry-encoded U-Net consistently takes fewer epochs to converge. This suggests that the symmetry-encoded model can more efficiently extract information from the training images and eventually train the model with fewer epochs while achieving similar segmentation performance. When k is 4 or 8, the new model also gets better Dice scores compared with the U-Net. When k is 16 or 32, the two models have similar Dice scores. A possible reason is that when k is small, the new model extracts more relevant features than the baseline U-Net, but as k gets larger, the U-Net starts to get the redundancy to include both relevant features and other features. 3

Number of Epochs Takes to Converge Average Dice Score 12 10 8 6 4 2 0 10.4 8.9 8.0 5.7 5.9 4.5 3.1 2.7 k=4 k=8 k=16 k=32 0.9 0.85 0.8 0.75 0.8740.868 0.869 0.869 0.839 0.828 0.779 0.769 k=4 k=8 k=16 k=32 U-Net Symmetry-encoded U-Net U-Net Symmetry-encoded U-Net Figure 2: Results from U-Net and Symmetry-encoded U-Net on BraTS2017 dataset. The experiment is repeated 5 times based on different partitions of training and validation sets. The averaged validation results are reported here. Left: Number of epochs takes to converge. The criterion for convergence is described in the text. Right: Average Dice score across 4 classes on validation data. The Dice score is computed right after convergence. Brain Image Ground-truth Prediction Figure 3: An example of segmentation output image from the symmetry-encoded U-Net. Red: class 1; Green: class 2; Blue: class 3. 4 Discussion and Conclusion We proposed a method to encode brain symmetry information into neural networks for brain tumor segmentation. We encode symmetry into U-Net by adding a branch with flipped brain images to the downward path and subtracting between two branches at a high level of abstraction. This technique can also be extended to other neural networks with successive convolution and pooling layers. We compared the performance of the symmetry-encoded U-Net and the standard U-Net on a popular brain tumor segmentation dataset. The new model takes fewer epochs to converge and shows a higher Dice score when a small number of feature maps are used. The result suggests that encoding symmetry into the neural network helps extract features that are more relevant to the brain tumor segmentation task. In essence, our method forces the network to pay more attention to high-level asymmetric portions, which are more likely to be tumor regions. The relationship between asymmetric portions and tumor regions may be learned implicitly with the standard U-Net, but will take more effort. Our method encodes this information and spares the effort. The idea can be easily extended to 3D U-Net and also can be extended to encode task-specific medical domain knowledge into neural networks for medical applications. 4

References [1] Nelly Gordillo, Eduard Montseny, and Pilar Sobrevilla. State of the art survey on mri brain tumor segmentation. Magnetic resonance imaging, 31(8):1426 1438, 2013. [2] Mustafa Arikan, Bernhard Fröhler, and Torsten Möller. Semi-automatic brain tumor segmentation using support vector machines and interactive seed selection. In Proceedings MICCAI-BRATS Workshop 2016, pages 4 6, 2016. [3] Bi Song, Chen-Rui Chou, Xiaojing Chen, Albert Huang, and Ming-Chang Liu. Anatomy-guided brain tumor segmentation and classification. In International Workshop on Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries, pages 162 170. Springer, 2016. [4] Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention, pages 234 241. Springer, 2015. [5] PD Chang. Fully convolutional neural networks with hyperlocal features for brain tumor segmentation. In Proceedings MICCAI-BRATS Workshop, pages 4 9, 2016. [6] Adrià Casamitjana, Santi Puch, Asier Aduriz, Elisa Sayrol, and Verónica Vilaplana. 3d convolutional networks for brain tumor segmentation. Proceedings of the MICCAI Challenge on Multimodal Brain Tumor Image Segmentation (BRATS), pages 65 68, 2016. [7] Mohammad Havaei, Axel Davy, David Warde-Farley, Antoine Biard, Aaron Courville, Yoshua Bengio, Chris Pal, Pierre-Marc Jodoin, and Hugo Larochelle. Brain tumor segmentation with deep neural networks. Medical image analysis, 35:18 31, 2017. [8] Fausto Milletari, Nassir Navab, and Seyed-Ahmad Ahmadi. V-net: Fully convolutional neural networks for volumetric medical image segmentation. In 3D Vision (3DV), 2016 Fourth International Conference on, pages 565 571. IEEE, 2016. [9] Özgün Çiçek, Ahmed Abdulkadir, Soeren S Lienkamp, Thomas Brox, and Olaf Ronneberger. 3d u-net: learning dense volumetric segmentation from sparse annotation. In International Conference on Medical Image Computing and Computer-Assisted Intervention, pages 424 432. Springer, 2016. [10] Bjoern H Menze, Andras Jakab, Stefan Bauer, Jayashree Kalpathy-Cramer, Keyvan Farahani, Justin Kirby, Yuliya Burren, Nicole Porz, Johannes Slotboom, Roland Wiest, et al. The multimodal brain tumor image segmentation benchmark (brats). IEEE transactions on medical imaging, 34(10):1993 2024, 2015. [11] Donald G Mitchell. MRI principles. WB Saunders Company, 1999. [12] Lisa M DeAngelis. Brain tumors. New England Journal of Medicine, 344(2):114 123, 2001. [13] Dori Shapira, Shai Avidan, and Yacov Hel-Or. Multiple histogram matching. In Image Processing (ICIP), 2013 20th IEEE International Conference on, pages 2269 2273. IEEE, 2013. [14] David Eigen and Rob Fergus. Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture. In Proceedings of the IEEE International Conference on Computer Vision, pages 2650 2658, 2015. [15] Timothy Dozat. Incorporating nesterov momentum into adam. 2016. [16] Thorvald Sørensen. A method of establishing groups of equal amplitude in plant sociology based on similarity of species and its application to analyses of the vegetation on danish commons. Biol. Skr., 5:1 34, 1948. [17] Lee R Dice. Measures of the amount of ecologic association between species. Ecology, 26(3):297 302, 1945. 5