PCANet-Based Structural Representation for Nonrigid Multimodal Medical Image Registration

Xingxing Zhu; Mingyue Ding; Tao Huang; Xiaomeng Jin; Xuming Zhang

doi:10.3390/s18051477

PCANet-Based Structural Representation for Nonrigid Multimodal Medical Image Registration

Sensors (Basel). 2018 May 8;18(5):1477. doi: 10.3390/s18051477.

Authors

Xingxing Zhu¹, Mingyue Ding², Tao Huang³, Xiaomeng Jin⁴, Xuming Zhang⁵

Affiliations

¹ Department of Biomedical Engineering, School of Life Science and Technology, Ministry of Education Key Laboratory of Molecular Biophysics, Huazhong University of Science and Technology, No 1037, Luoyu Road, Wuhan 430074, China. D201677473@hust.edu.cn.
² Department of Biomedical Engineering, School of Life Science and Technology, Ministry of Education Key Laboratory of Molecular Biophysics, Huazhong University of Science and Technology, No 1037, Luoyu Road, Wuhan 430074, China. myding@hust.edu.cn.
³ State Food and Drug Administration Hubei Center for Medical Devices Quality Supervision and Testing, No 507, Hi-Tech Avenue, Donghu Hi-Tech Development District, Wuhan 430075, China. htao2010@126.com.
⁴ Department of Biomedical Engineering, School of Life Science and Technology, Ministry of Education Key Laboratory of Molecular Biophysics, Huazhong University of Science and Technology, No 1037, Luoyu Road, Wuhan 430074, China. xmjin@hust.edu.cn.
⁵ Department of Biomedical Engineering, School of Life Science and Technology, Ministry of Education Key Laboratory of Molecular Biophysics, Huazhong University of Science and Technology, No 1037, Luoyu Road, Wuhan 430074, China. zxmboshi@hust.edu.cn.

Abstract

Nonrigid multimodal image registration remains a challenging task in medical image processing and analysis. The structural representation (SR)-based registration methods have attracted much attention recently. However, the existing SR methods cannot provide satisfactory registration accuracy due to the utilization of hand-designed features for structural representation. To address this problem, the structural representation method based on the improved version of the simple deep learning network named PCANet is proposed for medical image registration. In the proposed method, PCANet is firstly trained on numerous medical images to learn convolution kernels for this network. Then, a pair of input medical images to be registered is processed by the learned PCANet. The features extracted by various layers in the PCANet are fused to produce multilevel features. The structural representation images are constructed for two input images based on nonlinear transformation of these multilevel features. The Euclidean distance between structural representation images is calculated and used as the similarity metrics. The objective function defined by the similarity metrics is optimized by L-BFGS method to obtain parameters of the free-form deformation (FFD) model. Extensive experiments on simulated and real multimodal image datasets show that compared with the state-of-the-art registration methods, such as modality-independent neighborhood descriptor (MIND), normalized mutual information (NMI), Weber local descriptor (WLD), and the sum of squared differences on entropy images (ESSD), the proposed method provides better registration performance in terms of target registration error (TRE) and subjective human vision.

Keywords: PCANet; medical image registration; similarity metric; structural representation; target registration error.