A Deep Learning Method for 3D Object Classification and Retrieval Using the Global Point Signature Plus and Deep Wide Residual Network

Long Hoang; Suk-Hwan Lee; Ki-Ryong Kwon

doi:10.3390/s21082644

A Deep Learning Method for 3D Object Classification and Retrieval Using the Global Point Signature Plus and Deep Wide Residual Network

Sensors (Basel). 2021 Apr 9;21(8):2644. doi: 10.3390/s21082644.

Authors

Long Hoang¹, Suk-Hwan Lee², Ki-Ryong Kwon³

Affiliations

¹ Department of Artificial Intelligence Convergence, Pukyong National University, Busan 48513, Korea.
² Department of Computer Engineering, Dong-A University, Busan 49315, Korea.
³ Department of IT Convergence and Application Engineering, Pukyong National University, Busan 48513, Korea.

Abstract

A vital and challenging task in computer vision is 3D Object Classification and Retrieval, with many practical applications such as an intelligent robot, autonomous driving, multimedia contents processing and retrieval, and augmented/mixed reality. Various deep learning methods were introduced for solving classification and retrieval problems of 3D objects. Almost all view-based methods use many views to handle spatial loss, although they perform the best among current techniques such as View-based, Voxelization, and Point Cloud methods. Many views make network structure more complicated due to the parallel Convolutional Neural Network (CNN). We propose a novel method that combines a Global Point Signature Plus with a Deep Wide Residual Network, namely GPSP-DWRN, in this paper. Global Point Signature Plus (GPSPlus) is a novel descriptor because it can capture more shape information of the 3D object for a single view. First, an original 3D model was converted into a colored one by applying GPSPlus. Then, a 32 × 32 × 3 matrix stored the obtained 2D projection of this color 3D model. This matrix was the input data of a Deep Residual Network, which used a single CNN structure. We evaluated the GPSP-DWRN for a retrieval task using the Shapnetcore55 dataset, while using two well-known datasets-ModelNet10 and ModelNet40 for a classification task. Based on our experimental results, our framework performed better than the state-of-the-art methods.

Keywords: 3D object classification and retrieval; Deep Wide Residual Network; Global Point Signature Plus; multimedia contents processing and retrieval.