Introduction

AUTHOR=Wang Wenju , Wang Xiaolin , Chen Gang , Zhou Haoran 

TITLE=Multi-view SoftPool attention convolutional networks for 3D model classification

JOURNAL=Frontiers in Neurorobotics

VOLUME=Volume 16 - 2022

YEAR=2022

URL=https://www.frontiersin.org/journals/neurorobotics/articles/10.3389/fnbot.2022.1029968

DOI=10.3389/fnbot.2022.1029968

ISSN=1662-5218

ABSTRACT=<sec><title>Introduction</title><p>Existing multi-view-based 3D model classification methods have the problems of insufficient view refinement feature extraction and poor generalization ability of the network model, which makes it difficult to further improve the classification accuracy. To this end, this paper proposes a multi-view SoftPool attention convolutional network for 3D model classification tasks.</p></sec><sec><title>Methods</title><p>This method extracts multi-view features through ResNest and adaptive pooling modules, and the extracted features can better represent 3D models. Then, the results of the multi-view feature extraction processed using SoftPool are used as the Query for the self-attentive calculation, which enables the subsequent refinement extraction. We then input the attention scores calculated by Query and Key in the self-attention calculation into the mobile inverted bottleneck convolution, which effectively improves the generalization of the network model. Based on our proposed method, a compact 3D global descriptor is finally generated, achieving a high-accuracy 3D model classification performance.</p></sec><sec><title>Results</title><p>Experimental results showed that our method achieves 96.96% OA and 95.68% AA on ModelNet40 and 98.57% OA and 98.42% AA on ModelNet10.</p></sec><sec><title>Discussion</title><p>Compared with a multitude of popular methods, our algorithm model achieves the state-of-the-art classification accuracy.</p></sec>