研究目的
To address the limitations of existing RGB-D object recognition methods by proposing a novel Canonical Correlation Analysis (CCA)-based multi-view Convolutional Neural Network (CNNs) framework that effectively exploits mutual relationships between color and depth views.
研究成果
The proposed CCA-based multi-view CNNs architecture effectively identifies the associations between different perspectives of a same shape model, achieving better performance than state-of-the-art approaches. The ACCAR model is more efficient and can reach a further higher recognition accuracy than CCAR method when sufficient epochs are allowed.
研究不足
The computational expense of solving CCA optimization directly and the need for all or a large batch of training samples to compute their covariance matrices, inverse square roots, and matrix singular value decompositions (SVDs).