研究目的
Investigating the application of attention mechanisms in deep neural networks for pixel-wise classification of very high-resolution remote sensing images to improve classification accuracy by focusing on key features and suppressing irrelevant information.
研究成果
The proposed neural network, incorporating attention mechanisms, achieves competitive accuracy in pixel-wise classification of VHRRS images. It effectively highlights discriminative features and suppresses irrelevant information, proving its effectiveness compared to other state-of-the-art methods. The study also demonstrates the importance of network structure and the volume of training data on classification performance.
研究不足
The training time for the proposed method is considerable due to the complexity of the network and the large amount of floating-point arithmetic involved. Additionally, the method's performance is influenced by the volume of training data, with deeper networks requiring more data to achieve superior classification results.
1:Experimental Design and Method Selection:
The study proposes a novel neural network incorporating two attention mechanisms (control gate and feedback attention) for pixel-wise classification of VHRRS images. The network is designed with mask and trunk branches to recalibrate feature extraction and focus on informative areas.
2:Sample Selection and Data Sources:
The method is tested on images from BJ-02, GF-02, Geoeye, and Quickbird satellites, covering various ground objects like residential areas, water bodies, vegetation, roads, and bare land.
3:List of Experimental Equipment and Materials:
The experiments were conducted on a computer with a 16.0 GB RAM Intel? Xeon? CPU E3-1220v5@3.00 GHz processor and an NVIDIA Quadro K620 graphic card with CUDA version 8.0.4 for acceleration.
4:0 GB RAM Intel? Xeon? CPU E3-1220v5@00 GHz processor and an NVIDIA Quadro K620 graphic card with CUDA version 4 for acceleration.
Experimental Procedures and Operational Workflow:
4. Experimental Procedures and Operational Workflow: The network is trained with randomly selected pixels as training samples, using patches around these pixels for feature extraction. The network's performance is evaluated based on overall accuracy (OA) and Kappa coefficients.
5:Data Analysis Methods:
The effectiveness of the proposed method is compared with state-of-the-art methods using OA, Kappa, producer, and user accuracies as evaluation metrics.
独家科研数据包,助您复现前沿成果,加速创新突破
获取完整内容