研究目的
To develop a method for simultaneous multispectral foreground segmentation and stereo registration in video sequences, addressing challenges like low contrast and parallax effects.
研究成果
The proposed method effectively integrates multispectral data for simultaneous segmentation and registration, outperforming state-of-the-art monocular and supervised methods. It achieves robust performance in low contrast conditions and provides temporal coherence. Future work could include stronger constraints for coherence, explicit occlusion handling, and generalization to instance-level segmentation.
研究不足
The method assumes calibrated stereo pairs and may not handle large stereo baselines well. Occlusion handling is not explicit, and performance can degrade in highly cluttered scenes or with very low contrast. Computational complexity increases with temporal pipeline depth.
1:Experimental Design and Method Selection:
The method involves alternating minimization of two energy functions (stereo registration and segmentation) using move-making algorithms on conditional random fields. It integrates shape and appearance cues with dynamic priors and temporal coherence through higher-order terms.
2:Sample Selection and Data Sources:
Multiple multispectral datasets are used, including a modified version of the VAP dataset by Palmero et al. (2016), the dataset by Bilodeau et al. (2014), and a newly captured RGB-LWIR dataset. Frames are selected at 2 Hz for evaluation.
3:List of Experimental Equipment and Materials:
A stereo pair consisting of a Kinect v2 for Windows (Full HD resolution) and a FLIR A40 LWIR camera (QVGA resolution), calibrated using a foam core checkerboard pattern heated with halogen lamps. Software includes OpenCV for calibration and OpenGM library for graphical models.
4:Experimental Procedures and Operational Workflow:
Initialization with monocular segmentation masks, followed by iterative minimization of energy functions. Optical flow is computed for temporal coherence. Processing is done on a 3.7 GHz Intel i7-8700K processor.
5:7 GHz Intel i7-8700K processor.
Data Analysis Methods:
5. Data Analysis Methods: Evaluation using binary classification metrics (Precision, Recall, F1 score) for segmentation and disparity error metrics (percentage of pixels with errors >1px, >2px, >4px, average error) for registration. Comparisons with baseline methods like St-Charles et al. (2016), GrabCut, and sliding window approaches.
独家科研数据包,助您复现前沿成果,加速创新突破
获取完整内容