研究目的
Proposing a generic methodology for the semi-automatic generation of reliable position annotations for evaluating multi-camera people-trackers on large video data sets.
研究成果
The proposed semi-automatic annotation methodology significantly reduces the manual labor required for generating reliable position annotations in multi-camera video sequences, achieving high accuracy and reliability. The framework is generic and can be extended with additional trackers, offering a scalable solution for evaluating multi-camera people-trackers on large video data sets.
研究不足
The proposed approach is currently limited to single target tracking, with an exploratory study for the multi-target case suggesting future work.
1:Experimental Design and Method Selection:
The methodology involves estimating a consensus tracking result from multiple existing trackers and people detectors, classifying it as reliable or not, and verifying a small subset of the data with insufficient reliability by a human.
2:Sample Selection and Data Sources:
A data set of ~6 h captured by 4 cameras, featuring a person in a holiday flat performing various activities.
3:List of Experimental Equipment and Materials:
Four cameras for video capture, multiple trackers and people detectors for automatic annotation.
4:Experimental Procedures and Operational Workflow:
Automatic computation of annotation data, classification of reliability, human verification of unreliable tracks, and comprehensive visual inspection.
5:Data Analysis Methods:
Comparison of automatically annotated frames to ground truth, estimation of tracking accuracy, and evaluation of the annotation procedure.
独家科研数据包,助您复现前沿成果,加速创新突破
获取完整内容