Towards Reaching Human Performance in Pedestrian Detection

IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):973-986. doi: 10.1109/TPAMI.2017.2700460. Epub 2017 May 2.

Abstract

Encouraged by the recent progress in pedestrian detection, we investigate the gap between current state-of-the-art methods and the "perfect single frame detector". We enable our analysis by creating a human baseline for pedestrian detection (over the Caltech pedestrian dataset). After manually clustering the frequent errors of a top detector, we characterise both localisation and background-versus-foreground errors. To address localisation errors we study the impact of training annotation noise on the detector performance, and show that we can improve results even with a small portion of sanitised training data. To address background/foreground discrimination, we study convnets for pedestrian detection, and discuss which factors affect their performance. Other than our in-depth analysis, we report top performance on the Caltech pedestrian dataset, and provide a new sanitised set of training and test annotations.

MeSH terms

  • Cluster Analysis
  • Databases, Factual
  • Humans
  • Image Processing, Computer-Assisted / methods*
  • Pattern Recognition, Automated / methods*
  • Pedestrians / classification*
  • Video Recording / methods*