Robotic Vision Evaluation and Benchmarking

This project aimed to develop new standardised benchmark task and evaluation metrics for robotic scene understanding. To aid future research in robotic vision, the project team developed and published a comprehensive software suite that allows users to control robots in both simulation and reality, and evaluate their performance on a variety of relevant tasks.

Project Leaders

Team Members

Project Aim

This project aimed to develop new standardised benchmark task and evaluation metrics for robotic scene understanding. The goal for 2020 was to create a new, annual robotic scene understanding competition to be organised at a leading computer vision or robotics conference. The aim was to recreate for robotic vision, the positive effects that competitions have had for the advances in computer vision and deep learning. We furthermore wanted to continue running our Probabilistic Object Detection challenge created in 2019, and release the BenchBot application programming interface (API), a software suite allowing easy robotic vision evaluation in simulation and reality.

Key Results

The project team released a new robotic vision challenge in 2020, the first for Scene Understanding (Semantic SLAM and Scene Change Detection). The task in this challenge is to explore an unknown indoor environment and build a detailed map containing all the objects in the environment. The challenge requires a robot to map apartments, office spaces or cluttered kitchen environments. In collaboration with Google AI, Nvidia, Facebook AI Research, and other international partner universities, we will present this new challenge at the conference for Computer Vision and Pattern Recognition (CVPR), the leading computer vision conference, in 2021.

Our Scene Understanding benchmark challenge builds on our new BenchBot API, a software suite that allows users to control robots in simulation as well as in reality and evaluate them for important robotic vision tasks. BenchBot provides a simple software interface to receive sensor data (including RGB and depth images) from a robot, and send motion commands to the robot. With only a few lines of Python code, the user can successfully control a robot based on its sensor feedback. Importantly, exactly the same code can be executed on a simulated robot in a high-fidelity simulation environment, and on a real robot in our lab. Users provide their code in a Docker container and BenchBot handles the execution on either the simulated or real robot platform.

We also organised the 3^rd iteration of the Probabilistic Object Detection Challenge and organised successful workshops at the International Conference on Robotics and Automation (ICRA) and the European Conference on Computer Vision (ECCV).

The team also developed an improved object detector, reviewed existing semantic SLAM algorithms, and ran them as baselines on the new challenge dataset.

2020 Annual Report

Robotic Vision Evaluation and Benchmarking

Project Leaders

Niko Sünderhauf

Feras Dayoub

Team Members

David Hall

Haoyang Zhang

Suman Bista

Rohan Smith

Ben Talbot

Project Aim

Key Results