Graph explorer

Towards Streaming Perception

Embodied perception refers to the ability of an autonomous agent to perceive its environment so that it can (re)act. The responsiveness of the agent is largely governed by latency of its processing pipeline. While past work has studied the algorithmic trade-off between latency and accuracy, there has not been a clear metric to compare different methods along the Pareto optimal latency-accuracy curve. We point out a discrepancy between standard offline evaluation and real-time applications: by the time an algorithm finishes processing a particular frame, the surrounding world has changed. To these ends, we present an approach that coherently integrates latency and accuracy into a single metric for real-time online perception, which we refer to as "streaming accuracy". The key insight behind this metric is to jointly evaluate the output of the entire perception stack at every time instant, forcing the stack to consider the amount of streaming data that should be ignored while computation is occurring. More broadly, building upon this metric, we introduce a meta-benchmark that systematically converts any single-frame task into a streaming perception task. We focus on the illus

5 nodes5 linksoverview previewTowards Streaming Perception
5 nodes5 links
Towards Streaming Perception5 visible / 5 total nodes / 8 links
Co-authorshipCo-authorshipCo-authorshipAuthorshipWorks onAuthorshipAuthorshipTopic signalWTowards Streaming Perceptionpreprint / 2020AMengtian LiResearcherAYu-Xiong WangResearcherADeva RamananResearcherTComputer Vision30606 works
PaperSignal 104 links

Towards Streaming Perception

preprint / 2020

Open