Source author record

Marc Van Droogenbroeck

Marc Van Droogenbroeck appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision eess.IV astro-ph.IM Machine Learning

Catalog footprint

What is connected

5works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

An exploration of the performances achievable by combining unsupervised background subtraction algorithms

Background subtraction (BGS) is a common choice for performing motion detection in video. Hundreds of BGS algorithms are released every year, but combining them to detect motion remains largely unexplored. We found that combination strategies allow to capitalize on this massive amount of available BGS algorithms, and offer significant space for performance improvement. In this paper, we explore sets of performances achievable by 6 strategies combining, pixelwise, the outputs of 26 unsupervised BGS algorithms, on the CDnet 2014 dataset, both in the ROC space and in terms of the F1 score. The chosen strategies are representative for a large panel of strategies, including both deterministic and non-deterministic ones, voting and learning. In our experiments, we compare our results with the state-of-the-art combinations IUTIS-5 and CNN-SFC, and report six conclusions, among which the existence of an important gap between the performances of the individual algorithms and the best performances achievable by combining them.

preprint2022arXiv

M4Depth: Monocular depth estimation for autonomous vehicles in unseen environments

Estimating the distance to objects is crucial for autonomous vehicles when using depth sensors is not possible. In this case, the distance has to be estimated from on-board mounted RGB cameras, which is a complex task especially in environments such as natural outdoor landscapes. In this paper, we present a new method named M4Depth for depth estimation. First, we establish a bijective relationship between depth and the visual disparity of two consecutive frames and show how to exploit it to perform motion-invariant pixel-wise depth estimation. Then, we detail M4Depth which is based on a pyramidal convolutional neural network architecture where each level refines an input disparity map estimate by using two customized cost volumes. We use these cost volumes to leverage the visual spatio-temporal constraints imposed by motion and to make the network robust for varied scenes. We benchmarked our approach both in test and generalization modes on public datasets featuring synthetic camera trajectories recorded in a wide variety of outdoor scenes. Results show that our network outperforms the state of the art on these datasets, while also performing well on a standard depth estimation benchmark. The code of our method is publicly available at https://github.com/michael-fonder/M4Depth.

preprint2020arXiv

A Context-Aware Loss Function for Action Spotting in Soccer Videos

In video understanding, action spotting consists in temporally localizing human-induced events annotated with single timestamps. In this paper, we propose a novel loss function that specifically considers the temporal context naturally present around each action, rather than focusing on the single annotated frame to spot. We benchmark our loss on a large dataset of soccer videos, SoccerNet, and achieve an improvement of 12.8% over the baseline. We show the generalization capability of our loss for generic activity proposals and detection on ActivityNet, by spotting the beginning and the end of each activity. Furthermore, we provide an extended ablation study and display challenging cases for action spotting in soccer videos. Finally, we qualitatively illustrate how our loss induces a precise temporal understanding of actions and show how such semantic knowledge can be used for automatic highlights generation.

preprint2020arXiv

Multimodal and multiview distillation for real-time player detection on a football field

Monitoring the occupancy of public sports facilities is essential to assess their use and to motivate their construction in new places. In the case of a football field, the area to cover is large, thus several regular cameras should be used, which makes the setup expensive and complex. As an alternative, we developed a system that detects players from a unique cheap and wide-angle fisheye camera assisted by a single narrow-angle thermal camera. In this work, we train a network in a knowledge distillation approach in which the student and the teacher have different modalities and a different view of the same scene. In particular, we design a custom data augmentation combined with a motion detection algorithm to handle the training in the region of the fisheye camera not covered by the thermal one. We show that our solution is effective in detecting players on the whole field filmed by the fisheye camera. We evaluate it quantitatively and qualitatively in the case of an online distillation, where the student detects players in real time while being continuously adapted to the latest video conditions.

preprint2014arXiv

The VORTEX project: first results and perspectives

(abridged) Vortex coronagraphs are among the most promising solutions to perform high contrast imaging at small angular separations. They feature a very small inner working angle, a clear 360 degree discovery space, have demonstrated very high contrast capabilities, are easy to implement on high-contrast imaging instruments, and have already been extensively tested on the sky. Since 2005, we have been designing, developing and testing an implementation of the charge-2 vector vortex phase mask based on concentric subwavelength gratings, referred to as the Annular Groove Phase Mask (AGPM). Science-grade mid-infrared AGPMs were produced in 2012 for the first time, using plasma etching on synthetic diamond substrates. They have been validated on a coronagraphic test bench, showing broadband peak rejection up to 500:1 in the L band, which translates into a raw contrast of about $6\times 10^{-5}$ at $2 λ/D$. Three of them have now been installed on world-leading diffraction-limited infrared cameras (VLT/NACO, VLT/VISIR and LBT/LMIRCam). During the science verification observations with our L-band AGPM on NACO, we observed the beta Pictoris system and obtained unprecedented sensitivity limits to planetary companions down to the diffraction limit ($0.1''$). More recently, we obtained new images of the HR 8799 system at L band during the AGPM first light on LMIRCam. After reviewing these first results obtained with mid-infrared AGPMs, we will discuss the short- and mid-term goals of the on-going VORTEX project, which aims to improve the performance of our vortex phase masks for future applications on second-generation high-contrast imagers and on future extremely large telescopes (ELTs).