PDC-Net+

Enhanced Probabilistic Dense Correspondence Network

TPAMI 2023
Prune Truong Martin Danelljan Radu Timofte Luc Van Gool
ETH Zurich - Computer Vision Lab
Arxiv Code

Abstract

Establishing robust and accurate correspondences between a pair of images is a long-standing computer vision problem with numerous applications. While classically dominated by sparse methods, emerging dense approaches offer a compelling alternative paradigm that avoids the keypoint detection step. However, dense flow estimation is often inaccurate in the case of large displacements, occlusions, or homogeneous regions. In order to apply dense methods to real-world applications, such as pose estimation, image manipulation, or 3D reconstruction, it is therefore crucial to estimate the confidence of the predicted matches. We propose the Enhanced Probabilistic Dense Correspondence Network, PDC-Net+, capable of estimating accurate dense correspondences along with a reliable confidence map. We develop a flexible probabilistic approach that jointly learns the flow prediction and its uncertainty. In particular, we parametrize the predictive distribution as a constrained mixture model, ensuring better modelling of both accurate flow predictions and outliers. Moreover, we develop an architecture and an enhanced training strategy tailored for robust and generalizable uncertainty prediction in the context of self-supervised training. Our approach obtains state-of-the-art results on multiple challenging geometric matching and optical flow datasets. We further validate the usefulness of our probabilistic confidence estimation for the tasks of pose estimation, 3D reconstruction, image-based localization, and image retrieval.

Visual Results


Aligning indoor images from ScanNet

We compute the flow field from the reference (middle) to the query (left). We plot the 1000 top confident matches as well. The warped query is represented on the right (and should resemble the middle). Only the regions for which the matches were predicted as confident are visible.






Aligning video object segmentation data from DAVIS

Here, we warp the query images toward the reference images. Our approach PDC-Net+ also predicts a confidence mask along with the dense correspondences. We show the warped query only in the estimated confident regions.


Citation

If you want to cite our work, please use:

@article{pdcnet+,
  author    = {Prune Truong and
               Martin Danelljan and
               Radu Timofte and
               Luc Van Gool},
  title     = {PDC-Net+: Enhanced Probabilistic Dense Correspondence Network},
  booktitle = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)},
  year      = {2023},
  url       = {https://arxiv.org/abs/2109.13912}
}