Skip to content

PyTorch implementation of paper "HDA-Net: Horizontal Deformable Attention Network for Stereo Matching" (ACM MM2021 Oral)

Notifications You must be signed in to change notification settings

baopingli/HDANet

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HDA-Net: Horizontal Deformable Attention Network for Stereo Matching

Official implementation of paper "HDA-Net: Horizontal Deformable Attention Network for Stereo Matching" (ACM MM2021 Oral)


Paper: [ACM MM2021]

Authors: Qi Zhang, Xuesong Zhang, Baoping Li, Yuhong Chen, Anlong Ming

Beijing University of Posts and Telecommunications


[TOC]

Introduction

Stereo matching is a fundamental and challenging task which has various applications in autonomous driving, dense reconstruction and other depth related tasks. Contextual information with discriminative features is crucial for accurate stereo matching in the ill-posed regions (textureless, occlusion, etc.). In this paper, we propose an efficient horizontal attention module to adaptively capture the global correspondence clues. Compared with the popular non-local attention, our horizontal attention is more effective for stereo matching with better performance and lower consumption of computation and memory. We further introduce a deformable module to refine the contextual information in the disparity discontinuous areas such as the boundary of objects. Learning-based method is adopted to construct the cost volume by concatenating the features of two branches. In order to offer explicit similarity measure to guide learning-based volume for obtaining more reasonable unimodal matching cost distribution we additionally combine the learning-based volume with the improved zero-centered group-wise correlation volume. Finally, we regularize the 4D joint cost volume by a 3D CNN module and generate the final output by disparity regression. The experimental results show that our proposed HDA-Net achieves the state-of-the-art performance on the Scene Flow dataset and obtains competitive performance on the KITTI datasets compared with the relevant networks.

Architecture of the proposed HDA-Net

Architecture of the proposed HDA-Net

The details of HDA module

The details of HDA module

Citation

If you find our work helpful to your research, please cite our paper:

@inbook{10.1145/3474085.3475273,
author = {Zhang, Qi and Zhang, Xuesong and Li, Baoping and Chen, Yuzhong and Ming, Anlong},
title = {HDA-Net: Horizontal Deformable Attention Network for Stereo Matching},
year = {2021},
isbn = {9781450386517},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3474085.3475273},
booktitle = {Proceedings of the 29th ACM International Conference on Multimedia},
pages = {32–40},
numpages = {9}
}

Usage

Dependencies

  • Python

  • Pytorch

  • torchvision

  • KITTI Stereo

  • Scene Flow

Train





Evaluation





Pretrained Model

Test on your own stereo pair




Results

Evaluation of HDANet with different settings

Ablation

Results on KITTI 2015 leaderboard

kitti2015

Results on KITTI 2012 leaderboard

kitti2012

Qualitative results

Left image

Predicted disparity

Error

About

PyTorch implementation of paper "HDA-Net: Horizontal Deformable Attention Network for Stereo Matching" (ACM MM2021 Oral)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published