Category Level Object Pose Estimation via Neural Analysis-by-Synthesis
2020
Conference Paper
avg
Many object pose estimation algorithms rely on the analysis-by-synthesis framework which requires explicit representations of individual object instances. In this paper we combine a gradient-based fitting procedure with a parametric neural image synthesis module that is capable of implicitly representing the appearance, shape and pose of entire object categories, thus rendering the need for explicit CAD models per object instance unnecessary. The image synthesis network is designed to efficiently span the pose configuration space so that model capacity can be used to capture the shape and local appearance (i.e., texture) variations jointly. At inference time the synthesized images are compared to the target via an appearance based loss and the error signal is backpropagated through the network to the input parameters. Keeping the network parameters fixed, this allows for iterative optimization of the object pose, shape and appearance in a joint manner and we experimentally show that the method can recover orientation of objects with high accuracy from 2D images alone. When provided with depth measurements, to overcome scale ambiguities, the method can accurately recover the full 6DOF pose successfully.
Author(s): | Xu Chen and Zijian Dong and Jie Song and Andreas Geiger and Otmar Hilliges |
Book Title: | Computer Vision – ECCV 2020 |
Volume: | 26 |
Pages: | 139--156 |
Year: | 2020 |
Month: | August |
Series: | Lecture Notes in Computer Science, 12371 |
Editors: | Vedaldi, Andrea and Bischof, Horst and Brox, Thomas and Frahm, Jan-Michael |
Publisher: | Springer |
Department(s): | Autonomous Vision |
Bibtex Type: | Conference Paper (inproceedings) |
Paper Type: | Conference |
DOI: | 10.1007/978-3-030-58574-7_9 |
Event Name: | 16th European Conference on Computer Vision (ECCV 2020) |
Event Place: | Glasgow |
Address: | Cham |
ISBN: | 978-3-030-58573-0 |
State: | Published |
Links: |
Project Page
|
Attachments: |
pdf
suppmat |
BibTex @inproceedings{Chen2020ECCV, title = {Category Level Object Pose Estimation via Neural Analysis-by-Synthesis}, author = {Chen, Xu and Dong, Zijian and Song, Jie and Geiger, Andreas and Hilliges, Otmar}, booktitle = {Computer Vision – ECCV 2020}, volume = {26}, pages = {139--156}, series = {Lecture Notes in Computer Science, 12371}, editors = {Vedaldi, Andrea and Bischof, Horst and Brox, Thomas and Frahm, Jan-Michael}, publisher = {Springer}, address = {Cham}, month = aug, year = {2020}, doi = {10.1007/978-3-030-58574-7_9}, month_numeric = {8} } |