Skip to content

siatheindochinese/OVE6D-pose

 
 

Repository files navigation

OVE6D: Object Viewpoint Encoding for Depth-based 6D Object Pose Estimation (CVPR 2022

  • Added real-time inferencing.

Real-Time Inference

Real-time inference on an Intel Realsense D455 Depth Camera: demo_vid

demo_vid

The model follows a 2-stage pipeline:

  • (1) object detection and instance segmentation using Detectron2
  • (2) OVE6D pipeline (viewpoint-encoding, codebook lookup, in-plane rotation regression, centroid refinement)

While the OVE6D viewpoint encoder does not need to be retrained for new objects, the segmentation pipeline that comes before OVE6D needs to be trained for desired objects. Detectron2 is utilized due to its off-the-shelf nature, following the original authors use of Detectron2 for the LineMod dataset.

Setup

Refer to the original repository for setup instructions.

Training for Custom Dataset

  • to-be-added

Acknowledgement

This fork uses work from Detectron2 and OVE6D, neither of which I am originally involved in.

About

OVE6D, modified for private use-case.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 82.7%
  • Python 17.3%