Inference for downstream tasks #18

ac5113 · 2024-07-22T16:43:49Z

Hello,
Do you plan to provide inference code for the downstream applications mentioned, such as 3D reconstruction from an occluded RGB image?

egeozguroglu · 2024-08-27T13:31:38Z

Hi, thanks for the interest in our work! Since we synthesize RGB images of whole objects (i.e. perform amodal completion and segmentation), our approach makes it straightforward to equip various computer vision methods with the ability to handle occlusions, beyond amodal segmentation.

After performing amodal completion, we use CLIP as the base open-vocabulary classifier for recognition, and SyncDreamer for novel view synthesis (NVS) and 3D reconstruction. Since these codebases are very clean & well-documented, we don't plan to provide the intermediary code here. Moreover, please note that our approach is not specific to any particular recognition or NVS/3D reconstruction model. Instead, it serves as a drop-in module that enables them to handle occlusions.

That said, please feel free to contact me at ege.ozguroglu@columbia.edu for specific scripts.
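For readers who want a starting point, the "drop-in module" idea described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the project's actual scripts: `with_amodal_completion`, `toy_complete`, `toy_embed`, and the hand-written embeddings are all hypothetical stand-ins, and neither CLIP nor SyncDreamer is called. In practice the completion step would be the amodal completion model, and the downstream step would be a real image encoder or reconstruction model.

```python
import math
from typing import Callable, List, Optional, Sequence, Tuple, TypeVar

Pixel = Tuple[float, float, float]   # one RGB pixel, values in [0, 1]
T = TypeVar("T")


def with_amodal_completion(
    complete: Callable[[List[Optional[Pixel]]], List[Pixel]],
    downstream: Callable[[List[Pixel]], T],
) -> Callable[[List[Optional[Pixel]]], T]:
    """Wrap any downstream model so it receives the completed whole object
    instead of the occluded input (hypothetical helper, not from the repo)."""
    def run(occluded: List[Optional[Pixel]]) -> T:
        return downstream(complete(occluded))
    return run


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def zero_shot_label(image_emb: Sequence[float],
                    text_embs: Sequence[Sequence[float]],
                    labels: Sequence[str]) -> str:
    """CLIP-style open-vocabulary scoring: return the label whose text
    embedding is most similar to the image embedding."""
    scores = [cosine(image_emb, t) for t in text_embs]
    return labels[scores.index(max(scores))]


# --- Toy stand-ins (assumptions for illustration only) ---------------------

def toy_complete(pixels: List[Optional[Pixel]]) -> List[Pixel]:
    """Stand-in for amodal completion: fill occluded pixels (None)
    with the mean colour of the visible pixels."""
    visible = [p for p in pixels if p is not None]
    mean = tuple(sum(channel) / len(visible) for channel in zip(*visible))
    return [p if p is not None else mean for p in pixels]


def toy_embed(pixels: List[Pixel]) -> Pixel:
    """Stand-in for an image encoder: the mean colour of the image."""
    return tuple(sum(channel) / len(pixels) for channel in zip(*pixels))


LABELS = ["a red object", "a green object", "a blue object"]
TEXT_EMBS = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]  # made-up text embeddings

classify = with_amodal_completion(
    toy_complete,
    lambda whole: zero_shot_label(toy_embed(whole), TEXT_EMBS, LABELS),
)

occluded = [(0.9, 0.1, 0.1), None, (0.8, 0.2, 0.1), None]  # None marks an occluded pixel
print(classify(occluded))
```

The same wrapper could take a novel view synthesis or 3D reconstruction callable as `downstream`, since the only contract assumed here is that the downstream step accepts the completed image.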

ac5113 · 2024-08-27T18:13:17Z

Thanks for the response!
I'll be sure to do so.

ac5113 closed this as completed Aug 27, 2024
