Open-source software name: daveredrum/ScanRefer
Open-source software URL: https://github.com/daveredrum/ScanRefer
Open-source language: Python 92.4%
Open-source software introduction:

ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language

Introduction

We introduce the new task of 3D object localization in RGB-D scans using natural language descriptions. As input, we assume a point cloud of a scanned 3D scene along with a free-form description of a specified target object. To address this task, we propose ScanRefer, where the core idea is to learn a fused descriptor from 3D object proposals and encoded sentence embeddings. This learned descriptor correlates the language expressions with the underlying geometric features of the 3D scan and facilitates the regression of the 3D bounding box of the target object. To train and benchmark our method, we introduce a new ScanRefer dataset, containing 51,583 descriptions of 11,046 objects from 800 ScanNet scenes. ScanRefer is the first large-scale effort to perform object localization via natural language expressions directly in 3D.

Please also check out the project website. For additional detail, please see the ScanRefer paper.
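To make the idea of a fused descriptor more concrete, here is a minimal sketch in plain Python. It is not the ScanRefer implementation: the function names, feature sizes, random weights, and the 6-number box layout (center plus size) are all assumptions chosen for illustration. It also selects the best-matching proposal box instead of regressing a box, which is a simplification of what the introduction describes. Each proposal feature is concatenated with the sentence embedding, passed through a small two-layer network, and the scores are normalized over proposals.

```python
import math
import random


def mlp_score(fused, w1, w2):
    """Two-layer scoring network (hypothetical): ReLU hidden layer, scalar output."""
    hidden = [max(0.0, sum(w * x for w, x in zip(row, fused))) for row in w1]
    return sum(w * h for w, h in zip(w2, hidden))


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def localize(boxes, proposal_feats, sentence_emb, w1, w2):
    """Fuse each proposal with the sentence and return the most likely box."""
    scores = []
    for feat in proposal_feats:
        fused = list(feat) + list(sentence_emb)  # fused descriptor by concatenation
        scores.append(mlp_score(fused, w1, w2))
    probs = softmax(scores)
    best = max(range(len(probs)), key=probs.__getitem__)
    return boxes[best], probs


# Toy data: all sizes and values below are illustrative assumptions.
rng = random.Random(0)
num_proposals, feat_dim, lang_dim, hidden_dim = 5, 4, 3, 8

boxes = [[rng.uniform(-1, 1) for _ in range(6)] for _ in range(num_proposals)]
proposal_feats = [[rng.gauss(0, 1) for _ in range(feat_dim)] for _ in range(num_proposals)]
sentence_emb = [rng.gauss(0, 1) for _ in range(lang_dim)]

# Untrained random weights stand in for learned parameters.
w1 = [[rng.gauss(0, 1) for _ in range(feat_dim + lang_dim)] for _ in range(hidden_dim)]
w2 = [rng.gauss(0, 1) for _ in range(hidden_dim)]

box, probs = localize(boxes, proposal_feats, sentence_emb, w1, w2)
print("selected box:", box)
print("proposal probabilities:", probs)
```

The hidden ReLU layer matters here: with a purely linear score over the concatenation, the sentence would add the same constant to every proposal and would not change which one is selected.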