开源软件名称(OpenSource Name):hche11/Localizing-Visual-Sounds-the-Hard-Way开源软件地址(OpenSource Url):https://github.com/hche11/Localizing-Visual-Sounds-the-Hard-Way开源编程语言(OpenSource Language):Python 100.0%开源软件介绍(OpenSource Introduction):Localizing-Visual-Sounds-the-Hard-WayCode and Dataset for "Localizing Visual Sounds the Hard Way". The repo contains code and our pre-trained model. Environment
Flickr-SoundNetWe provide the pretrained model here. To test the model, testing data and ground truth should be downloaded from learning to localize sound source. Then run
VGG-Sound SourceWe provide the pretrained model here. To test the model, run
(Note, some gt bounding boxes are updated recently, all results on VGG-SS cause a 2~3% difference on IoU.) Both test data should be placed in the following structure.
Citation
|
2023-10-27
2022-08-15
2022-08-17
2022-09-23
2022-08-13
请发表评论