Generating Visually Aligned Sound from Videos
We focus on the task of generating sound from natural videos, and the sound should be both temporally and content-wise aligned with visual signals. This task is extremely challenging because some sounds generated outside a camera can not be inferred from video content. The model may be forced to lea...
Ausführliche Beschreibung
Bibliographische Detailangaben
Veröffentlicht in: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - PP(2020) vom: 28. Juli
|
1. Verfasser: |
Chen, Peihao
(VerfasserIn) |
Weitere Verfasser: |
Zhang, Yang,
Tan, Mingkui,
Xiao, Hongdong,
Huang, Deng,
Gan, Chuang |
Format: | Online-Aufsatz
|
Sprache: | English |
Veröffentlicht: |
2020
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
|
Schlagworte: | Journal Article |