CSpace

浏览/检索结果: 共2条,第1-2条 帮助

已选(0)清除 条数/页:   排序方式:
Dubbing Movies via Hierarchical Phoneme Modeling and Acoustic Diffusion Denoising 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 卷号: 47, 期号: 11, 页码: 10361-10377
作者:  Li, Liang;  Cong, Gaoxiang;  Qi, Yuankai;  Zha, Zheng-Jun;  Wu, Qi;  Sheng, Quan Z.;  Huang, Qingming;  Yang, Ming-Hsuan
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Videos  Lips  Visualization  Acoustics  Cloning  Noise reduction  Motion pictures  Head  Adaptation models  Text to speech  Visual voice cloning  speech synthesis  hierarchical phoneme modeling  contrastive learning  acoustic diffusion denoising  
Rethink video retrieval representation for video captioning 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 156, 页码: 13
作者:  Tian, Mingkai;  Li, Guorong;  Qi, Yuankai;  Wang, Shuhui;  Sheng, Quan Z.;  Huang, Qingming
收藏  |  浏览/下载:14/0  |  提交时间:2025/06/25
Video captioning  Video-text retrieval  Token shift  Cross-attention