CSpace

浏览/检索结果: 共13条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Dubbing Movies via Hierarchical Phoneme Modeling and Acoustic Diffusion Denoising 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 卷号: 47, 期号: 11, 页码: 10361-10377
作者:  Li, Liang;  Cong, Gaoxiang;  Qi, Yuankai;  Zha, Zheng-Jun;  Wu, Qi;  Sheng, Quan Z.;  Huang, Qingming;  Yang, Ming-Hsuan
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Videos  Lips  Visualization  Acoustics  Cloning  Noise reduction  Motion pictures  Head  Adaptation models  Text to speech  Visual voice cloning  speech synthesis  hierarchical phoneme modeling  contrastive learning  acoustic diffusion denoising  
Dynamic Strategy Prompt Reasoning for Emotional Support Conversation 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 卷号: 27, 页码: 108-119
作者:  Liu, Yiting;  Li, Liang;  Tu, Yunbin;  Zhang, Beichen;  Zha, Zheng-Jun;  Huang, Qingming
收藏  |  浏览/下载:39/0  |  提交时间:2025/06/25
Emotion recognition  Oral communication  Commonsense reasoning  History  Information processing  Generators  Computers  Computer science  Visualization  Semantics  Emotion support conversation  strategy prompt reasoning  
Inductive State-Relabeling Adversarial Active Learning With Heuristic Clique Rescaling 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 12, 页码: 9780-9796
作者:  Zhang, Beichen;  Li, Liang;  Wang, Shuhui;  Cai, Shaofei;  Zha, Zheng-Jun;  Tian, Qi;  Huang, Qingming
收藏  |  浏览/下载:37/0  |  提交时间:2025/06/25
Active learning  adversarial learning  state relabeling  contrastive learning  data diversity  
Context-Aware ProposalBoundary Network With Structural Consistency for Audiovisual Event Localization 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 卷号: 35, 期号: 11, 页码: 15872-15882
作者:  Wang, Hao;  Zha, Zheng-Jun;  Li, Liang;  Chen, Xuejin;  Luo, Jiebo
收藏  |  浏览/下载:13/0  |  提交时间:2025/06/25
Proposals  Visualization  Location awareness  Encoding  Task analysis  Feature extraction  Aggregates  Audiovisual learning  context learning  event localization  
SMART: Syntax-Calibrated Multi-Aspect Relation Transformer for Change Captioning 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 7, 页码: 4926-4943
作者:  Tu, Yunbin;  Li, Liang;  Su, Li;  Zha, Zheng-Jun;  Huang, Qingming
收藏  |  浏览/下载:33/0  |  提交时间:2024/12/06
Semantics  Visualization  Transformers  Decoding  Switches  Syntactics  Image representation  Change captioning  multi-aspect relation learning  part-of-speech  visual switch  transformer  
Downstream-Pretext Domain Knowledge Traceback for Active Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 10585-10596
作者:  Zhang, Beichen;  Li, Liang;  Zha, Zheng-Jun;  Luo, Jiebo;  Huang, Qingming
收藏  |  浏览/下载:13/0  |  提交时间:2025/06/25
Task analysis  Uncertainty  Annotations  Data models  Training  Visualization  Transformers  Active learning  pretext training  domain knowledge  self-supervised learning  
Context-Aware Proposal-Boundary Network With Structural Consistency for Audiovisual Event Localization 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 11
作者:  Wang, Hao;  Zha, Zheng-Jun;  Li, Liang;  Chen, Xuejin;  Luo, Jiebo
收藏  |  浏览/下载:46/0  |  提交时间:2023/12/04
Audiovisual learning  context learning  event localization  
Semantic and Relation Modulation for Audio-Visual Event Localization 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 6, 页码: 7711-7725
作者:  Wang, Hao;  Zha, Zheng-Jun;  Li, Liang;  Chen, Xuejin;  Luo, Jiebo
收藏  |  浏览/下载:49/0  |  提交时间:2023/12/04
Visualization  Location awareness  Correlation  Proposals  Semantics  Task analysis  Modulation  Audio-visual learning  event localization  normalization  
Learning Degradation-Invariant Representation for Robust Real-World Person Re-Identification 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 页码: 27
作者:  Huang, Yukun;  Fu, Xueyang;  Li, Liang;  Zha, Zheng-Jun
收藏  |  浏览/下载:64/0  |  提交时间:2022/12/07
Person Re-ID  Representation learning  Vision in bad weather  Deep learning  Low-light image enhancement  
I(2)Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 3565-3577
作者:  Tu, Yunbin;  Li, Liang;  Su, Li;  Gao, Shengxiang;  Yan, Chenggang;  Zha, Zheng-Jun;  Yu, Zhengtao;  Huang, Qingming
收藏  |  浏览/下载:62/0  |  提交时间:2022/12/07
Transformers  Semantics  Task analysis  Visualization  TV  Electronic mail  Graph neural networks  TV Show captioning  video and subtitle  intra-relation embedding  inter-relation embedding  transformer