CSpace

浏览/检索结果: 共51条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Dubbing Movies via Hierarchical Phoneme Modeling and Acoustic Diffusion Denoising 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 卷号: 47, 期号: 11, 页码: 10361-10377
作者:  Li, Liang;  Cong, Gaoxiang;  Qi, Yuankai;  Zha, Zheng-Jun;  Wu, Qi;  Sheng, Quan Z.;  Huang, Qingming;  Yang, Ming-Hsuan
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Videos  Lips  Visualization  Acoustics  Cloning  Noise reduction  Motion pictures  Head  Adaptation models  Text to speech  Visual voice cloning  speech synthesis  hierarchical phoneme modeling  contrastive learning  acoustic diffusion denoising  
Dynamic Strategy Prompt Reasoning for Emotional Support Conversation 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 卷号: 27, 页码: 108-119
作者:  Liu, Yiting;  Li, Liang;  Tu, Yunbin;  Zhang, Beichen;  Zha, Zheng-Jun;  Huang, Qingming
收藏  |  浏览/下载:42/0  |  提交时间:2025/06/25
Emotion recognition  Oral communication  Commonsense reasoning  History  Information processing  Generators  Computers  Computer science  Visualization  Semantics  Emotion support conversation  strategy prompt reasoning  
Inductive State-Relabeling Adversarial Active Learning With Heuristic Clique Rescaling 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 12, 页码: 9780-9796
作者:  Zhang, Beichen;  Li, Liang;  Wang, Shuhui;  Cai, Shaofei;  Zha, Zheng-Jun;  Tian, Qi;  Huang, Qingming
收藏  |  浏览/下载:39/0  |  提交时间:2025/06/25
Active learning  adversarial learning  state relabeling  contrastive learning  data diversity  
Context-Aware ProposalBoundary Network With Structural Consistency for Audiovisual Event Localization 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 卷号: 35, 期号: 11, 页码: 15872-15882
作者:  Wang, Hao;  Zha, Zheng-Jun;  Li, Liang;  Chen, Xuejin;  Luo, Jiebo
收藏  |  浏览/下载:16/0  |  提交时间:2025/06/25
Proposals  Visualization  Location awareness  Encoding  Task analysis  Feature extraction  Aggregates  Audiovisual learning  context learning  event localization  
Multi-Grained Representation Aggregating Transformer with Gating Cycle for Change Captioning 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 10, 页码: 23
作者:  Yue, Shengbin;  Tu, Yunbin;  Li, Liang;  Gao, Shengxiang;  Yu, Zhengtao
收藏  |  浏览/下载:18/0  |  提交时间:2025/06/25
Change captioning  multi-grained representation aggregating  gating cycle  Transformer  
Learning Domain Invariant Features for Unsupervised Indoor Depth Estimation Adaptation 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 9, 页码: 23
作者:  Zhang, Jiehua;  Li, Liang;  Yan, Chenggang;  Wang, Zhan;  Xu, Changliang;  Zhang, Jiyong;  Chen, Chuqiao
收藏  |  浏览/下载:61/0  |  提交时间:2024/12/06
Indoor depth estimation  unsupervised learning  transfer learning  domain adaptation  
Progressive Decision Boundary Shifting for Unsupervised Domain Adaptation 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 12
作者:  Li, Liang;  Lu, Tongyu;  Sun, Yaoqi;  Gao, Yuhan;  Yan, Chenggang;  Hu, Zhenghui;  Huang, Qingming
收藏  |  浏览/下载:45/0  |  提交时间:2024/12/06
Uncertainty  Feature extraction  Semantics  Task analysis  Training  Adversarial machine learning  Symbols  Domain shifting  progressive decision boundary  self-learning  unsupervised domain adaptation (UDA)  
SMART: Syntax-Calibrated Multi-Aspect Relation Transformer for Change Captioning 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 7, 页码: 4926-4943
作者:  Tu, Yunbin;  Li, Liang;  Su, Li;  Zha, Zheng-Jun;  Huang, Qingming
收藏  |  浏览/下载:36/0  |  提交时间:2024/12/06
Semantics  Visualization  Transformers  Decoding  Switches  Syntactics  Image representation  Change captioning  multi-aspect relation learning  part-of-speech  visual switch  transformer  
Context Disentangling and Prototype Inheriting for Robust Visual Grounding 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 5, 页码: 3213-3229
作者:  Tang, Wei;  Li, Liang;  Liu, Xuejing;  Jin, Lu;  Tang, Jinhui;  Li, Zechao
收藏  |  浏览/下载:70/0  |  提交时间:2024/05/20
Visualization  Grounding  Prototypes  Transformers  Task analysis  Linguistics  Feature extraction  Context disentangling  open-vocabulary scene  prototype discovering  robust grounding  visual grounding (VG)  
Progressive Depth Decoupling and Modulating for Flexible Depth Completion 期刊论文
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 卷号: 73, 页码: 16
作者:  Yang, Zhiwen;  Zhang, Jiehua;  Li, Liang;  Yan, Chenggang;  Sun, Yaoqi;  Yin, Haibing
收藏  |  浏览/下载:36/0  |  提交时间:2024/12/06
Accuracy  Decoding  Transformers  Three-dimensional displays  Task analysis  Estimation  Research and development  Adaptive depth modulating  depth completion  depth discretization  incremental depth decoupling