CSpace

Browse/Search results: 15 items in total, showing items 1-10

Dubbing Movies via Hierarchical Phoneme Modeling and Acoustic Diffusion Denoising (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, Volume: 47, Issue: 11, Pages: 10361-10377
Authors: Li, Liang; Cong, Gaoxiang; Qi, Yuankai; Zha, Zheng-Jun; Wu, Qi; Sheng, Quan Z.; Huang, Qingming; Yang, Ming-Hsuan
Views/Downloads: 4/0 | Submitted: 2025/12/03
Videos  Lips  Visualization  Acoustics  Cloning  Noise reduction  Motion pictures  Head  Adaptation models  Text to speech  Visual voice cloning  speech synthesis  hierarchical phoneme modeling  contrastive learning  acoustic diffusion denoising  
NeRFFaceShop: Learning a Photo-Realistic 3D-Aware Generative Model of Animatable and Relightable Heads From Large-Scale in-the-Wild Videos (Journal article)
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2025, Volume: 31, Issue: 10, Pages: 7938-7950
Authors: Jiang, Kaiwen; Liu, Feng-Lin; Chen, Shu-Yu; Wan, Pengfei; Zhang, Yuan; Lai, Yu-Kun; Fu, Hongbo; Gao, Lin
Views/Downloads: 4/0 | Submitted: 2025/12/03
Animation  Lighting  Head  Three-dimensional displays  Videos  Training  Computational modeling  Solid modeling  Aerospace electronics  Rendering (computer graphics)  Face animation  face relighting  volume disentangling  neural radiance fields  neural rendering  
A framework for Chinese event semantic constraint predicates and their generation using PEFT LLM and RAG (Journal article)
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, Pages: 26
Authors: Huang, Qiaojuan; Wang, Shi; He, Qing; Cao, Cungen
Views/Downloads: 4/0 | Submitted: 2025/12/03
Event semantic analysis  Head-tail event semantic constraint predicates  Large language models  Parameter-efficient fine-tuning  Retrieval-augmented generation  
FullLoRA: Efficiently Boosting the Robustness of Pretrained Vision Transformers (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, Volume: 34, Pages: 4580-4590
Authors: Yuan, Zheng; Zhang, Jie; Shan, Shiguang; Chen, Xilin
Views/Downloads: 4/0 | Submitted: 2025/12/03
Training  Computational modeling  Robustness  Adaptation models  Computer vision  Transformers  Visualization  Natural language processing  Image classification  Head  Adversarial training  parameter-efficient  pretrained model  
Infant cry classification using an efficient graph structure and attention-based model (Journal article)
KUWAIT JOURNAL OF SCIENCE, 2024, Volume: 51, Issue: 3, Pages: 9
Authors: Qiao, Xuesong; Jiao, Siwen; Li, Han; Liu, Gengyuan; Gao, Xuan; Li, Zhanshan
Views/Downloads: 30/0 | Submitted: 2024/12/06
Neural network  Multi-head attention  Infant cry  Audio classification  
Research on Aspect-Level Sentiment Analysis Based on Adversarial Training and Dependency Parsing (Journal article)
ELECTRONICS, 2024, Volume: 13, Issue: 10, Pages: 16
Authors: Xu, Erfeng; Zhu, Junwu; Zhang, Luchen; Wang, Yi; Lin, Wei
Views/Downloads: 37/0 | Submitted: 2024/12/06
multi head attention mechanism  dependency syntactic relationships  adjacency matrix  adversarial training  
CRIC: A VQA Dataset for Compositional Reasoning on Vision and Commonsense (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 5, Pages: 5561-5578
Authors: Gao, Difei; Wang, Ruiping; Shan, Shiguang; Chen, Xilin
Views/Downloads: 52/0 | Submitted: 2023/12/04
Visualization  Task analysis  Tail  Head  Annotations  Magnetic heads  Mouth  Visual question answering  compositional reasoning  commonsense reasoning  dataset construction  
3D Face Reconstruction and Gaze Tracking in the HMD for Virtual Interaction (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, Volume: 25, Pages: 3166-3179
Authors: Chen, Shu-Yu; Lai, Yu-Kun; Xia, Shihong; Rosin, Paul L.; Gao, Lin
Views/Downloads: 49/0 | Submitted: 2023/12/04
Communication  eye tracking  head-mounted display  real-time facial performance capture  user interaction  virtual reality  
BLPSeg: Balance the Label Preference in Scribble-Supervised Semantic Segmentation (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 4921-4934
Authors: Wang, Yude; Zhang, Jie; Kan, Meina; Shan, Shiguang; Chen, Xilin
Views/Downloads: 49/0 | Submitted: 2023/12/04
Annotations  Semantic segmentation  Training  Task analysis  Head  Semantics  Costs  Scribble-supervised  weakly supervised  semantic segmentation  
Spatial-Temporal Graph Network for Video Crowd Counting (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 1, Pages: 228-241
Authors: Wu, Zhe; Zhang, Xinfeng; Tian, Geng; Wang, Yaowei; Huang, Qingming
Views/Downloads: 55/0 | Submitted: 2023/07/12
Computational modeling  Predictive models  Analytical models  Long short term memory  Optical flow  Integrated circuit modeling  Head  Video-based crowd counting  spatiotemporal graph attention  multi-scale module