CSpace

浏览/检索结果: 共18条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 3, 页码: 1322-1338
作者:  Yu, Ting;  Lin, Xiaojun;  Wang, Shuhui;  Sheng, Weiguo;  Huang, Qingming;  Yu, Jun
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Three-dimensional displays  Task analysis  Visualization  Point cloud compression  Grounding  Surveys  Solid modeling  3D dense captioning  vision-language bridging  visual captioning  3D point cloud  
Dual-View Curricular Optimal Transport for Cross-Lingual Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1522-1533
作者:  Wang, Yabing;  Wang, Shuhui;  Luo, Hao;  Dong, Jianfeng;  Wang, Fan;  Han, Meng;  Wang, Xun;  Wang, Meng
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Visualization  Noise measurement  Estimation  Costs  Transportation  Training  Task analysis  Cross-modal retrieval  noise correspondence learning  cross-lingual transfer  optimal transport  machine translation  
Uncertainty Modeling for Robust Domain Adaptation Under Noisy Environments 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 6157-6170
作者:  Zhuo, Junbao;  Wang, Shuhui;  Huang, Qingming
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Domain Adaptation  Uncertainty  Noisy Label  Transfer Learning  Deep Learning  
Syntax-Guided Hierarchical Attention Network for Video Captioning 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 2, 页码: 880-892
作者:  Deng, Jincan;  Li, Liang;  Zhang, Beichen;  Wang, Shuhui;  Zha, Zhengjun;  Huang, Qingming
收藏  |  浏览/下载:19/0  |  提交时间:2022/12/07
Syntactics  Feature extraction  Visualization  Generators  Semantics  Two dimensional displays  Three-dimensional displays  Video captioning  syntax attention  content attention  global sentence-context  
Learning Feature Representation and Partial Correlation for Multimodal Multi-Label Data 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 1882-1894
作者:  Song, Guoli;  Wang, Shuhui;  Huang, Qingming;  Tian, Qi
收藏  |  浏览/下载:38/0  |  提交时间:2021/12/01
Semantics  Correlation  Task analysis  Data models  Learning systems  Kernel  Deep learning  Cross-modal retrieval  correlation learning  feature learning  partial correlation  
Graph Regularized Encoder-Decoder Networks for Image Representation Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3124-3136
作者:  Yang, Shijie;  Li, Liang;  Wang, Shuhui;  Zhang, Weigang;  Huang, Qingming;  Tian, Qi
收藏  |  浏览/下载:36/0  |  提交时间:2021/12/01
Laplace equations  Visualization  Manifolds  Image reconstruction  Task analysis  Decoding  Semantics  Auto-encoder  encoder-decoder  graph regularizer  image representation learning  
Pano-SfMLearner: Self-Supervised Multi-Task Learning of Depth and Semantics in Panoramic Videos 期刊论文
IEEE SIGNAL PROCESSING LETTERS, 2021, 卷号: 28, 页码: 832-836
作者:  Liu, Mengyi;  Wang, Shuhui;  Guo, Yulan;  He, Yuan;  Xue, Hui
收藏  |  浏览/下载:36/0  |  提交时间:2021/12/01
Depth estimation  semantic segmentation  pano-ramic video  self-supervised learning  
Augmented Adversarial Training for Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 559-571
作者:  Wu, Yiling;  Wang, Shuhui;  Song, Guoli;  Huang, Qingming
收藏  |  浏览/下载:35/0  |  提交时间:2021/12/01
Cross-modal retrieval  data alignment  adversa-rial training  
Online Fast Adaptive Low-Rank Similarity Learning for Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 5, 页码: 1310-1322
作者:  Wu, Yiling;  Wang, Shuhui;  Huang, Qingming
收藏  |  浏览/下载:64/0  |  提交时间:2020/12/10
Semantics  Correlation  Training  Data models  Visualization  Adaptation models  Fasteners  Cross-modality learning  similarity function learning  online learning  low-rank matrix  
Textual-Visual Reference-Aware Attention Network for Visual Dialog 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 卷号: 29, 页码: 6655-6666
作者:  Guo, Dan;  Wang, Hui;  Wang, Shuhui;  Wang, Meng
收藏  |  浏览/下载:44/0  |  提交时间:2020/12/10
Visual dialog  attention network  textual reference  visual reference  multimodal semantic interaction