CSpace

浏览/检索结果: 共30条,第1-20条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 3, 页码: 1322-1338
作者:  Yu, Ting;  Lin, Xiaojun;  Wang, Shuhui;  Sheng, Weiguo;  Huang, Qingming;  Yu, Jun
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Three-dimensional displays  Task analysis  Visualization  Point cloud compression  Grounding  Surveys  Solid modeling  3D dense captioning  vision-language bridging  visual captioning  3D point cloud  
Learning Hierarchical Modular Networks for Video Captioning 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 2, 页码: 1049-1064
作者:  Li, Guorong;  Ye, Hanhua;  Qi, Yuankai;  Wang, Shuhui;  Qing, Laiyun;  Huang, Qingming;  Yang, Ming-Hsuan
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Video captioning  hierarchical modular network  scene-graph reward  reinforcement learning  
Temporal Dynamic Concept Modeling Network for Explainable Video Event Recognition 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 卷号: 19, 期号: 6, 页码: 22
作者:  Zhang, Weigang;  Qi, Zhaobo;  Wang, Shuhui;  Su, Chi;  Su, Li;  Huang, Qingming
收藏  |  浏览/下载:8/0  |  提交时间:2023/12/04
Event recognition  temporal concept receptive field  dynamic convolution  
General Greedy De-Bias Learning 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 8, 页码: 9789-9805
作者:  Han, Xinzhe;  Wang, Shuhui;  Su, Chi;  Huang, Qingming;  Tian, Qi
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Task analysis  Correlation  Training  Data models  Question answering (information retrieval)  Visualization  Image classification  Curriculum learning  dataset biases  greedy strategy  robust learning  
Self-Regulated Learning for Egocentric Video Activity Anticipation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 6, 页码: 6715-6730
作者:  Qi, Zhaobo;  Wang, Shuhui;  Su, Chi;  Su, Li;  Huang, Qingming;  Tian, Qi
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Predictive models  Dairy products  Semantics  Feature extraction  Visualization  Activity recognition  Task analysis  Egocentric video activity anticipaiton  third-person video activity anticipaiton  contrastive learning  multi-task learning  self-regulated learning  
Uncertainty Modeling for Robust Domain Adaptation Under Noisy Environments 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 6157-6170
作者:  Zhuo, Junbao;  Wang, Shuhui;  Huang, Qingming
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Domain Adaptation  Uncertainty  Noisy Label  Transfer Learning  Deep Learning  
Syntax-Guided Hierarchical Attention Network for Video Captioning 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 2, 页码: 880-892
作者:  Deng, Jincan;  Li, Liang;  Zhang, Beichen;  Wang, Shuhui;  Zha, Zhengjun;  Huang, Qingming
收藏  |  浏览/下载:19/0  |  提交时间:2022/12/07
Syntactics  Feature extraction  Visualization  Generators  Semantics  Two dimensional displays  Three-dimensional displays  Video captioning  syntax attention  content attention  global sentence-context  
Local-binarized very deep residual network for visual categorization 期刊论文
NEUROCOMPUTING, 2021, 卷号: 430, 页码: 82-93
作者:  Liu, Xuejing;  Li, Liang;  Wang, Shuhui;  Zha, Zheng-Jun;  Huang, Qingming
收藏  |  浏览/下载:39/0  |  提交时间:2021/12/01
Network compression and acceleration  Pose estimation  Object recognition  Saliency detection  Local binary residual block  
Harmonized Multimodal Learning with Gaussian Process Latent Variable Models 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 卷号: 43, 期号: 3, 页码: 858-872
作者:  Song, Guoli;  Wang, Shuhui;  Huang, Qingming;  Tian, Qi
收藏  |  浏览/下载:35/0  |  提交时间:2021/12/01
Multimodal learning  Gaussian process  latent variable modeling  cross-modal retrieval  
Learning Feature Representation and Partial Correlation for Multimodal Multi-Label Data 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 1882-1894
作者:  Song, Guoli;  Wang, Shuhui;  Huang, Qingming;  Tian, Qi
收藏  |  浏览/下载:38/0  |  提交时间:2021/12/01
Semantics  Correlation  Task analysis  Data models  Learning systems  Kernel  Deep learning  Cross-modal retrieval  correlation learning  feature learning  partial correlation  
Graph Regularized Encoder-Decoder Networks for Image Representation Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3124-3136
作者:  Yang, Shijie;  Li, Liang;  Wang, Shuhui;  Zhang, Weigang;  Huang, Qingming;  Tian, Qi
收藏  |  浏览/下载:36/0  |  提交时间:2021/12/01
Laplace equations  Visualization  Manifolds  Image reconstruction  Task analysis  Decoding  Semantics  Auto-encoder  encoder-decoder  graph regularizer  image representation learning  
Augmented Adversarial Training for Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 559-571
作者:  Wu, Yiling;  Wang, Shuhui;  Song, Guoli;  Huang, Qingming
收藏  |  浏览/下载:35/0  |  提交时间:2021/12/01
Cross-modal retrieval  data alignment  adversa-rial training  
Two-stream deep sparse network for accurate and efficient image restoration 期刊论文
COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 卷号: 200, 页码: 11
作者:  Wang, Shuhui;  Hu, Ling;  Li, Liang;  Zhang, Weigang;  Huang, Qingming
收藏  |  浏览/下载:294/0  |  提交时间:2020/12/10
Two-stream sparse network  Image restoration  Image super-resolution  Image denoising  
Online Fast Adaptive Low-Rank Similarity Learning for Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 5, 页码: 1310-1322
作者:  Wu, Yiling;  Wang, Shuhui;  Huang, Qingming
收藏  |  浏览/下载:65/0  |  提交时间:2020/12/10
Semantics  Correlation  Training  Data models  Visualization  Adaptation models  Fasteners  Cross-modality learning  similarity function learning  online learning  low-rank matrix  
SkeletonNet: A Hybrid Network With a Skeleton-Embedding Process for Multi-View Image Representation Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 卷号: 21, 期号: 11, 页码: 2916-2929
作者:  Yang, Shijie;  Li, Liang;  Wang, Shuhui;  Zhang, Weigang;  Huang, Qingming;  Tian, Qi
收藏  |  浏览/下载:51/0  |  提交时间:2020/12/10
Semantics  Correlation  Visualization  Skeleton  Matrix decomposition  Kernel  Laplace equations  Unsupervised multi-view subspace learning  semantic inconsistency  tensor factorization  deep auto-encoders  
Online Asymmetric Metric Learning With Multi-Layer Similarity Aggregation for Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 卷号: 28, 期号: 9, 页码: 4299-4312
作者:  Wu, Yiling;  Wang, Shuhui;  Song, Guoli;  Huang, Qingming
收藏  |  浏览/下载:253/0  |  提交时间:2019/08/16
Cross-modal retrieval  asymmetric metric  online learning  multi-layer aggregation  
Beyond global fusion: A group-aware fusion approach for multi-view image clustering 期刊论文
INFORMATION SCIENCES, 2019, 卷号: 493, 页码: 176-191
作者:  Xue, Zhe;  Li, Guorong;  Wang, Shuhui;  Huang, Jun;  Zhang, Weigang;  Huang, Qingming
收藏  |  浏览/下载:260/0  |  提交时间:2019/08/16
Multi-view learning  Local fusion strategy  Group-aware fusion  Image clustering  
Multi-modal semantic autoencoder for cross-modal retrieval 期刊论文
NEUROCOMPUTING, 2019, 卷号: 331, 页码: 165-175
作者:  Wu, Yiling;  Wang, Shuhui;  Huang, Qingming
收藏  |  浏览/下载:81/0  |  提交时间:2019/04/03
Cross-modal retrieval  Multi-modal data  Autoencoder  
A Hierarchical CNN-RNN Approach for Visual Emotion Classification 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 卷号: 15, 期号: 3, 页码: 17
作者:  Li, Liang;  Zhu, Xinge;  Hao, Yiming;  Wang, Shuhui;  Gao, Xingyu;  Huang, Qingming
收藏  |  浏览/下载:36/0  |  提交时间:2020/12/10
Visual emotion recognition  multi-task learning  feature fusing  hierarchical CNN-RNN  stacked bi-directional RNN  
Semantic invariant cross-domain image generation with generative adversarial networks 期刊论文
NEUROCOMPUTING, 2018, 卷号: 293, 页码: 55-63
作者:  Mao, Xiaofeng;  Wang, Shuhui;  Zheng, Liying;  Huang, Qingming
收藏  |  浏览/下载:69/0  |  提交时间:2019/12/10
Generative adversarial networks  Image-to-image translation  Semantic invariance