CSpace

浏览/检索结果: 共126条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Vision-based food nutrition estimation via RGB-D fusion network 期刊论文
FOOD CHEMISTRY, 2023, 卷号: 424, 页码: 10
作者:  Shao, Wenjing;  Min, Weiqing;  Hou, Sujuan;  Luo, Mengjiang;  Li, Tianhao;  Zheng, Yuanjie;  Jiang, Shuqiang
收藏  |  浏览/下载:14/0  |  提交时间:2023/12/04
Food nutrient  Nutrition estimation  Food composition  Deep learning  RGB-D fusion  
Mining collaborative spatio-temporal clues for face forgery detection 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 页码: 20
作者:  Ding, Bo;  Fan, Zhenfeng;  Zhao, Zejun;  Xia, Shihong
收藏  |  浏览/下载:11/0  |  提交时间:2023/12/04
Face forgery detection  Spatial-temporal clue  Low-level feature  Collaborative learning  Multimodel attention  
Large Scale Visual Food Recognition 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 8, 页码: 9932-9949
作者:  Min, Weiqing;  Wang, Zhiling;  Liu, Yuxin;  Luo, Mengjiang;  Kang, Liping;  Wei, Xiaoming;  Wei, Xiaolin;  Jiang, Shuqiang
收藏  |  浏览/下载:11/0  |  提交时间:2023/12/04
Image recognition  Visualization  Task analysis  Benchmark testing  Representation learning  Training  Semantics  Food dataset  food recognition  large-scale datasets  fine-grained recognition  
Bi-STAN: bilinear spatial-temporal attention network for wearable human activity recognition 期刊论文
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 页码: 17
作者:  Gao, Chenlong;  Chen, Yiqiang;  Jiang, Xinlong;  Hu, Lisha;  Zhao, Zhicheng;  Zhang, Yuxin
收藏  |  浏览/下载:18/0  |  提交时间:2023/07/12
Human activity recognition  Spatial-temporal attention  Bilinear pooling  Low-redundancy  
CLIP4Clip: An empirical study of CLIP for end to end video clip retrieval and captioning 期刊论文
NEUROCOMPUTING, 2022, 卷号: 508, 页码: 293-304
作者:  Luo, Huaishao;  Ji, Lei;  Zhong, Ming;  Chen, Yang;  Lei, Wen;  Duan, Nan;  Li, Tianrui
收藏  |  浏览/下载:40/0  |  提交时间:2022/12/07
Video retrieval  Video captioning  CLIP  
Self-Supervised Enhancement for Named Entity Disambiguation via Multimodal Graph Convolution 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 15
作者:  Zhou, Pengfei;  Ying, Kaining;  Wang, Zhenhua;  Guo, Dongyan;  Bai, Cong
收藏  |  浏览/下载:34/0  |  提交时间:2022/12/07
Task analysis  Convolution  Semantics  Internet  Bit error rate  Visualization  Pipelines  Graph convolutional network (GCN)  multimodal data  named entity disambiguation (NED)  self-supervised learning (SSL)  
Richer fusion network for breast cancer classification based on multimodal data 期刊论文
BMC Medical Informatics and Decision Making, 2021, 卷号: 21, 期号: Suppl 1
作者:  Yan,Rui;  Zhang,Fa;  Rao,Xiaosong;  Lv,Zhilong;  Li,Jintao;  Zhang,Lingling;  Liang,Shuang;  Li,Yilin;  Ren,Fei;  Zheng,Chunhou;  Liang,Jun
收藏  |  浏览/下载:38/0  |  提交时间:2021/12/01
Pathological image  Electronic medical record  Multimodal fusion  Breast cancer classification  Convolutional neural network  
Harmonized Multimodal Learning with Gaussian Process Latent Variable Models 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 卷号: 43, 期号: 3, 页码: 858-872
作者:  Song, Guoli;  Wang, Shuhui;  Huang, Qingming;  Tian, Qi
收藏  |  浏览/下载:40/0  |  提交时间:2021/12/01
Multimodal learning  Gaussian process  latent variable modeling  cross-modal retrieval  
Learning Feature Representation and Partial Correlation for Multimodal Multi-Label Data 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 1882-1894
作者:  Song, Guoli;  Wang, Shuhui;  Huang, Qingming;  Tian, Qi
收藏  |  浏览/下载:43/0  |  提交时间:2021/12/01
Semantics  Correlation  Task analysis  Data models  Learning systems  Kernel  Deep learning  Cross-modal retrieval  correlation learning  feature learning  partial correlation  
Bridging Text and Video: A Universal Multimodal Transformer for Audio-Visual Scene-Aware Dialog 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 卷号: 29, 页码: 2476-2483
作者:  Li, Zekang;  Li, Zongjia;  Zhang, Jinchao;  Feng, Yang;  Zhou, Jie
收藏  |  浏览/下载:46/0  |  提交时间:2021/12/01
Task analysis  Feature extraction  Visualization  Speech processing  History  Social networking (online)  Pattern recognition  Dialogue System  Multimodal  Natural Language Processing  Video Understanding