CSpace

浏览/检索结果: 共350条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Towards Food Image Retrieval via Generalization-Oriented Sampling and Loss Function Design 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 1, 页码: 19
作者:  Song, Jiajun;  Li, Zhuo;  Min, Weiqing;  Jiang, Shuqiang
收藏  |  浏览/下载:6/0  |  提交时间:2023/12/04
Food computing  image retrieval  deep learning  
Dual-View Curricular Optimal Transport for Cross-Lingual Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1522-1533
作者:  Wang, Yabing;  Wang, Shuhui;  Luo, Hao;  Dong, Jianfeng;  Wang, Fan;  Han, Meng;  Wang, Xun;  Wang, Meng
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
Visualization  Noise measurement  Estimation  Costs  Transportation  Training  Task analysis  Cross-modal retrieval  noise correspondence learning  cross-lingual transfer  optimal transport  machine translation  
STAR-TM: STructure Aware Reconstruction of Textured Mesh From Single Image 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 15680-15693
作者:  Wu, Tong;  Gao, Lin;  Zhang, Ling-Xiao;  Lai, Yu-Kun;  Zhang, Hao
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
Image reconstruction  Shape  Transformers  Periodic structures  Three-dimensional displays  Semantics  Training  Structure-aware single-view 3D reconstruction  textured meshes  texture completion  transformer  
Large Scale Visual Food Recognition 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 8, 页码: 9932-9949
作者:  Min, Weiqing;  Wang, Zhiling;  Liu, Yuxin;  Luo, Mengjiang;  Kang, Liping;  Wei, Xiaoming;  Wei, Xiaolin;  Jiang, Shuqiang
收藏  |  浏览/下载:6/0  |  提交时间:2023/12/04
Image recognition  Visualization  Task analysis  Benchmark testing  Representation learning  Training  Semantics  Food dataset  food recognition  large-scale datasets  fine-grained recognition  
General Greedy De-Bias Learning 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 8, 页码: 9789-9805
作者:  Han, Xinzhe;  Wang, Shuhui;  Su, Chi;  Huang, Qingming;  Tian, Qi
收藏  |  浏览/下载:6/0  |  提交时间:2023/12/04
Task analysis  Correlation  Training  Data models  Question answering (information retrieval)  Visualization  Image classification  Curriculum learning  dataset biases  greedy strategy  robust learning  
Importance First: Generating Scene Graph of Human Interest 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 页码: 27
作者:  Wang, Wenbin;  Wang, Ruiping;  Shan, Shiguang;  Chen, Xilin
收藏  |  浏览/下载:6/0  |  提交时间:2023/12/04
Key relationship  Hierarchical entity tree  Hierarchical contextual propagation  Relationship ranking  Spatial scale  Visual saliency  
MemBridge: Video-Language Pre-Training With Memory-Augmented Inter-Modality Bridge 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4073-4087
作者:  Yang, Jiahao;  Li, Xiangyang;  Zheng, Mao;  Wang, Zihan;  Zhu, Yongqing;  Guo, Xiaoqian;  Yuan, Yuchen;  Chai, Zifeng;  Jiang, Shuqiang
收藏  |  浏览/下载:8/0  |  提交时间:2023/12/04
Video-language pre-training  inter-modality bridge  memory module  
Unsupervised Cross-Modal Hashing via Semantic Text Mining 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 8946-8957
作者:  Tu, Rong-Cheng;  Mao, Xian-Ling;  Lin, Qinghong;  Ji, Wenjin;  Qin, Weize;  Wei, Wei;  Huang, Heyan
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
Cross-modal retrieval  deep supervised hashing  semantic text mining  self-redefined-similarity loss  
CLIP4Clip: An empirical study of CLIP for end to end video clip retrieval and captioning 期刊论文
NEUROCOMPUTING, 2022, 卷号: 508, 页码: 293-304
作者:  Luo, Huaishao;  Ji, Lei;  Zhong, Ming;  Chen, Yang;  Lei, Wen;  Duan, Nan;  Li, Tianrui
收藏  |  浏览/下载:27/0  |  提交时间:2022/12/07
Video retrieval  Video captioning  CLIP  
I(2)Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 3565-3577
作者:  Tu, Yunbin;  Li, Liang;  Su, Li;  Gao, Shengxiang;  Yan, Chenggang;  Zha, Zheng-Jun;  Yu, Zhengtao;  Huang, Qingming
收藏  |  浏览/下载:16/0  |  提交时间:2022/12/07
Transformers  Semantics  Task analysis  Visualization  TV  Electronic mail  Graph neural networks  TV Show captioning  video and subtitle  intra-relation embedding  inter-relation embedding  transformer