CSpace

浏览/检索结果: 共25条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Cross Modal Compression With Variable Rate Prompt 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 3444-3456
作者:  Gao, Junlong;  Li, Jiguo;  Jia, Chuanmin;  Wang, Shanshe;  Ma, Siwei;  Gao, Wen
收藏  |  浏览/下载:12/0  |  提交时间:2024/05/20
Cross modal compression  semantic fidelity  variable rate prompt  
Semantic-Aware Visual Decomposition for Image Coding 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 页码: 23
作者:  Chang, Jianhui;  Zhang, Jian;  Li, Jiguo;  Wang, Shiqi;  Mao, Qi;  Jia, Chuanmin;  Ma, Siwei;  Gao, Wen
收藏  |  浏览/下载:14/0  |  提交时间:2023/12/04
Image coding  Semantic-aware visual decomposition  Structure-texture  Coherency regularization  Extremely low bitrate  
Deep Intra Prediction by Jointly Exploiting Local and Non-Local Similarities 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 5, 页码: 2396-2409
作者:  Lei, Meng;  Zhang, Jiaqi;  Wang, Shiqi;  Wang, Shanshe;  Ma, Siwei
收藏  |  浏览/下载:17/0  |  提交时间:2023/12/04
Correlation  Predictive models  Deep learning  Video coding  Image reconstruction  Encoding  Convolutional neural networks  Intra prediction  attention mechanism  template matching  non-local operation  
Learned Image Compression Using Cross-Component Attention Mechanism 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 5478-5493
作者:  Duan, Wenhong;  Chang, Zheng;  Jia, Chuanmin;  Wang, Shanshe;  Ma, Siwei;  Song, Li;  Gao, Wen
收藏  |  浏览/下载:15/0  |  提交时间:2023/12/04
Image coding  Context modeling  Transforms  Decoding  Standards  Image reconstruction  Transform coding  Image compression  cross-component  information-guided unit  attention mechanism  information-preserving  
STAM: A SpatioTemporal Attention Based Memory for Video Prediction 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 2354-2367
作者:  Chang, Zheng;  Zhang, Xinfeng;  Wang, Shanshe;  Ma, Siwei;  Gao, Wen
收藏  |  浏览/下载:14/0  |  提交时间:2023/12/04
Global spatiotemporal information  spatio temporal receptive field  3D convolutional neural network  spatiotemporal attention  sequence learning  video prediction  
Textural and Directional Information Based Offset In-Loop Filtering in AVS3 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 5957-5971
作者:  Zhang, Jiaqi;  Jian, Yunrui;  Wang, Suhong;  Jia, Chuanmin;  Wang, Shanshe;  Ma, Siwei;  Gao, Wen
收藏  |  浏览/下载:11/0  |  提交时间:2024/05/20
AVS3  in-loop filter  TDIO  textural and directional offset  
Scalable Intra Coding Optimization for Video Coding 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 10, 页码: 7092-7106
作者:  Zhang, Jiaqi;  Wang, Meng;  Jia, Chuanmin;  Wang, Shanshe;  Ma, Siwei;  Gao, Wen
收藏  |  浏览/下载:23/0  |  提交时间:2023/07/12
Encoding  Complexity theory  Optimization  Standards  Transforms  Electronic mail  Urban areas  AVS3  block partition  inherited information  intra coding optimization  
Learning to Fool the Speaker Recognition 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 卷号: 17, 期号: 3, 页码: 21
作者:  Li, Jiguo;  Zhang, Xinfeng;  Xu, Jizheng;  Ma, Siwei;  Gao, Wen
收藏  |  浏览/下载:25/0  |  提交时间:2022/12/07
Audio forensics  adversarial attack  deep neural network  
Predictive Generalized Graph Fourier Transform for Attribute Compression of Dynamic Point Clouds 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 卷号: 31, 期号: 5, 页码: 1968-1982
作者:  Xu, Yiqun;  Hu, Wei;  Wang, Shanshe;  Zhang, Xinfeng;  Wang, Shiqi;  Ma, Siwei;  Guo, Zongming;  Gao, Wen
收藏  |  浏览/下载:47/0  |  提交时间:2021/12/01
Dynamic point clouds  attribute coding  inter-coding  generalized graph Fourier transform  
Direct Speech-to-Image Translation 期刊论文
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 卷号: 14, 期号: 3, 页码: 517-529
作者:  Li, Jiguo;  Zhang, Xinfeng;  Jia, Chuanmin;  Xu, Jizheng;  Zhang, Li;  Wang, Yue;  Ma, Siwei;  Gao, Wen
收藏  |  浏览/下载:62/0  |  提交时间:2020/12/10
Correlation  Visualization  Feature extraction  Generative adversarial networks  Task analysis  Gallium nitride  Face  Speech-to-image translation  cross-modal generation  generative adversarial network  teacher-student learning