CSpace

浏览/检索结果: 共272条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Multi-feature deep supervised voiceprint adversarial network for depression recognition from speech 期刊论文
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 卷号: 89, 页码: 15
作者:  Pan, Yuchen;  Shang, Yuanyuan;  Wang, Wei;  Shao, Zhuhong;  Han, Zhuojin;  Liu, Tie;  Guo, Guodong;  Ding, Hui
收藏  |  浏览/下载:8/0  |  提交时间:2024/05/20
Adversarial learning  Audio processing  Attention mechanism  Deep neural network  Depression recognition  Feature enhancement  
Overview of the Tenth Dialog System Technology Challenge: DSTC10 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 卷号: 32, 页码: 765-778
作者:  Yoshino, Koichiro;  Chen, Yun-Nung;  Crook, Paul;  Kottur, Satwik;  Li, Jinchao;  Hedayatnia, Behnam;  Moon, Seungwhan;  Fei, Zhengcong;  Li, Zekang;  Zhang, Jinchao;  Feng, Yang;  Zhou, Jie;  Kim, Seokhwan;  Liu, Yang;  Jin, Di;  Papangelis, Alexandros;  Gopalakrishnan, Karthik;  Hakkani-Tur, Dilek;  Damavandi, Babak;  Geramifard, Alborz;  Hori, Chiori;  Shah, Ankit;  Zhang, Chen;  Li, Haizhou;  Sedoc, Joao;  D'Haro, Luis F.;  Banchs, Rafael;  Rudnicky, Alexander
收藏  |  浏览/下载:20/0  |  提交时间:2024/05/20
Task analysis  Internet  History  Oral communication  Measurement  Context modeling  Visualization  Dialog systems  natural language processing  speech processing  multimodal sensors  
Context-Aware Proposal-Boundary Network With Structural Consistency for Audiovisual Event Localization 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 11
作者:  Wang, Hao;  Zha, Zheng-Jun;  Li, Liang;  Chen, Xuejin;  Luo, Jiebo
收藏  |  浏览/下载:12/0  |  提交时间:2023/12/04
Audiovisual learning  context learning  event localization  
Semantic and Relation Modulation for Audio-Visual Event Localization 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 6, 页码: 7711-7725
作者:  Wang, Hao;  Zha, Zheng-Jun;  Li, Liang;  Chen, Xuejin;  Luo, Jiebo
收藏  |  浏览/下载:11/0  |  提交时间:2023/12/04
Visualization  Location awareness  Correlation  Proposals  Semantics  Task analysis  Modulation  Audio-visual learning  event localization  normalization  
Scalable Intra Coding Optimization for Video Coding 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 10, 页码: 7092-7106
作者:  Zhang, Jiaqi;  Wang, Meng;  Jia, Chuanmin;  Wang, Shanshe;  Ma, Siwei;  Gao, Wen
收藏  |  浏览/下载:18/0  |  提交时间:2023/07/12
Encoding  Complexity theory  Optimization  Standards  Transforms  Electronic mail  Urban areas  AVS3  block partition  inherited information  intra coding optimization  
A Fast Precision Tuning Solution for Always-On DNN Accelerators 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 卷号: 41, 期号: 5, 页码: 1236-1248
作者:  Wang, Ying;  He, Yintao;  Cheng, Long;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:30/0  |  提交时间:2022/12/07
Computer architecture  Neural networks  Computational modeling  Approximate computing  Tuning  Switches  Microprocessors  Always-on  CNN  computing-in-memory (CiM)  resistive RAM  
Distribution Distance Regularized Sequence Representation for Text Matching in Asymmetrical Domains 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 721-733
作者:  Yu, Weijie;  Xu, Chen;  Xu, Jun;  Pang, Liang;  Wen, Ji-Rong
收藏  |  浏览/下载:26/0  |  提交时间:2022/12/07
Semantics  Neural networks  Training  Task analysis  Measurement  Speech processing  Electronic mail  Text matching  sequence representation  natural language processing  
Learning to Fool the Speaker Recognition 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 卷号: 17, 期号: 3, 页码: 21
作者:  Li, Jiguo;  Zhang, Xinfeng;  Xu, Jizheng;  Ma, Siwei;  Gao, Wen
收藏  |  浏览/下载:22/0  |  提交时间:2022/12/07
Audio forensics  adversarial attack  deep neural network  
Happy Emotion Recognition From Unconstrained Videos Using 3D Hybrid Deep Features 期刊论文
IEEE ACCESS, 2021, 卷号: 9, 页码: 35524-35538
作者:  Samadiani, Najmeh;  Huang, Guangyan;  Hu, Yu;  Li, Xiaowei
收藏  |  浏览/下载:35/0  |  提交时间:2021/12/01
Feature extraction  Emotion recognition  Face recognition  Videos  Three-dimensional displays  Long short term memory  Visualization  Facial landmarks  facial expression recognition  long short term memory  multi-layer neural networks  happy emotion recognition  
Bridging Text and Video: A Universal Multimodal Transformer for Audio-Visual Scene-Aware Dialog 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 卷号: 29, 页码: 2476-2483
作者:  Li, Zekang;  Li, Zongjia;  Zhang, Jinchao;  Feng, Yang;  Zhou, Jie
收藏  |  浏览/下载:46/0  |  提交时间:2021/12/01
Task analysis  Feature extraction  Visualization  Speech processing  History  Social networking (online)  Pattern recognition  Dialogue System  Multimodal  Natural Language Processing  Video Understanding