×
验证码:
换一张
忘记密码?
记住我
CORC
首页
科研机构
检索
知识图谱
申请加入
托管服务
登录
注册
在结果中检索
科研机构
自动化研究所 [98]
内容类型
会议论文 [75]
期刊论文 [23]
发表日期
2024 [1]
2022 [3]
2021 [6]
2020 [7]
2019 [10]
2018 [15]
更多...
×
知识图谱
CORC
开始提交
已提交作品
待认领作品
已认领作品
未提交全文
收藏管理
QQ客服
官方微博
反馈留言
浏览/检索结果:
共98条,第1-10条
帮助
限定条件
专题:自动化研究所
第一署名单位
第一作者单位
通讯作者单位
已选(
0
)
清除
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
作者升序
作者降序
题名升序
题名降序
发表日期升序
发表日期降序
提交时间升序
提交时间降序
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis
期刊论文
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 9
作者:
Yi, Guofeng
;
Fan, Cunhang
;
Zhu, Kang
;
Lv, Zhao
;
Liang, Shan
收藏
  |  
浏览/下载:0/0
  |  
提交时间:2024/02/22
Multimodal sentiment analysis
Vision-language
Multimodal fusion
Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition
期刊论文
IEEE SIGNAL PROCESSING LETTERS, 2022, 页码: 762-766
作者:
Zhengkun Tian
;
Jiangyan Yi
;
Jianhua Tao
;
Shuai Zhang
;
Zhengqi Wen
收藏
  |  
浏览/下载:11/0
  |  
提交时间:2022/06/14
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:
Wang, Tao
;
Yi, Jiangyan
;
Fu, Ruibo
;
Tao, Jianhua
;
Wen, Zhengqi
收藏
  |  
浏览/下载:36/0
  |  
提交时间:2022/09/19
Speech processing
Decoding
Predictive models
Acoustics
Transfer learning
Training
Task analysis
Coarse-to-fine decoding
mask prediction
one-shot learning
text-based speech editing
text-to-speech
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 865-878
作者:
Wang, Tao
;
Fu, Ruibo
;
Yi, Jiangyan
;
Tao, Jianhua
;
Wen, Zhengqi
收藏
  |  
浏览/下载:12/0
  |  
提交时间:2022/06/06
Vocoders
Stochastic processes
Neural networks
Speech processing
Signal to noise ratio
Acoustics
Speech enhancement
Vocoder
speech synthesis
deterministic plus stochastic
multiband excitation
noise control
One In A Hundred: Selecting the Best Predicted Sequence from Numerous Candidates for Speech Recognition
会议论文
Tokyo, Japan, 14-17 December 2021
作者:
Zhengkun Tian
;
Jiangyan Yi
;
Ye Bai
;
Jianhua Tao
;
Shuai Zhang
收藏
  |  
浏览/下载:8/0
  |  
提交时间:2022/06/14
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization
会议论文
Brno, Czechia, 30 August – 3 September
作者:
Zhengkun Tian
;
Jiangyan Yi
;
Ye Bai
;
Jianhua Tao
;
Shuai Zhang
收藏
  |  
浏览/下载:5/0
  |  
提交时间:2022/06/14
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 198-209
作者:
Fan, Cunhang
;
Yi, Jiangyan
;
Tao, Jianhua
;
Tian, Zhengkun
;
Liu, Bin
收藏
  |  
浏览/下载:27/0
  |  
提交时间:2021/03/08
Speech enhancement
Speech recognition
Training
Noise measurement
Logic gates
Acoustic distortion
Task analysis
Gated recurrent fusion
robust end-to-end speech recognition
speech distortion
speech enhancement
speech transformer
Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning
会议论文
Hong Kong, 24-27 Jan. 2021
作者:
Fan, Cunhang
;
Liu, Bin
;
Tao, Jianhua
;
Yi, Jiangyan
;
Wen, Zhengqi
收藏
  |  
浏览/下载:13/0
  |  
提交时间:2021/06/01
Integrating Knowledge Into End-to-End Speech Recognition From External Text-Only Data
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 卷号: 29, 页码: 1340-1351
作者:
Bai, Ye
;
Yi, Jiangyan
;
Tao, Jianhua
;
Wen, Zhengqi
;
Tian, Zhengkun
收藏
  |  
浏览/下载:16/0
  |  
提交时间:2021/06/07
End-to-End
language modeling
speech recognition
teacher-student learning
transfer learning
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, 期号: 29, 页码: 1897 - 1911
作者:
Ye Bai
;
Jiangyan Yi
;
Jianhua Tao
;
Zhengkun Tian
;
Zhengqi Wen
收藏
  |  
浏览/下载:23/0
  |  
提交时间:2021/06/25
端到端语音识别、迁移学习、知识蒸馏、老师-学生学习、BERT、非自回归语音识别
©版权所有 ©2017 CSpace - Powered by
CSpace