×
验证码:
换一张
忘记密码?
记住我
CORC
首页
科研机构
检索
知识图谱
申请加入
托管服务
登录
注册
在结果中检索
科研机构
自动化研究所 [113]
清华大学 [66]
声学研究所 [16]
华南理工大学 [12]
北京大学 [10]
深圳先进技术研究院 [10]
更多...
内容类型
期刊论文 [114]
会议论文 [97]
学位论文 [50]
其他 [11]
会议 [4]
发表日期
2019 [2]
2018 [14]
2017 [9]
2016 [21]
2015 [10]
2014 [19]
更多...
学科主题
人工智能 [3]
×
知识图谱
CORC
开始提交
已提交作品
待认领作品
已认领作品
未提交全文
收藏管理
QQ客服
官方微博
反馈留言
浏览/检索结果:
共276条,第1-10条
帮助
已选(
0
)
清除
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
发表日期升序
发表日期降序
提交时间升序
提交时间降序
题名升序
题名降序
作者升序
作者降序
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
作者:
Yi, Jiangyan
;
Tao, Jianhua
;
Fu, Ruibo
;
Wang, Tao
;
Zhang, Chu Yuan
收藏
  |  
浏览/下载:1/0
  |  
提交时间:2023/11/17
Adversarial training
multi-task learning
prosodic boundaries
speech synthesis
multi-modal embeddings
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 865-878
作者:
Wang, Tao
;
Fu, Ruibo
;
Yi, Jiangyan
;
Tao, Jianhua
;
Wen, Zhengqi
收藏
  |  
浏览/下载:11/0
  |  
提交时间:2022/06/06
Vocoders
Stochastic processes
Neural networks
Speech processing
Signal to noise ratio
Acoustics
Speech enhancement
Vocoder
speech synthesis
deterministic plus stochastic
multiband excitation
noise control
F-0-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 卷号: 29, 页码: 3375-3383
作者:
Li, Yongwei
;
Tao, Jianhua
;
Erickson, Donna
;
Liu, Bin
;
Akagi, Masato
收藏
  |  
浏览/下载:16/0
  |  
提交时间:2021/12/28
Speech recognition
Iterative methods
Production
Estimation
Brain modeling
Shape
Low-frequency noise
Glottal source
vocal tract
source-filter model
ARX-LF model
Simultaneous Estimation of Glottal Source Waveforms and Vocal Tract Shapes from Speech Signals Based on ARX-LF Model
期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2020, 卷号: 92, 期号: 8, 页码: 831-838
作者:
Li, Yongwei
;
Sakakibara, Ken-Ichi
;
Akagi, Masato
收藏
  |  
浏览/下载:7/0
  |  
提交时间:2020/08/03
Glottal source waveform
Vocal tract shape
ARX-LF model
FOCUSING ON ATTENTION: PROSODY TRANSFER AND ADAPTATIVE OPTIMIZATION STRATEGY FOR MULTI-SPEAKER END-TO-END SPEECH SYNTHESIS
会议论文
网上虚拟会议, 2020-5
作者:
Fu, Ruibo
;
Tao, Jianhua
;
Wen, Zhengqi
;
Yi, Jiangyan
;
Wang, Tao
收藏
  |  
浏览/下载:16/0
  |  
提交时间:2020/06/27
prosody transfer
optimization strategy
speaker adaptation
attention
speech synthesis
Phoneme dependent speaker embedding and model factorization for multi-speaker speech synthesis and adaptation
会议论文
Brighton,UK, MAY 12-17,2019
作者:
Fu, Ruibo
;
Tao, Jianhua
;
Wen, Zhengqi
;
Zheng, Yibin
收藏
  |  
浏览/下载:7/0
  |  
提交时间:2020/06/24
speech synthesis
speaker adaptation
speaker embedding
phoneme representation
Boosting Character-Based Chinese Speech Synthesis via Multi-Task Learning and Dictionary Tutoring
会议论文
奥地利, 2019.9.15-2019.9.19
作者:
Zou, Yuxiang
;
Dong, Linhao
;
Xu, Bo
收藏
  |  
浏览/下载:5/0
  |  
提交时间:2020/06/10
On The Application and Compression of Deep Time Delay Neural Network for Embedded Statistical Parametric Speech Synthesis
会议论文
Hyderabad, 2-6 September 2018
作者:
Yibin Zheng
;
Jianhua Tao
;
Zhengqi Wen
;
Ruibo Fu
收藏
  |  
浏览/下载:18/0
  |  
提交时间:2019/05/02
Deep Metric Learning for the Target Cost in Unit-Selection Speech Synthesizer
会议论文
印度海得拉巴, 2018-9
作者:
Fu, Ruibo
;
Tao, Jianhua
;
Zheng, Yibin
;
Wen, Zhengqi
收藏
  |  
浏览/下载:11/0
  |  
提交时间:2020/06/27
speech synthesis
unit-selection
target cost
deep metric learning
Transfer Learning based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis
会议论文
印度海得拉巴, 2018-9
作者:
Fu, Ruibo
;
Tao, Jianhua
;
Zheng, Yibin
;
Wen, Zhengqi
收藏
  |  
浏览/下载:3/0
  |  
提交时间:2020/06/27
©版权所有 ©2017 CSpace - Powered by
CSpace