NLPExplorer
Rongjie Huang
Number of Papers:- 18
Number of Citations:- 0
First ACL Paper:- 2023
Latest ACL Paper:- 2024
Venues:-
Findings-ACL
EMNLP
NAACL
ACL
Co-Authors:-
Aoxiong Yin
Bai Jionghao
Baoxing Huai
Changhao Pan
Chao Weng
Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners
ACL
Rongjie Huang |
Chunlei Zhang |
Yongqi Wang |
Dongchao Yang |
Jinchuan Tian |
Zhenhui Ye |
Luping Liu |
Zehan Wang |
Ziyue Jiang |
Xuankai Chang |
Jiatong Shi |
Chao Weng |
Zhou Zhao |
Dong Yu |
Robust Singing Voice Transcription Serves Synthesis
ACL
Ruiqi Li |
Yu Zhang |
Yongqi Wang |
Zhiqing Hong |
Rongjie Huang |
Zhou Zhao |
Wav2SQL: Direct Generalizable Speech-To-SQL Parsing
Findings-ACL
Huadai Liu |
Rongjie Huang |
Jinzheng He |
Gang Sun |
Ran Shen |
Xize Cheng |
Zhou Zhao |
Text-to-Song: Towards Controllable Music Generation Incorporating Vocal and Accompaniment
ACL
Zhiqing Hong |
Rongjie Huang |
Xize Cheng |
Yongqi Wang |
Ruiqi Li |
Fuming You |
Zhou Zhao |
Zhimeng Zhang |
Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt
NAACL
Yongqi Wang |
Ruofan Hu |
Rongjie Huang |
Zhiqing Hong |
Ruiqi Li |
Wenrui Liu |
Fuming You |
Tao Jin |
Zhou Zhao |
Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer
ACL
Yongqi Wang |
Bai Jionghao |
Rongjie Huang |
Ruiqi Li |
Zhiqing Hong |
Zhou Zhao |
Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion
Findings-ACL
Ruiqi Li |
Rongjie Huang |
Yongqi Wang |
Zhiqing Hong |
Zhou Zhao |
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation
Findings-ACL
Xize Cheng |
Rongjie Huang |
Linjun Li |
Zehan Wang |
Tao Jin |
Aoxiong Yin |
Chen Feiyang |
Xinyu Duan |
Baoxing Huai |
Zhou Zhao |
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control
EMNLP
Yu Zhang |
Ziyue Jiang |
Ruiqi Li |
Changhao Pan |
Jinzheng He |
Rongjie Huang |
Chuxin Wang |
Zhou Zhao |
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Findings-ACL
Jinzheng He |
Jinglin Liu |
Zhenhui Ye |
Rongjie Huang |
Chenye Cui |
Huadai Liu |
Zhou Zhao |
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
EMNLP
Huadai Liu |
Rongjie Huang |
Xuan Lin |
Wenqiang Xu |
Maozong Zheng |
Hong Chen |
Jinzheng He |
Zhou Zhao |
Contrastive Token-Wise Meta-Learning for Unseen Performer Visual Temporal-Aligned Translation
Findings-ACL
Linjun Li |
Tao Jin |
Xize Cheng |
Ye Wang |
Wang Lin |
Rongjie Huang |
Zhou Zhao |
AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment
Findings-ACL
Ruiqi Li |
Rongjie Huang |
Lichao Zhang |
Jinglin Liu |
Zhou Zhao |
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models
Findings-ACL
Ziyue Jiang |
Qian Yang |
Jialong Zuo |
Zhenhui Ye |
Rongjie Huang |
Yi Ren |
Zhou Zhao |
FastDiff 2: Revisiting and Incorporating GANs and Diffusion Models in High-Fidelity Speech Synthesis
Findings-ACL
Rongjie Huang |
Yi Ren |
Ziyue Jiang |
Chenye Cui |
Jinglin Liu |
Zhou Zhao |
Prosody-TTS: Improving Prosody with Masked Autoencoder and Conditional Diffusion Model For Expressive Text-to-Speech
Findings-ACL
Rongjie Huang |
Chunlei Zhang |
Yi Ren |
Zhou Zhao |
Dong Yu |
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training
ACL
Zhenhui Ye |
Rongjie Huang |
Yi Ren |
Ziyue Jiang |
Jinglin Liu |
Jinzheng He |
Xiang Yin |
Zhou Zhao |
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation
ACL
Rongjie Huang |
Huadai Liu |
Xize Cheng |
Yi Ren |
Linjun Li |
Zhenhui Ye |
Jinzheng He |
Lichao Zhang |
Jinglin Liu |
Xiang Yin |
Zhou Zhao |