NLPExplorer
Zhou Zhao
Number of Papers:- 42
Number of Citations:- 2
First ACL Paper:- 2017
Latest ACL Paper:- 2024
Venues:-
EMNLP
NAACL
ACL
Findings
Findings-ACL
Co-Authors:-
Aoxiong Yin
Bai Jionghao
Baoxing Huai
Baoyi He
Boyuan Pan
Similar Authors:-
Abu Bakr Soliman
Isar Nejadgholi
Isuru Gunasekara
Aritz Bilbao Jayo
Aitor Almeida
2024
2023
2022
2020
2019
2018
2017
Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners
ACL
Rongjie Huang |
Chunlei Zhang |
Yongqi Wang |
Dongchao Yang |
Jinchuan Tian |
Zhenhui Ye |
Luping Liu |
Zehan Wang |
Ziyue Jiang |
Xuankai Chang |
Jiatong Shi |
Chao Weng |
Zhou Zhao |
Dong Yu |
Robust Singing Voice Transcription Serves Synthesis
ACL
Ruiqi Li |
Yu Zhang |
Yongqi Wang |
Zhiqing Hong |
Rongjie Huang |
Zhou Zhao |
Uni-Dubbing: Zero-Shot Speech Synthesis from Visual Articulation
ACL
Songju Lei |
Xize Cheng |
Mengjiao Lyu |
Jianqiao Hu |
Jintao Tan |
Runlin Liu |
Lingyu Xiong |
Tao Jin |
Xiandong Li |
Zhou Zhao |
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech
ACL
Shengpeng Ji |
Ziyue Jiang |
Hanting Wang |
Jialong Zuo |
Zhou Zhao |
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension
ACL
Qian Yang |
Jin Xu |
Wenrui Liu |
Yunfei Chu |
Ziyue Jiang |
Xiaohuan Zhou |
Yichong Leng |
Yuanjun Lv |
Zhou Zhao |
Chang Zhou |
Jingren Zhou |
Wav2SQL: Direct Generalizable Speech-To-SQL Parsing
Findings-ACL
Huadai Liu |
Rongjie Huang |
Jinzheng He |
Gang Sun |
Ran Shen |
Xize Cheng |
Zhou Zhao |
Text-to-Song: Towards Controllable Music Generation Incorporating Vocal and Accompaniment
ACL
Zhiqing Hong |
Rongjie Huang |
Xize Cheng |
Yongqi Wang |
Ruiqi Li |
Fuming You |
Zhou Zhao |
Zhimeng Zhang |
Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt
NAACL
Yongqi Wang |
Ruofan Hu |
Rongjie Huang |
Zhiqing Hong |
Ruiqi Li |
Wenrui Liu |
Fuming You |
Tao Jin |
Zhou Zhao |
Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer
ACL
Yongqi Wang |
Bai Jionghao |
Rongjie Huang |
Ruiqi Li |
Zhiqing Hong |
Zhou Zhao |
Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion
Findings-ACL
Ruiqi Li |
Rongjie Huang |
Yongqi Wang |
Zhiqing Hong |
Zhou Zhao |
Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition
ACL
Zirun Guo |
Tao Jin |
Zhou Zhao |
Rethinking the Multimodal Correlation of Multimodal Sequential Learning via Generalizable Attentional Results Alignment
ACL
Tao Jin |
Wang Lin |
Ye Wang |
Linjun Li |
Xize Cheng |
Zhou Zhao |
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation
Findings-ACL
Xize Cheng |
Rongjie Huang |
Linjun Li |
Zehan Wang |
Tao Jin |
Aoxiong Yin |
Chen Feiyang |
Xinyu Duan |
Baoxing Huai |
Zhou Zhao |
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control
EMNLP
Yu Zhang |
Ziyue Jiang |
Ruiqi Li |
Changhao Pan |
Jinzheng He |
Rongjie Huang |
Chuxin Wang |
Zhou Zhao |
ART: rule bAsed futuRe-inference deducTion
EMNLP
Mengze Li |
Tianqi Zhao |
Bai Jionghao |
Baoyi He |
Jiaxu Miao |
Wei Ji |
Zheqi Lv |
Zhou Zhao |
Shengyu Zhang |
Wenqiao Zhang |
Fei Wu |
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Findings-ACL
Jinzheng He |
Jinglin Liu |
Zhenhui Ye |
Rongjie Huang |
Chenye Cui |
Huadai Liu |
Zhou Zhao |
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
EMNLP
Huadai Liu |
Rongjie Huang |
Xuan Lin |
Wenqiang Xu |
Maozong Zheng |
Hong Chen |
Jinzheng He |
Zhou Zhao |
Scene-robust Natural Language Video Localization via Learning Domain-invariant Representations
Findings-ACL
Zehan Wang |
Yang Zhao |
Haifeng Huang |
Yan Xia |
Zhou Zhao |
OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment
ACL
Xize Cheng |
Tao Jin |
Linjun Li |
Wang Lin |
Xinyu Duan |
Zhou Zhao |
Multi-modal Action Chain Abductive Reasoning
ACL
Mengze Li |
Tianbao Wang |
Jiahe Xu |
Kairong Han |
Shengyu Zhang |
Zhou Zhao |
Jiaxu Miao |
Wenqiao Zhang |
Shiliang Pu |
Fei Wu |
Contrastive Token-Wise Meta-Learning for Unseen Performer Visual Temporal-Aligned Translation
Findings-ACL
Linjun Li |
Tao Jin |
Xize Cheng |
Ye Wang |
Wang Lin |
Rongjie Huang |
Zhou Zhao |
3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding
EMNLP
Zehan Wang |
Haifeng Huang |
Yang Zhao |
Linjun Li |
Xize Cheng |
Yichen Zhu |
Aoxiong Yin |
Zhou Zhao |
Semantic-conditioned Dual Adaptation for Cross-domain Query-based Visual Segmentation
Findings-ACL
Ye Wang |
Tao Jin |
Wang Lin |
Xize Cheng |
Linjun Li |
Zhou Zhao |
AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment
Findings-ACL
Ruiqi Li |
Rongjie Huang |
Lichao Zhang |
Jinglin Liu |
Zhou Zhao |
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models
Findings-ACL
Ziyue Jiang |
Qian Yang |
Jialong Zuo |
Zhenhui Ye |
Rongjie Huang |
Yi Ren |
Zhou Zhao |
DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect
Findings-ACL
Jinglin Liu |
Zhenhui Ye |
Qian Chen |
Siqi Zheng |
Wen Wang |
Zhang Qinglin |
Zhou Zhao |
FastDiff 2: Revisiting and Incorporating GANs and Diffusion Models in High-Fidelity Speech Synthesis
Findings-ACL
Rongjie Huang |
Yi Ren |
Ziyue Jiang |
Chenye Cui |
Jinglin Liu |
Zhou Zhao |
Prosody-TTS: Improving Prosody with Masked Autoencoder and Conditional Diffusion Model For Expressive Text-to-Speech
Findings-ACL
Rongjie Huang |
Chunlei Zhang |
Yi Ren |
Zhou Zhao |
Dong Yu |
Weakly-Supervised Spoken Video Grounding via Semantic Interaction Learning
ACL
Ye Wang |
Wang Lin |
Shengyu Zhang |
Tao Jin |
Linjun Li |
Xize Cheng |
Zhou Zhao |
TAVT: Towards Transferable Audio-Visual Text Generation
ACL
Wang Lin |
Tao Jin |
Wenwen Pan |
Linjun Li |
Xize Cheng |
Ye Wang |
Zhou Zhao |
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training
ACL
Zhenhui Ye |
Rongjie Huang |
Yi Ren |
Ziyue Jiang |
Jinglin Liu |
Jinzheng He |
Xiang Yin |
Zhou Zhao |
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation
ACL
Rongjie Huang |
Huadai Liu |
Xize Cheng |
Yi Ren |
Linjun Li |
Zhenhui Ye |
Jinzheng He |
Lichao Zhang |
Jinglin Liu |
Xiang Yin |
Zhou Zhao |
Revisiting Over-Smoothness in Text to Speech
ACL
Yi Ren |
Xu Tan |
Tao Qin |
Zhou Zhao |
Tie-Yan Liu |
Prior Knowledge and Memory Enriched Transformer for Sign Language Translation
ACL
Findings
Tao Jin |
Zhou Zhao |
Meng Zhang |
Xingshan Zeng |
Learning the Beauty in Songs: Neural Singing Voice Beautifier
ACL
Jinglin Liu |
Chengxi Li |
Yi Ren |
Zhiying Zhu |
Zhou Zhao |
End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding
ACL
Mengze Li |
Tianbao Wang |
Haoyu Zhang |
Shengyu Zhang |
Zhou Zhao |
Jiaxu Miao |
Wenqiao Zhang |
Wenming Tan |
Jin Wang |
Peng Wang |
Shiliang Pu |
Fei Wu |
SimulSpeech: End-to-End Simultaneous Speech to Text Translation
ACL
Yi Ren |
Jinglin Liu |
Xu Tan |
Chen Zhang |
Tao Qin |
Zhou Zhao |
Tie-Yan Liu |
A Study of Non-autoregressive Model for Sequence Generation
ACL
Yi Ren |
Jinglin Liu |
Xu Tan |
Zhou Zhao |
Sheng Zhao |
Tie-Yan Liu |
Video Dialog via Progressive Inference and Cross-Transformer
EMNLP
Weike Jin |
Zhou Zhao |
Mao Gu |
Jun Xiao |
Furu Wei |
Yueting Zhuang |
Investigating Capsule Networks with Dynamic Routing for Text Classification
EMNLP
Min Yang |
Wei Zhao |
Jianbo Ye |
Zeyang Lei |
Zhou Zhao |
Soufei Zhang |
Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference
ACL
Boyuan Pan |
Yazheng Yang |
Zhou Zhao |
Yueting Zhuang |
Deng Cai |
Xiaofei He |
Identifying and Tracking Sentiments and Topics from Social Media Texts during Natural Disasters
EMNLP
Min Yang |
Jincheng Mei |
Heng Ji |
Wei Zhao |
Zhou Zhao |
Xiaojun Chen |