NLPExplorer
Yuhao Zhou
Number of Papers:- 7
Number of Citations:- 0
First ACL Paper:- 2022
Latest ACL Paper:- 2024
Venues:-
ACL
EMNLP
Findings-ACL
Findings-EMNLP
Co-Authors:-
Bao Rong
Binghai Wang
Caishuang Huang
Di Liang
Dong Yan
Similar Authors:-
2024
2023
2022
StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback
ACL
Shihan Dou | Yan Liu | Haoxiang Jia | Enyu Zhou | Limao Xiong | Junjie Shan | Caishuang Huang | Xiao Wang | Xiaoran Fan | Zhiheng Xi | Yuhao Zhou | Tao Ji | Rui Zheng | Qi Zhang | Tao Gui | Xuanjing Huang
LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin
ACL
Shihan Dou | Enyu Zhou | Yan Liu | Songyang Gao | Wei Shen | Limao Xiong | Yuhao Zhou | Xiao Wang | Zhiheng Xi | Xiaoran Fan | Shiliang Pu | Jiang Zhu | Rui Zheng | Tao Gui | Qi Zhang | Xuanjing Huang
Reward Modeling Requires Automatic Adjustment Based on Data Quality
Findings-EMNLP
Binghai Wang | Rui Zheng | Lu Chen | Zhiheng Xi | Wei Shen | Yuhao Zhou | Dong Yan | Tao Gui | Qi Zhang | Xuanjing Huang
Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning
EMNLP
Lu Chen | Rui Zheng | Binghai Wang | Senjie Jin | Caishuang Huang | Junjie Ye | Zhihao Zhang | Yuhao Zhou | Zhiheng Xi | Tao Gui | Qi Zhang | Xuanjing Huang
Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement
Findings-EMNLP
Zhiheng Xi | Senjie Jin | Yuhao Zhou | Rui Zheng | Songyang Gao | Jia Liu | Tao Gui | Qi Zhang | Xuanjing Huang
Detecting Adversarial Samples through Sharpness of Loss Landscape
Findings-ACL
Rui Zheng | Shihan Dou | Yuhao Zhou | Qin Liu | Tao Gui | Qi Zhang | Zhongyu Wei | Xuanjing Huang | Menghan Zhang
Robust Lottery Tickets for Pre-trained Language Models
ACL
Rui Zheng | Bao Rong | Yuhao Zhou | Di Liang | Sirui Wang | Wei Wu | Tao Gui | Qi Zhang | Xuanjing Huang