NLPExplorer
Zhiheng Xi
Number of Papers:- 11
Number of Citations:- 0
First ACL Paper:- 2022
Latest ACL Paper:- 2024
Venues:-
EMNLP
ACL
Findings-EMNLP
Findings-ACL
Findings-NAACL
Co-Authors:-
Binghai Wang
Caishuang Huang
Dong Yan
Enyu Zhou
Han Xia
Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models
Findings-NAACL
Wei He | Shichun Liu | Jun Zhao | Yiwen Ding | Yi Lu | Zhiheng Xi | Tao Gui | Qi Zhang | Xuanjing Huang
StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback
ACL
Shihan Dou | Yan Liu | Haoxiang Jia | Enyu Zhou | Limao Xiong | Junjie Shan | Caishuang Huang | Xiao Wang | Xiaoran Fan | Zhiheng Xi | Yuhao Zhou | Tao Ji | Rui Zheng | Qi Zhang | Tao Gui | Xuanjing Huang
LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin
ACL
Shihan Dou | Enyu Zhou | Yan Liu | Songyang Gao | Wei Shen | Limao Xiong | Yuhao Zhou | Xiao Wang | Zhiheng Xi | Xiaoran Fan | Shiliang Pu | Jiang Zhu | Rui Zheng | Tao Gui | Qi Zhang | Xuanjing Huang
Reward Modeling Requires Automatic Adjustment Based on Data Quality
Findings-EMNLP
Binghai Wang | Rui Zheng | Lu Chen | Zhiheng Xi | Wei Shen | Yuhao Zhou | Dong Yan | Tao Gui | Qi Zhang | Xuanjing Huang
Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning
EMNLP
Lu Chen | Rui Zheng | Binghai Wang | Senjie Jin | Caishuang Huang | Junjie Ye | Zhihao Zhang | Yuhao Zhou | Zhiheng Xi | Tao Gui | Qi Zhang | Xuanjing Huang
Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data
Findings-EMNLP
Han Xia | Songyang Gao | Qiming Ge | Zhiheng Xi | Qi Zhang | Xuanjing Huang
Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement
Findings-EMNLP
Zhiheng Xi | Senjie Jin | Yuhao Zhou | Rui Zheng | Songyang Gao | Jia Liu | Tao Gui | Qi Zhang | Xuanjing Huang
RealBehavior: A Framework for Faithfully Characterizing Foundation Models’ Human-like Behavior Mechanisms
Findings-EMNLP
Enyu Zhou | Rui Zheng | Zhiheng Xi | Songyang Gao | Xiaoran Fan | Zichu Fei | Jingting Ye | Tao Gui | Qi Zhang | Xuanjing Huang
Characterizing the Impacts of Instances on Robustness
Findings-ACL
Rui Zheng | Zhiheng Xi | Qin Liu | Wenbin Lai | Tao Gui | Qi Zhang | Xuanjing Huang | Jin Ma | Ying Shan | Weifeng Ge
Connectivity Patterns are Task Embeddings
Findings-ACL
Zhiheng Xi | Rui Zheng | Yuansen Zhang | Xuanjing Huang | Zhongyu Wei | Minlong Peng | Mingming Sun | Qi Zhang | Tao Gui
Efficient Adversarial Training with Robust Early-Bird Tickets
EMNLP
Zhiheng Xi | Rui Zheng | Tao Gui | Qi Zhang | Xuanjing Huang