NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Florian Strub
Number of Papers:- 5
Number of Citations:- 0
First ACL Paper:- 2020
Latest ACL Paper:- 2024
Venues:-
s
EMNLP
i
d
NAACL
-
A
L
ViGIL
C
F
n
g
Co-Authors:-
Aaron Courville
Abhishek Das
Alane Suhr
Alice Martin
Arash Ahmadian
Similar Authors:-
2024
2022
2021
2020
Countering Reward Over-Optimization in LLM with Demonstration-Guided Reinforcement Learning
F
i
n
d
i
n
g
s
-
A
C
L
Mathieu Rita |
Florian Strub |
Rahma Chaabouni |
Paul Michel |
Emmanuel Dupoux |
Olivier Pietquin |
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion
EMNLP
Yannis Flet-Berliac |
Nathan Grinsztajn |
Florian Strub |
Eugene Choi |
Bill Wu |
Chris Cremer |
Arash Ahmadian |
Yash Chandak |
Mohammad Gheshlaghi Azar |
Olivier Pietquin |
Matthieu Geist |
Learning Natural Language Generation with Truncated Reinforcement Learning
NAACL
Alice Martin |
Guillaume Quispe |
Charles Ollion |
Sylvain Le Corff |
Florian Strub |
Olivier Pietquin |
Proceedings of the Fourth Workshop on Visually Grounded Interaction and Language
NAACL
ViGIL
Cătălina Cangea |
Abhishek Das |
Drew Hudson |
Jacob Krantz |
Stefan Lee |
Jiayuan Mao |
Florian Strub |
Alane Suhr |
Erik Wijmans |
Supervised Seeded Iterated Learning for Interactive Language Learning
EMNLP
Yuchen Lu |
Soumye Singhal |
Florian Strub |
Olivier Pietquin |
Aaron Courville |
.