NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Olivier Pietquin
Number of Papers:- 19
Number of Citations:- 4
First ACL Paper:- 2005
Latest ACL Paper:- 2024
Venues:-
TACL
s
EMNLP
i
d
NAACL
-
A
SIGDIAL
WS
L
IJCNLP
ACL
JEP/TALN/RECITAL
C
LREC
WMT
F
n
g
Co-Authors:-
Aaron Courville
Ahmet Ustun
Alexandre Berard
Alice Martin
Arash Ahmadian
Similar Authors:-
Miguel Matos
Jean Francois Rey
Mohamed Sehili
Thierry Joubert
Antonio Serralheiro
2024
2023
2022
2020
2017
2016
2015
2014
2013
2012
2011
2010
2005
Countering Reward Over-Optimization in LLM with Demonstration-Guided Reinforcement Learning
F
i
n
d
i
n
g
s
-
A
C
L
Mathieu Rita |
Florian Strub |
Rahma Chaabouni |
Paul Michel |
Emmanuel Dupoux |
Olivier Pietquin |
Back to Basics: Revisiting REINFORCE-Style Optimization for Learning from Human Feedback in LLMs
ACL
Arash Ahmadian |
Chris Cremer |
Matthias Gallé |
Marzieh Fadaee |
Julia Kreutzer |
Olivier Pietquin |
Ahmet Üstün |
Sara Hooker |
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion
EMNLP
Yannis Flet-Berliac |
Nathan Grinsztajn |
Florian Strub |
Eugene Choi |
Bill Wu |
Chris Cremer |
Arash Ahmadian |
Yash Chandak |
Mohammad Gheshlaghi Azar |
Olivier Pietquin |
Matthieu Geist |
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback
ACL
Paul Roit |
Johan Ferret |
Lior Shani |
Roee Aharoni |
Geoffrey Cideron |
Robert Dadashi |
Matthieu Geist |
Sertan Girgin |
Leonard Hussenot |
Orgad Keller |
Nikola Momchev |
Sabela Ramos Garea |
Piotr Stanczyk |
Nino Vieillard |
Olivier Bachem |
Gal Elidan |
Avinatan Hassidim |
Olivier Pietquin |
Idan Szpektor |
Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision
TACL
Eugene Kharitonov |
Damien Vincent |
Zalán Borsos |
Raphaël Marinier |
Sertan Girgin |
Olivier Pietquin |
Matt Sharifi |
Marco Tagliasacchi |
Neil Zeghidour |
Learning Natural Language Generation with Truncated Reinforcement Learning
NAACL
Alice Martin |
Guillaume Quispe |
Charles Ollion |
Sylvain Le Corff |
Florian Strub |
Olivier Pietquin |
Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue
SIGDIAL
Olivier Pietquin |
Smaranda Muresan |
Vivian Chen |
Casey Kennington |
David Vandyke |
Nina Dethlefs |
Koji Inoue |
Erik Ekstedt |
Stefan Ultes |
Supervised Seeded Iterated Learning for Interactive Language Learning
EMNLP
Yuchen Lu |
Soumye Singhal |
Florian Strub |
Olivier Pietquin |
Aaron Courville |
LIG-CRIStAL Submission for the WMT 2017 Automatic Post-Editing Task
WMT
WS
Alexandre Bérard |
Laurent Besacier |
Olivier Pietquin |
MultiVec: a Multilingual and Multilevel Representation Learning Toolkit for NLP
LREC
Alexandre Bérard |
Christophe Servan |
Olivier Pietquin |
Laurent Besacier |
Human-Machine Dialogue as a Stochastic Game
SIGDIAL
WS
Merwan Barlier |
Julien Perolat |
Romain Laroche |
Olivier Pietquin |
NASTIA: Negotiating Appointment Setting Interface
LREC
Layla El Asri |
Rémi Lemonnier |
Romain Laroche |
Olivier Pietquin |
Hatim Khouzaimi |
DINASTI: Dialogues with a Negotiating Appointment Setting Interface
LREC
Layla El Asri |
Romain Laroche |
Olivier Pietquin |
Model-free POMDP optimisation of tutoring systems with echo-state networks
SIGDIAL
WS
Lucie Daubigney |
Matthieu Geist |
Olivier Pietquin |
Optimisation d’un tuteur intelligent à partir d’un jeu de données fixé (Optimization of a tutoring system from a fixed set of data) [in French]
JEP/TALN/RECITAL
Lucie Daubigney |
Matthieu Geist |
Olivier Pietquin |
Statistical User Simulation for Spoken Dialogue Systems: What for, Which Data, Which Future?
NAACL
WS
Olivier Pietquin |
Training a BN-based user model for dialogue simulation with missing data
IJCNLP
Stéphane Rossignol |
Olivier Pietquin |
Michel Ianotto |
Sparse Approximate Dynamic Programming for Dialog Management
SIGDIAL
WS
Senthilkumar Chandramohan |
Matthieu Geist |
Olivier Pietquin |
Réseau bayesien pour un modèle d’utilisateur et un module de compréhension pour l’optimisation des systèmes de dialogues
JEP/TALN/RECITAL
Olivier Pietquin |
Linguistic
Task
Approach
Language
Dataset Type
.