NLPExplorer

Olivier Pietquin

Number of Papers:- 19

Number of Citations:- 4

First ACL Paper:- 2005

Latest ACL Paper:- 2024

Venues:-

TACL

EMNLP

NAACL

SIGDIAL

WS

IJCNLP

ACL

JEP/TALN/RECITAL

LREC

WMT

Findings-ACL

Co-Authors:-

Aaron Courville

Alexandre Bérard

Similar Authors:-

Jean Francois Rey

Thierry Joubert

Antonio Serralheiro

Countering Reward Over-Optimization in LLM with Demonstration-Guided Reinforcement Learning Findings-ACL

Mathieu Rita | Florian Strub | Rahma Chaabouni | Paul Michel | Emmanuel Dupoux | Olivier Pietquin |

Back to Basics: Revisiting REINFORCE-Style Optimization for Learning from Human Feedback in LLMs ACL

Arash Ahmadian | Chris Cremer | Matthias Gallé | Marzieh Fadaee | Julia Kreutzer | Olivier Pietquin | Ahmet Üstün | Sara Hooker |

Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion EMNLP

Yannis Flet-Berliac | Nathan Grinsztajn | Florian Strub | Eugene Choi | Bill Wu | Chris Cremer | Arash Ahmadian | Yash Chandak | Mohammad Gheshlaghi Azar | Olivier Pietquin | Matthieu Geist |

Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback ACL

Paul Roit | Johan Ferret | Lior Shani | Roee Aharoni | Geoffrey Cideron | Robert Dadashi | Matthieu Geist | Sertan Girgin | Leonard Hussenot | Orgad Keller | Nikola Momchev | Sabela Ramos Garea | Piotr Stanczyk | Nino Vieillard | Olivier Bachem | Gal Elidan | Avinatan Hassidim | Olivier Pietquin | Idan Szpektor |

Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision TACL

Eugene Kharitonov | Damien Vincent | Zalán Borsos | Raphaël Marinier | Sertan Girgin | Olivier Pietquin | Matt Sharifi | Marco Tagliasacchi | Neil Zeghidour |

Learning Natural Language Generation with Truncated Reinforcement Learning NAACL

Alice Martin | Guillaume Quispe | Charles Ollion | Sylvain Le Corff | Florian Strub | Olivier Pietquin |

Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue SIGDIAL

Olivier Pietquin | Smaranda Muresan | Vivian Chen | Casey Kennington | David Vandyke | Nina Dethlefs | Koji Inoue | Erik Ekstedt | Stefan Ultes |

Supervised Seeded Iterated Learning for Interactive Language Learning EMNLP

Yuchen Lu | Soumye Singhal | Florian Strub | Olivier Pietquin | Aaron Courville |

LIG-CRIStAL Submission for the WMT 2017 Automatic Post-Editing Task WMT WS

Alexandre Bérard | Laurent Besacier | Olivier Pietquin |

MultiVec: a Multilingual and Multilevel Representation Learning Toolkit for NLP LREC

Alexandre Bérard | Christophe Servan | Olivier Pietquin | Laurent Besacier |

Human-Machine Dialogue as a Stochastic Game SIGDIAL WS

Merwan Barlier | Julien Perolat | Romain Laroche | Olivier Pietquin |

NASTIA: Negotiating Appointment Setting Interface LREC

Layla El Asri | Rémi Lemonnier | Romain Laroche | Olivier Pietquin | Hatim Khouzaimi |

DINASTI: Dialogues with a Negotiating Appointment Setting Interface LREC

Layla El Asri | Romain Laroche | Olivier Pietquin |

Model-free POMDP optimisation of tutoring systems with echo-state networks SIGDIAL WS

Lucie Daubigney | Matthieu Geist | Olivier Pietquin |

Optimisation d’un tuteur intelligent à partir d’un jeu de données fixé (Optimization of a tutoring system from a fixed set of data) [in French] JEP/TALN/RECITAL

Lucie Daubigney | Matthieu Geist | Olivier Pietquin |

Statistical User Simulation for Spoken Dialogue Systems: What for, Which Data, Which Future? NAACL WS

Olivier Pietquin |

Training a BN-based user model for dialogue simulation with missing data IJCNLP

Stéphane Rossignol | Olivier Pietquin | Michel Ianotto |

Sparse Approximate Dynamic Programming for Dialog Management SIGDIAL WS

Senthilkumar Chandramohan | Matthieu Geist | Olivier Pietquin |

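Réseau bayesien pour un modèle d’utilisateur et un module de compréhension pour l’optimisation des systèmes de dialogues (Bayesian network for a user model and an understanding module for the optimization of dialogue systems) [in French] JEP/TALN/RECITAL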

Olivier Pietquin |
