Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes

Cheng-Yu Hsieh | Chun-Liang Li | Chih-kuan Yeh | Hootan Nakhost | Yasuhisa Fujii | Alex Ratner | Ranjay Krishna | Chen-Yu Lee | Tomas Pfister

Paper Details:

Month: July
Year: 2023
Location: Toronto, Canada
Venue: Findings-ACL