Text-Free Image-to-Speech Synthesis Using Learned Segmental Units

Wei-Ning Hsu | David Harwath | Tyler Miller | Christopher Song | James Glass |

Paper Details:

Month: August
Year: 2021
Location: Online
Venue: ACL | IJCNLP |