
IBM’s AI generates high-quality voices from 5 minutes of talking

Training powerful text-to-speech models requires substantial compute. A recent OpenAI study drives the point home: it found that since 2012, the amount of compute used in the largest AI training runs has grown by more than 300,000 times. In pursuit of less demanding models, researchers at IBM have developed a lightweight, modular method for speech synthesis. They say it can synthesize high-quality speech in real time by learning different aspects of a speaker's voice separately, which makes it possible to adapt to new speaking styles and voices with small amounts of data.
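IBM hasn't released code for this system, but the general idea behind such modular adaptation can be illustrated with a toy sketch: a large backbone is trained once and frozen, while only small speaker-specific pieces (here, a speaker embedding and a prosody head) are fine-tuned on a few minutes of target-speaker data. Everything below is hypothetical for illustration, including the names ToyModularTTS and adapt_to_new_speaker and all of the dimensions; it is not IBM's architecture.

```python
from itertools import cycle

import torch
import torch.nn as nn


class ToyModularTTS(nn.Module):
    """Toy modular TTS: a large, generic backbone (text encoder + mel decoder)
    plus small speaker-specific parts (a speaker embedding and a prosody head)
    that can be fine-tuned cheaply on a few minutes of speech."""

    def __init__(self, vocab_size=64, hidden=256, n_mels=80, spk_dim=32):
        super().__init__()
        # Backbone: trained once on many speakers, then frozen.
        self.embed = nn.Embedding(vocab_size, hidden)
        self.encoder = nn.GRU(hidden, hidden, batch_first=True)
        self.decoder = nn.Linear(hidden + spk_dim, n_mels)
        # Small speaker-specific modules: cheap to adapt.
        self.speaker_embedding = nn.Parameter(torch.zeros(spk_dim))
        self.prosody_head = nn.Linear(hidden, 2)  # e.g. pitch / energy offsets

    def forward(self, tokens):
        enc, _ = self.encoder(self.embed(tokens))              # (B, T, hidden)
        spk = self.speaker_embedding.expand(enc.size(0), enc.size(1), -1)
        mel = self.decoder(torch.cat([enc, spk], dim=-1))      # (B, T, n_mels)
        prosody = self.prosody_head(enc)                       # (B, T, 2)
        return mel, prosody


def adapt_to_new_speaker(model, batches, steps=200, lr=1e-3):
    """Fine-tune only the small speaker-specific parameters on a handful of
    (token, mel) pairs from the target speaker; the backbone stays frozen.
    Only the mel loss is used here; a real system would also supervise prosody."""
    for p in model.parameters():
        p.requires_grad_(False)
    adapt_params = [model.speaker_embedding, *model.prosody_head.parameters()]
    for p in adapt_params:
        p.requires_grad_(True)

    opt = torch.optim.Adam(adapt_params, lr=lr)
    loss_fn = nn.L1Loss()
    for _, (tokens, target_mel) in zip(range(steps), cycle(batches)):
        mel, _ = model(tokens)
        loss = loss_fn(mel, target_mel)
        opt.zero_grad()
        loss.backward()
        opt.step()
    return model
```

In a setup like this, only a few thousand parameters change during adaptation, which is why a few minutes of target-speaker audio can be enough; the heavy, compute-hungry training happens once for the shared backbone.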