PlayAI clones voices on command

November 25, 2024

4

Again in 2016, Hammad Syed and Mahmoud Felfel, an ex-WhatsApp engineer, thought it’d be neat to construct a text-to-speech Chrome extension for Medium articles. The extension, which might learn any Medium story aloud, was featured on Product Hunt. A yr later, it spawned a whole enterprise.

“We noticed an even bigger alternative in serving to people and organizations create lifelike audio content material for his or her functions,” Syed advised TechCrunch. “With out the necessity to construct their very own mannequin, they might deploy human-quality speech experiences sooner than ever earlier than.”

Syed and Felfel’s firm, PlayAI (previously PlayHT), pitches itself because the “voice interface of AI.” Prospects can select from numerous predefined voices, or clone a voice, and use PlayAI’s API to combine text-to-speech into their apps.

Toggles permit customers to regulate the intonation, cadence, and tenor of voices.

PlayAI additionally provides a “playground” the place customers can add a file to generate a read-aloud model and a dashboard for creating more-polished audio narrations and voiceovers. Lately, the corporate bought into the “AI brokers” recreation with instruments that can be utilized to automate duties comparable to answering buyer calls at a enterprise.

Certainly one of PlayAI’s extra fascinating experiments is PlayNote, which transforms PDFs, movies, photographs, songs, and different recordsdata into podcast-style reveals, read-aloud summaries, one-on-one debates, and even youngsters’s tales. Like Google’s NotebookLM, PlayNote generates a script from an uploaded file or URL and feeds it to a group of AI fashions, which collectively craft the completed product.

I gave it a whirl, and the outcomes weren’t half unhealthy. PlayNote’s “podcast” setting produces clips roughly on par with NotebookLM’s when it comes to high quality, and the device’s potential to ingest photographs and movies makes for some fascinating creations. Given an image of hen mole dish I had not too long ago, PlayNote wrote a five-minute podcast script about it. Really, we live sooner or later.

Granted, the device, like all AI instruments, generates odd artifacts and hallucinations once in a while. And whereas PlayNote will do its greatest to adapt a file to the format you’ve chosen, don’t count on, say, a dry authorized submitting to make for the perfect supply materials. See: the Musk v. OpenAI lawsuit framed as a bedtime story: