Tech

Stability AI releases a sound generator | TechCrunch

Published

7 months ago

June 5, 2024

Admin

Stability AI releases a sound generator | TechCrunch

Stability AI, the startup behind the AI-powered art generator Stable Diffusion, has released an open AI model for generating sounds and songs that it claims was trained exclusively on royalty-free recordings.

Called Stable Audio Open, the generative model takes a text description (e.g. “Rock beat played in a treated studio, session drumming on an acoustic kit”) and outputs a recording up to 47 seconds in length. The model was trained using around 486,000 samples from free music libraries FreeSound and the Free Music Archive.

Stability AI says that the model can be used to create drum beats, instrument riffs, ambient noises and “production elements” for videos, films and TV shows as well as to “edit” existing songs or apply the style of one song (e.g. smooth jazz) to another.

“A key benefit of this open source release is that users can fine-tune the model on their own custom audio data,” Stability AI wrote in a post on its corporate blog. “For example, a drummer could fine-tune on samples of their own drum recordings to generate new beats.”

Stable Audio Open has its limitations, however. It can’t produce full songs, melodies or vocals — at least not good ones. Stability AI says that it’s not optimized for this, and suggests that users looking for those capabilities opt for the company’s premium Stable Audio service.

Stable Audio Open also can’t be used commercially; its terms of service prohibit it. And it doesn’t perform equally well across musical styles and cultures or with descriptions in languages other than English — biases Stability AI blames on the training data.

“The source of data is potentially lacking diversity and all cultures are not equally represented in the data set,” Stability AI writes in a description of the model. “The generated samples from the model will reflect the biases from the training data.”

Stability AI — which has long struggled to turn its flagging business around — became the subject of controversy recently after its VP of generative audio, Ed Newton-Rex, resigned over disagreement with the company’s stance that training generative AI models on copyrighted works constitutes “fair use.” Stable Audio Open would appear to be an attempt to turn that narrative around, while at the same time not-so-subtly advertising Stability AI’s paid products.

As music generators including Stability’s gain in popularity, copyright — and the ways in which some creators of generators might be abusing it — is becoming a central point of focus.

In May, Sony Music, which represents artists, including Billy Joel, Doja Cat and Lil Nas X, sent a letter to 700 AI companies warning against “unauthorized use” of its content for training audio generators. And in March, the U.S.’ first law aimed at tamping down abuses of AI in music was signed into law in Tennessee.

Up Next

iOS 18 beta 1 is coming soon, will you install it? [Poll] – 9to5Mac

Don't Miss

Iconic modern Metroidvania with 94% Steam score reveals new retro-styled DLC

Crunchbase News Today

Stability AI releases a sound generator | TechCrunch

Tech

Stability AI releases a sound generator | TechCrunch

Oakland business owner speaks out, concerned with break-ins

Community remembers ‘enduring legacy’ of fitness influencer Miguel Aguilar after his death

George Costanza’s 5 Best Jobs On Seinfeld, Ranked – SlashFilm

Ken Paxton sues NCAA over transgender athletes’ participation in women’s sports

Bryce Underwood opens up about Michigan decision: ‘Business is business’

Dolphins keep playoff dream alive by beating 49ers 29-17

Bills don’t play a great game, but avoid a bad loss by coming back to beat Patriots

Horoscope Today: December 23, 2024

Years later, California’s pandemic-era debt comes back to cost local businesses

SNF open thread: Buccaneers-Cowboys gambling lines and picks for tonight’s game