Entertainment
Nvidia shows AI model that can modify voices, generate novel sounds
Nvidia on Monday (Nov 25) showed a new artificial intelligence model for generating music and audio that can modify voices and generate novel sounds — technology aimed at the producers of music, films and video games. Nvidia, the world's biggest sup
Nvidia on Monday (Nov 25) showed a new artificial intelligence model for generating music and audio that can modify voices and generate novel sounds — technology aimed at the producers of music, films and video games.
Nvidia, the world's biggest supplier of chips and software used to create AI systems, said it does not have immediate plans to publicly release the technology, which it calls Fugatto, short for Foundational Generative Audio Transformer Opus 1.
It joins other technologies shown by startups such as Runway and larger players such as Meta Platforms that can generate audio or video from a text prompt.
Santa Clara, California-based Nvidia's version generates sound effects and music from a text description, including novel sounds such as making a trumpet bark like a dog.
What makes it different from other AI technologies is its ability to take in and modify existing audio, for example by taking a line played on a piano and transforming it into a line sung by a human voice, or by taking a spoken word recording and changing the accent used and the mood expressed.
Nvidia, the world's biggest supplier of chips and software used to create AI systems, said it does not have immediate plans to publicly release the technology, which it calls Fugatto, short for Foundational Generative Audio Transformer Opus 1.
It joins other technologies shown by startups such as Runway and larger players such as Meta Platforms that can generate audio or video from a text prompt.
Santa Clara, California-based Nvidia's version generates sound effects and music from a text description, including novel sounds such as making a trumpet bark like a dog.
What makes it different from other AI technologies is its ability to take in and modify existing audio, for example by taking a line played on a piano and transforming it into a line sung by a human voice, or by taking a spoken word recording and changing the accent used and the mood expressed.
Related
Next