SCIENCE

Audio AIs are trained on data full of bias and offensive language


Microphone

Audio training data has been overlooked when it comes to assessing AI

Israel Palacio/Unsplash

Artificial intelligence models that generate audio are being trained on datasets plagued with bias, offensive language and potential copyright infringement, sparking concerns about their use.

Generative audio products, such as song generators, voice cloning tools and transcription services, are increasingly popular, but while text and image generators have been subject to much scrutiny, audio has received less attention.

To help rectify this, William Agnew at Carnegie Mellon University in Pennsylvania and his…



Source link

PennsylvaniaDigitalNews.com