NEW YORK (AFP) – As people brace for the disruptive impact of artificial intelligence (AI) on jobs and everyday living, those in the world of audio books say their field is already being transformed.
AI has the ability to create human-sounding recordings – at assembly-line speed – while bypassing at least part of the services of the human professionals who for years have made a living with their voices.
Many of them are already seeing a sharp drop off in business.
Tanya Eby has been a full-time voice actor and professional narrator for 20 years. She has a recording studio in her home.
But in the past six months she has seen her work load fall by half. Her bookings now run only through June, while in a normal year they would extend through August.
Many of her colleagues report similar declines.
While other factors could be at play, she told AFP, “It seems to make sense that AI is affecting all of us.”
There is no label identifying AI-assisted recordings as such, but professionals say thousands of audio books currently in circulation use “voices” generated from a databank.
Among the most cutting-edge, DeepZen offers rates that can slash the cost of producing an audio book to one-fourth, or less, that of a traditional project.
The small London-based company draws from a database it created by recording the voices of several actors who were asked to speak in a variety of emotional registers.
“Every voice that we are using, we sign a license agreement, and we pay for the recordings,” said DeepZen CEO Kamis Taylan.
For every project, he added, “we pay royalties based on the work that we do”. Not everyone respects that standard, said Eby.
“All these new companies are popping up who are not as ethical,” she said, and some use voices found in databases without paying for them. “There’s that grey area” being exploited by several platforms, Taylan acknowledged. “They take your voice, my voice, five other people’s voices combined that just creates a separate voice… They say that it doesn’t belong to anybody.”
All the audio book companies contacted by AFP denied using such practices.
Speechki, a Texas-based start-up, uses both its own recordings and voices from existing databanks, said CEO Dima Abramov.
But that is done only after a contract has been signed covering usage rights, he said.
FUTURE OF COEXISTENCE?
The five largest US publishing houses did not respond to requests for comment.
But professionals contacted by AFP said several traditional publishers are already using so-called generative AI, which can create texts, images, videos and voices from existing content – without human intervention.
“Professional narration has always been, and will remain, core to the Audible listening experience,” said a spokesperson for that Amazon subsidiary, a giant in the American audio book sector.
“However, as text-to-speech technology improves, we see a future in which human performances and text-to-speech generated content can co-exist.”
The giants of US technology, deeply involved in the explosively developing field of AI, are all pursuing the promising business of digitally narrated audio books.
ACCESSIBLE TO ALL
Early this year, Apple announced it was moving into AI-narrated audio books, a move it said would make the “creation of audio books more accessible to all”, notably independent authors and small publishers.
Google is offering a similar service, which it describes as “auto-narration”.
“We have to democratise the publishing industry, because only the most famous and the big names are getting converted into audio,” said Taylan.
“Synthetic narration just opened the door for old books that have never been recorded, and all the books from the future that never will be recorded because of the economics,” added Speechki’s Abramov.
Given the costs of human-based recording, he added, only some five per cent of all books are turned into audio books.
But Abramov insisted that the growing market would also benefit voice actors. “They will make more money, they will make more recordings,” he said.
THE HUMAN ELEMENT
“The essence of storytelling is teaching humanity how to be human. And we feel strongly that that should never be given to a machine to teach us about how to be human,” said Emily Ellet, an actor and audio book narrator who cofounded the Professional Audiobook Narrators Association (PANA).
“Storytelling,” she added, “should remain human entirely.”
Eby underlined a frequent criticism of digitally generated recordings. When compared to a human recording, she said, an AI product “lacks in emotional connectivity”.
Eby said she fears, however, that people will grow accustomed to the machine-generated version, “and I think that’s quietly what’s kind of happening”.
Her wish is simply “that companies would let listeners know that they are actually listening to an AI-generated piece.
“I just want people to be honest about it”.