{"id":34028,"date":"2024-03-30T08:16:30","date_gmt":"2024-03-30T02:46:30","guid":{"rendered":"https:\/\/farratanews.online\/openais-voice-cloning-ai-model-only-needs-a-15-second-sample-to-work\/"},"modified":"2024-03-30T08:16:30","modified_gmt":"2024-03-30T02:46:30","slug":"openais-voice-cloning-ai-model-only-needs-a-15-second-sample-to-work","status":"publish","type":"post","link":"https:\/\/farratanews.online\/openais-voice-cloning-ai-model-only-needs-a-15-second-sample-to-work\/","title":{"rendered":"OpenAI\u2019s voice cloning AI model only needs a 15-second sample to work"},"content":{"rendered":"

[ad_1]\n<\/p>\n

\n

OpenAI is offering limited access to a text-to-voice generation platform it developed called Voice Engine, which can create a synthetic voice based on a 15-second clip of someone\u2019s voice. The AI-generated voice can read out text prompts on command in the same language as the speaker or in a number of other languages. \u201cThese small scale deployments are helping to inform our approach, safeguards, and thinking about how Voice Engine could be used for good across various industries,\u201d OpenAI said in its blog post.\u00a0<\/p>\n<\/div>\n

\n

Companies with access include the education technology company Age of Learning, visual storytelling platform HeyGen, frontline health software maker Dimagi, AI communication app creator Livox, and health system Lifespan.<\/p>\n<\/div>\n

\n

In these samples posted by OpenAI, you can hear what Age of Learning has been doing with the technology to generate pre-scripted voice-over content, as well as reading out \u201creal-time, personalized responses\u201d to students written by GPT-4.<\/p>\n<\/div>\n

\n

First, the reference audio in English:<\/p>\n<\/div>\n

\n

And here are three AI-generated audio clips based on that sample, <\/p>\n<\/div>\n

\n

OpenAI said it began developing Voice Engine in late 2022 and that the technology has already powered preset voices for the text-to-speech API and ChatGPT\u2019s Read Aloud feature. In an interview with TechCrunch<\/em>, Jeff Harris, a member of OpenAI\u2019s product team for Voice Engine, said the model was trained on \u201ca mix of licensed and publicly available data.\u201d OpenAI told the publication the model will only be available to about 10 developers. <\/p>\n<\/div>\n

\n

AI text-to-audio generation is an area of generative AI that\u2019s continuing to evolve. While most focus on instrumental or natural sounds, fewer have focused on voice generation, partially due to the questions OpenAI cited. Some names in the space include companies like Podcastle and ElevenLabs, which provide AI voice cloning technology and tools the Vergecast<\/em> explored last year. <\/p>\n<\/div>\n

\n

According to OpenAI, its partners agreed to abide by its usage policies that say they will not use Voice Generation to impersonate people or organizations without their consent. It also requires the partners to get the \u201cexplicit and informed consent\u201d of the original speaker, not build ways for individual users to create their own voices, and to disclose to listeners that the voices are AI-generated. OpenAI also added watermarking to the audio clips to trace their origin and actively monitor how the audio is used.\u00a0<\/p>\n<\/div>\n

\n

OpenAI suggested several steps that it thinks could limit the risks around tools like these, including phasing out voice-based authentication to access bank accounts, policies to protect the use of people\u2019s voices in AI, greater education on AI deepfakes, and development of tracking systems of AI content.\u00a0<\/p>\n<\/div>\n[ad_2]\n","protected":false},"excerpt":{"rendered":"

[ad_1] OpenAI is offering limited access to a text-to-voice generation platform it developed called Voice Engine, which can create a synthetic voice based on a 15-second clip of someone\u2019s voice. The AI-generated voice can read out text prompts on command in the same language as the speaker or in a number of other languages. \u201cThese …<\/p>\n","protected":false},"author":1,"featured_media":34029,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2],"tags":[],"_links":{"self":[{"href":"https:\/\/farratanews.online\/wp-json\/wp\/v2\/posts\/34028"}],"collection":[{"href":"https:\/\/farratanews.online\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/farratanews.online\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/farratanews.online\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/farratanews.online\/wp-json\/wp\/v2\/comments?post=34028"}],"version-history":[{"count":0,"href":"https:\/\/farratanews.online\/wp-json\/wp\/v2\/posts\/34028\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/farratanews.online\/wp-json\/wp\/v2\/media\/34029"}],"wp:attachment":[{"href":"https:\/\/farratanews.online\/wp-json\/wp\/v2\/media?parent=34028"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/farratanews.online\/wp-json\/wp\/v2\/categories?post=34028"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/farratanews.online\/wp-json\/wp\/v2\/tags?post=34028"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}