{"id":214908,"date":"2017-03-10T08:25:29","date_gmt":"2017-03-10T13:25:29","guid":{"rendered":"http:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/uncategorized\/a-groundbreaking-new-ai-taught-itself-to-speak-in-just-a-few-hours-futurism.php"},"modified":"2022-05-14T23:23:53","modified_gmt":"2022-05-15T03:23:53","slug":"a-groundbreaking-new-ai-taught-itself-to-speak-in-just-a-few-hours-futurism","status":"publish","type":"post","link":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/artificial-intelligence\/a-groundbreaking-new-ai-taught-itself-to-speak-in-just-a-few-hours-futurism.php","title":{"rendered":"A Groundbreaking New AI Taught Itself to Speak in Just a Few Hours &#8211; Futurism"},"content":{"rendered":"<p><p>Giving Machines a Voice    <\/p>\n<p>    Last year, Google successfully gave a machine the ability to    generate human-like speech through its voice synthesis program called WaveNet.    Powered by Googles DeepMind artificial intelligence    (AI) deep neural network, WaveNet produced synthetic speech    using given texts. Now, Chinese internet search company Baidu    has developed the most advanced speech synthesis program ever,    and its called Deep Voice.  <\/p>\n<p>    Developed in Baidus AI research lab based in Silicon Valley,    Deep Voice presents a big breakthrough in speech synthesis    technology by largely doing away with the behind-the-scenes    fine-tuning typically necessary for suchprograms. As    such, Deep Voice can learn how to talk in a matter of a few    hours and with virtually no help from humans.  <\/p>\n<p>    Deep Voice uses a relatively simple method: through    deep-learning    techniques, Deep Voice broke down texts into phonemes  which is sound    at its smallest perceptually distinct units. A speech synthesis    network then reproduced these sounds. The need for any    fine-tuning was greatly reduced because every stage of the    process relied on deep-learning techniques all    researches needed to dowas train the algorithm.  <\/p>\n<p>    For the audio synthesis model, we implement a variant of    WaveNet that requires fewer parameters and trains faster than    the original, the Baidu researchers wrote in a study    published online. By using a neural network for each    component, our system is simpler and more flexible than    traditional text-to-speech systems, where each component    requires laborious feature engineering and extensive domain    expertise.  <\/p>\n<p>    Text-to-speech systems arent entirely new. Theyre present in    many of the worlds modern gadgets and devices.From    simpler ones  like talking clocks and answering systems in    phones  to more complex versions, like those in navigation    apps. These, however, have been made using large databases of    speech recordings. As such, the speech generated by these    traditional text-to-speech systems dont flowas seamless    as actual human speech.  <\/p>\n<p>    Baidus work on Deep Voice is a step towards achieving    human-like speech synthesis in real time, without using    pre-recorded responses. Baidus Deep Voice puts together    phonemes in such a way that it sounds like actual human speech.    We optimize inference to faster-than-real-time speeds, showing    that these techniques can be applied to generate audio in    real-time in a streaming fashion, their researchers said.  <\/p>\n<p>    However, there are still certain variables that their new    system cannot yet control: the stresses on phonemes and the    duration and natural frequency of each sound. Once perfected,    control of these variables would allow Baidu to change the    voice of the speaker and, possibly, the emotions conveyed by a    word.  <\/p>\n<p>    At the very least, this would be computationally demanding,    limiting just how much Deep Voice can be used in real-time    speech synthesis in the real world. As thethe Baidu researchersexplained:  <\/p>\n<p>    In the future, better synthesized speech systems can be used to    improvethe assistant features found in smartphones and    smart home devices. At the very least, it wouldmake    talking to your devices feel more real.  <\/p>\n<p><!-- Auto Generated --><\/p>\n<p>See the original post:<\/p>\n<p><a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/futurism.com\/a-groundbreaking-new-ai-taught-itself-to-speak-in-just-a-few-hours\/\" title=\"A Groundbreaking New AI Taught Itself to Speak in Just a Few Hours - Futurism\">A Groundbreaking New AI Taught Itself to Speak in Just a Few Hours - Futurism<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p> Giving Machines a Voice Last year, Google successfully gave a machine the ability to generate human-like speech through its voice synthesis program called WaveNet.  <a href=\"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/artificial-intelligence\/a-groundbreaking-new-ai-taught-itself-to-speak-in-just-a-few-hours-futurism.php\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"limit_modified_date":"","last_modified_date":"","_lmt_disableupdate":"","_lmt_disable":"","footnotes":""},"categories":[13],"tags":[],"class_list":["post-214908","post","type-post","status-publish","format-standard","hentry","category-artificial-intelligence"],"modified_by":"Danzig","_links":{"self":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/posts\/214908"}],"collection":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/comments?post=214908"}],"version-history":[{"count":0,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/posts\/214908\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/media?parent=214908"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/categories?post=214908"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.euvolution.com\/futurist-transhuman-news-blog\/wp-json\/wp\/v2\/tags?post=214908"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}