{"id":73758,"date":"2019-01-30T19:14:28","date_gmt":"2019-01-31T03:14:28","guid":{"rendered":"http:\/\/69.46.6.243\/?p=73758"},"modified":"2019-01-30T19:14:28","modified_gmt":"2019-01-31T03:14:28","slug":"columbia-engineers-translate-brain-signals-directly-into-speech","status":"publish","type":"post","link":"https:\/\/new.thepinetree.net\/?p=73758","title":{"rendered":"Columbia Engineers Translate Brain Signals Directly into Speech"},"content":{"rendered":"<p>New York, NY&#8230;In a scientific first, Columbia neuroengineers have created a system that translates thought into intelligible, recognizable speech. By monitoring someone\u2019s brain activity, the technology can reconstruct the words a person hears with unprecedented clarity. This breakthrough, which harnesses the power of speech synthesizers and artificial intelligence, could lead to new ways for computers to communicate directly with the brain. It also lays the groundwork for helping people who cannot speak, such as those living with amyotrophic lateral sclerosis (ALS) or recovering from stroke, regain their ability to communicate with the outside world.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/new.thepinetree.net\/wp-content\/uploads\/2019\/01\/Mesgarani-Hero-06-27-18.jpg\" alt=\"\" width=\"640\" height=\"361\" class=\"alignnone size-full wp-image-73759\" srcset=\"https:\/\/new.thepinetree.net\/wp-content\/uploads\/2019\/01\/Mesgarani-Hero-06-27-18.jpg 640w, https:\/\/new.thepinetree.net\/wp-content\/uploads\/2019\/01\/Mesgarani-Hero-06-27-18-300x169.jpg 300w, https:\/\/new.thepinetree.net\/wp-content\/uploads\/2019\/01\/Mesgarani-Hero-06-27-18-123x70.jpg 123w, https:\/\/new.thepinetree.net\/wp-content\/uploads\/2019\/01\/Mesgarani-Hero-06-27-18-570x322.jpg 570w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><\/p>\n<p>These findings were published in Scientific Reports.<\/p>\n<p>\u201cOur voices help connect us to our friends, family and the world around us, which is why losing the power of one\u2019s voice due to injury or disease is so devastating,\u201d said Nima Mesgarani, PhD, the paper\u2019s senior author and a principal investigator at Columbia University\u2019s Mortimer B. Zuckerman Mind Brain Behavior Institute. \u201cWith today\u2019s study, we have a potential way to restore that power. We\u2019ve shown that, with the right technology, these people\u2019s thoughts could be decoded and understood by any listener.\u201d<\/p>\n<p>This would be a game changer. It would give anyone who has lost their ability to speak, whether through injury or disease, the renewed chance to connect to the world around them.<\/p>\n<p>Decades of research has shown that when people speak \u2014 or even imagine speaking \u2014 telltale patterns of activity appear in their brain. Distinct (but recognizable) pattern of signals also emerge when we listen to someone speak, or imagine listening. Experts, trying to record and decode these patterns, see a future in which thoughts need not remain hidden inside the brain \u2014 but instead could be translated into verbal speech at will.<\/p>\n<p>But accomplishing this feat has proven challenging. Early efforts to decode brain signals by Dr. Mesgarani and others focused on simple computer models that analyzed spectrograms, which are visual representations of sound frequencies.<\/p>\n<p>But because this approach has failed to produce anything resembling intelligible speech, Dr. Mesgarani and his team, including the paper&#8217;s first author Hassan Akbari, turned instead to a vocoder, a computer algorithm that can synthesize speech after being trained on recordings of people talking.<\/p>\n<p>\u201cThis is the same technology used by Amazon Echo and Apple Siri to give verbal responses to our questions,\u201d said Dr. Mesgarani, who is also an associate professor of electrical engineering at Columbia Engineering.<\/p>\n<p>To teach the vocoder to interpret to brain activity, Dr. Mesgarani teamed up with Ashesh Dinesh Mehta, MD, PhD, a neurosurgeon at Northwell Health Physician Partners Neuroscience Institute and co-author of today\u2019s paper. Dr. Mehta treats epilepsy patients, some of whom must undergo regular surgeries.<\/p>\n<p>\u201cWorking with Dr. Mehta, we asked epilepsy patients already undergoing brain surgery to listen to sentences spoken by different people, while we measured patterns of brain activity,\u201d said Dr. Mesgarani. \u201cThese neural patterns trained the vocoder.\u201d<\/p>\n<p>Next, the researchers asked those same patients to listen to speakers reciting digits between 0 to 9, while recording brain signals that could then be run through the vocoder. The sound produced by the vocoder in response to those signals was analyzed and cleaned up by neural networks, a type of artificial intelligence that mimics the structure of neurons in the biological brain.<\/p>\n<p>The end result was a robotic-sounding voice reciting a sequence of numbers. To test the accuracy of the recording, Dr. Mesgarani and his team tasked individuals to listen to the recording and report what they heard.<\/p>\n<p>\u201cWe found that people could understand and repeat the sounds about 75% of the time, which is well above and beyond any previous attempts,\u201d said Dr. Mesgarani. The improvement in intelligibility was especially evident when comparing the new recordings to the earlier, spectrogram-based attempts. \u201cThe sensitive vocoder and powerful neural networks represented the sounds the patients had originally listened to with surprising accuracy.\u201d<\/p>\n<p>Dr. Mesgarani and his team plan to test more complicated words and sentences next, and they want to run the same tests on brain signals emitted when a person speaks or imagines speaking. Ultimately, they hope their system could be part of an implant, similar to those worn by some epilepsy patients, that translates the wearer\u2019s thoughts directly into words.<\/p>\n<p>\u201cIn this scenario, if the wearer thinks \u2018I need a glass of water,\u2019 our system could take the brain signals generated by that thought, and turn them into synthesized, verbal speech,\u201d said Dr. Mesgarani. \u201cThis would be a game changer. It would give anyone who has lost their ability to speak, whether through injury or disease, the renewed chance to connect to the world around them.\u201d<\/p>\n<p>###<\/p>\n<p>This paper is titled \u201cTowards reconstructing intelligible speech from the human auditory cortex.\u201d Additional contributors include Bahar Khalighinejad and Jose L. Herrero.<\/p>\n<p>This research was supported by the National Institutes of Health (DC014279), the Pew Charitable Trusts and the Pew Biomedical Scholars Program.<\/p>\n<p>The authors report no financial or other conflicts of interest.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>New York, NY&#8230;In a scientific first, Columbia neuroengineers have created a system that translates thought into intelligible, recognizable speech. By monitoring someone\u2019s brain activity, the technology can reconstruct the words a person hears with unprecedented clarity. This breakthrough, which harnesses the power of speech synthesizers and artificial intelligence, could lead to new ways for computers [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":73759,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_cbd_carousel_blocks":"[]","jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[3,6,20,1,147],"tags":[],"class_list":["post-73758","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-business","category-education","category-featured","category-news","category-tech","last_archivepost"],"jetpack_featured_media_url":"https:\/\/new.thepinetree.net\/wp-content\/uploads\/2019\/01\/Mesgarani-Hero-06-27-18.jpg","jetpack_sharing_enabled":true,"jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/new.thepinetree.net\/index.php?rest_route=\/wp\/v2\/posts\/73758","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/new.thepinetree.net\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/new.thepinetree.net\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/new.thepinetree.net\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/new.thepinetree.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=73758"}],"version-history":[{"count":0,"href":"https:\/\/new.thepinetree.net\/index.php?rest_route=\/wp\/v2\/posts\/73758\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/new.thepinetree.net\/index.php?rest_route=\/wp\/v2\/media\/73759"}],"wp:attachment":[{"href":"https:\/\/new.thepinetree.net\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=73758"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/new.thepinetree.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=73758"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/new.thepinetree.net\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=73758"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}