Do you remember when the idea of KITT, the chatty Knight Rider car, still blew you away? Or when Blade Runner Eric Decker verbally commanded his computer to enhance photos of a crime scene? The idea of being understood by a computer seemed futuristic enough, let alone one that could answer your questions and understand your commands.
About the Author
Graeme John Cole is a contributor for Rev, creator of the world's most accurate automatic speech recognition engine, Rev.ai.
Today, we all carry KITT in our pockets. We sigh when KITT answers the phone at the bank. The personality isnt quite there yet but computers can recognize the words we say near-perfectly.
Michael Knight, the Knight Rider hero who partnered with his intelligent car to fight crime, was skeptical at the thought KITT might understand his questions in 1982. But the development of speech recognition technology had been underway since the 1950s. Here's a closer look at how that technology has evolved over the years. And how our ways of using speech recognition and speech-to-text capabilities have evolved alongside the tech.
The power of automated speech recognition (ASR) means that its development has always been associated with big names.
Bell Laboratories led the way with AUDREY in 1952. The AUDREY system recognized spoken numbers with 97-99% accuracy in carefully controlled conditions. However, according to James Flanagan, a scientist and former Bell Labs electrical engineer, AUDREY sat on "a six-foot-high relay rack, consumed substantial power, and exhibited the myriad maintenance problems associated with complex vacuum-tube circuitry." AUDREY was too expensive and inconvenient even for specialist use cases.
IBM followed in 1962 with the Shoebox, which recognized numbers and simple math terms. Meanwhile, Japanese labs were developing vowel and phoneme recognizers and the first speech segmenter. It's one thing for a computer to understand a small range of numbers (i.e., 0-9), but Kyoto University's breakthrough was to 'segment' a line of speech so the tech could go to work on a range of spoken sounds.
In the 1970s, The Department of Defense (DARPA) funded the Speech Understanding Research (SUR) program. The fruits of this research included the HARPY Speech Recognition System from Carnegie Mellon. HARPY recognized sentences from a vocabulary of 1,011 words, giving the system the power of the average three-year-old. Like a three-year-old, speech recognition was now charming and had potential but you wouldnt want it in the office.
HARPY was among the first to make use of Hidden Markov Models (HMM). This probabilistic method drove the development of ASR in the 1980s. Indeed, in the 1980s, the first viable use cases for speech-to-text tools emerged with IBM's experimental transcription system, Tangora. Properly trained, Tangora could recognize and type 20,000 words in English. However, the system was still too unwieldy for commercial use.
We thought it was wrong to ask a machine to emulate people, recalls IBMs speech recognition innovator Fred Jelinek. After all, if a machine has to move, it does it with wheelsnot by walking. Rather than exhaustively studying how people listen to and understand speech, we wanted to find the natural way for the machine to do it.
Statistical analysis was now driving the evolution of ASR technology. In 1990, Dragon Dictate launched as the first commercial speech recognition software. It cost $9,000 roughly $18,890 in 2021 accounting for inflation. Until the launch of Dragon Naturally Speaking in 1997, users still needed to pause between every word.
In 1992, AT&T introduced Bell Labs Voice Recognition Call Processing (VRCP) service. VRCP now handles around 1.2 billion voice transactions each year.
But most of the work on speech recognition in the 1990s took place under the hood. Personal computing and the ubiquitous network created new angles for innovation. Such was the opportunity spotted by Mike Cohen, who joined Google to launch the company's speech tech efforts in 2004. Google Voice Search (2007) delivered voice recognition tech to the masses. But it also recycled the speech data of millions of networked users as training material for machine learning. And it had Google's processing clout to drive the quality forwards.
Apple (Siri) and Microsoft (Cortana) followed just to stay in the game. In the early 2010s, the emergence of deep learning, Recurrent Neural Networks (RNNs), and Long short-term memory (LSTM), led to a hyperspace jump in the capabilities of ASR tech. This forward momentum was also largely driven by emergence and increased availability of low-cost computing and massive algorithmic advances.
Building on decades of evolution and in response to rising user expectations speech recognition technology has made further leaps over the past half-decade. Solutions to optimize varying audio fidelity and demanding hardware requirements are easing speech recognition into everyday use via voice search and the Internet of Things.
For example, smart speakers use hot-word detection to deliver an immediate result using embedded software. Meanwhile, the remainder of the sentence is sent to the cloud for processing. Googles VoiceFilter-Lite optimizes an individuals speech at the device end of the transaction. This enables consumers to train their device with their voice. Training reduces the source-to-distortion ratio (SDR), enhancing the usability of voice-activated assistive apps.
Word error rate (WER - the percentage of incorrect words that appear during a speech-to-text process) is improving vastly. Academics suggest that by the end of the 2020s, 99% of transcription work will be automatic. Humans will step in only for quality control and corrections.
ASR capability is improving in symbiosis with the developments of the networked age. Here's a look at three compelling use cases for automated speech recognition.
The podcasting industry will bust through the $1 billion barrier in 2021. Listenership is soaring and the words keep coming.
Podcast platforms are seeking out ASR providers with high accuracy and per-word timestamps to help make it easier for people to create podcasts and maximize the value of their content. Providers like Descript convert podcasts into text that can be quickly edited.
Plus, per-word timestamps save time, empowering the editor to mold the finished podcast like clay. These transcripts also make content more accessible to all audiences, as well as help creators improve their shows searchability and discoverability via SEO.
More and more meetings take place online these days. And even those that dont are often recorded. Minute-taking is expensive and time-consuming. But meeting notes are an invaluable tool for attendees to get a recap or check a detail. Streaming ASR delivers speech-to-text in real-time. This means easy captioning or live transcription for meetings and seminars.
Processes such as legal depositions, hiring, and more are going virtual. ASR can help make this video content more accessible and engaging. But more importantly, end-to-end (E2E) machine learning (ML) models are further improving speaker diarization the record of who is present and who said what.
In high-stakes situations, trust in the tools is essential. A reliable speech-to-text engine with an ultra-low WER removes the element of doubt and reduces the time required to produce end documents and make decisions.
Do you think Knight Industries ever appraised the transcript of KITT and Michael's conversations to improve efficiency? Maybe not. But, turbo-charged by the recent move to working from home, more and more of our discussions are taking place online or over the phone. Highly accurate real-time natural language processing (NLP) gives us power over our words. It adds value to every interaction.
The tools are no longer exclusive to big names like IBM and DARPA. They are available for consumers, businesses, and developers to use how their imagination decides as speech recognition technology steadies to overtake the promises of science-fiction.
Interested in speech recognition? Check out our roundup of the best speech-to-text software
The rest is here:
The evolution of speech recognition technology - TechRadar
- Days of our Lives' Suzanne Rogers on the Evolution of Maggie: "She Knows Who She Is Now, and She's Not Relying ... - Michael Fairman TV - March 14th, 2024 [March 14th, 2024]
- Kylie Jenner Talks About Her Style Evolution - The Cut - March 14th, 2024 [March 14th, 2024]
- Equator Coffees Unveils New Packaging Design, Reflecting Brand Evolution & Vision For The Future - Sprudge - March 14th, 2024 [March 14th, 2024]
- Rosewood Hotel Group Accelerates Growth And Evolution Across Its Four Distinctive Brands - Hospitality Net - March 14th, 2024 [March 14th, 2024]
- Thomson Reuters Unveils New Brand Evolution - Adweek - March 14th, 2024 [March 14th, 2024]
- Is It Becoming Acceptable to Speak of Design? - Discovery Institute - March 14th, 2024 [March 14th, 2024]
- Did Charles Darwin Convert to Christianity and Discredit Evolution on His Deathbed? - Snopes.com - March 14th, 2024 [March 14th, 2024]
- Milk, it's not just for mammals: An amphibian makes it too - NPR - March 14th, 2024 [March 14th, 2024]
- Discover Puerto Rico Debuts Evolution of Its Successful 'Live Boricua' Brand Campaign Aimed at Engaging Visitors ... - Yahoo Finance - March 14th, 2024 [March 14th, 2024]
- A Journey Through Time: The Evolution of Ras Al Khaimah Art - Business Wire - March 14th, 2024 [March 14th, 2024]
- Empowering Women: The Evolution and Innovation of coto Social Platform - CXOToday.com - March 14th, 2024 [March 14th, 2024]
- The Evolution of Da'Vine Joy Randolph - The Root - March 14th, 2024 [March 14th, 2024]
- Study on mating behaviors offers clues into the evolution of attraction - Phys.org - March 14th, 2024 [March 14th, 2024]
- Dragonball Evolutions live-action Goku says goodbye to Toriyama: Sorry we messed up - AS USA - March 14th, 2024 [March 14th, 2024]
- Investec, evolution of SMEs in the materials handling sector - Leasing Life - March 14th, 2024 [March 14th, 2024]
- Pride & Prejudice and the evolution of the female gaze on screen - Yahoo News UK - March 6th, 2024 [March 6th, 2024]
- Joe Wong's Musical Evolution - Shepherd Express - March 6th, 2024 [March 6th, 2024]
- A global survey of prokaryotic genomes reveals the eco-evolutionary pressures driving horizontal gene transfer - Nature.com - March 6th, 2024 [March 6th, 2024]
- Redefining Intelligence: Chimpanzees Break Through the Cultural Evolution Barrier - Medriva - March 6th, 2024 [March 6th, 2024]
- Mollusk Eyes Reveal How Future Evolution Depends on the Past - Quanta Magazine - March 6th, 2024 [March 6th, 2024]
- Levy Delves Into the Evolution of ADCs in NSCLC - OncLive - March 6th, 2024 [March 6th, 2024]
- The Snake Is The Spearhead of Reptile Evolution, But Why? - ScienceAlert - March 6th, 2024 [March 6th, 2024]
- 'A very special day: Birds linked to Darwins theory of evolution reintroduced to Galapagos Islands - Euronews - March 6th, 2024 [March 6th, 2024]
- Why the Powerhouses of Cells Evolve Differently in Plants - College of Natural Sciences - March 6th, 2024 [March 6th, 2024]
- Driving the DevOps Evolution: ArgoCD, Tekton and Seamless Migrations - DevOps.com - March 6th, 2024 [March 6th, 2024]
- Finding the Balance: The Evolution of Public Health Guidance Amidst Controversy - Medriva - March 6th, 2024 [March 6th, 2024]
- Insider Podcast: Paolini dishes on her Polish roots and hard-court evolution - WTA Tennis - March 6th, 2024 [March 6th, 2024]
- Interview: Sara Gruen and Rick Elice Talk About the Inspiration and Evolution of the New Musical Water for Elephants - TheaterMania.com - March 6th, 2024 [March 6th, 2024]
- The Evolution of the Laravel Welcome Page - Laravel News - March 6th, 2024 [March 6th, 2024]
- A Serpentine 'Explosion' 125 Million Years Ago Primed Snakes for Rapid, Diverse Evolution - Smithsonian Magazine - March 6th, 2024 [March 6th, 2024]
- The Evolution of Modern Technologies in Car Development - FinSMEs - March 6th, 2024 [March 6th, 2024]
- Milwaukee Transformed: From Bronzeville to Veterans Park, Aerial Timelapses Reveal City's Evolution - BNN Breaking - March 6th, 2024 [March 6th, 2024]
- The eyes are a gateway to evolution of daddy longlegs at least. - University of Wisconsin-Madison - March 6th, 2024 [March 6th, 2024]
- Adrian Newey: RB20 is the next step in Red Bull's design evolution - PlanetSport - March 6th, 2024 [March 6th, 2024]
- LiveScore releases its 'Evolution of Fan' report - Gambling Insider - March 6th, 2024 [March 6th, 2024]
- The loyalty program evolution makes its way to the full-service restaurant category - Nation's Restaurant News - March 6th, 2024 [March 6th, 2024]
- Teenage Mutant Ninja Turtles: The Last Ronin II - Re-Evolution #1 spoiler-free review: goes hard on the action, but ... - Gamesradar - March 6th, 2024 [March 6th, 2024]
- Exploring U.S. Financial Evolution: DAR Hosts Talk on Federal Reserve History in Thomasville - BNN Breaking - March 6th, 2024 [March 6th, 2024]
- Why cloud evolution needs a cohesive approach to succeed - CIO - March 6th, 2024 [March 6th, 2024]
- Gilead Sciences CEO on Company's Evolution and Commitment to the Bay Area - BioSpace - March 6th, 2024 [March 6th, 2024]
- Navigating the AI Quandary: Human Supremacy vs Machine Intelligence Evolution - BNN Breaking - March 6th, 2024 [March 6th, 2024]
- Denis Villeneuve breaks down the evolution of sandworms in 'Dune: Part Two' - Mashable - March 6th, 2024 [March 6th, 2024]
- Continued evolution of law improves governing capacity - Chinadaily.com.cn - China Daily - March 6th, 2024 [March 6th, 2024]
- The Evolution of the DEX Space with dYdX's CEO Antonio Juliano - Blockster - March 6th, 2024 [March 6th, 2024]
- Quick Commerce Evolution: 3PL Firms Aim for Same Day Delivery, Chasing Blinkit and Zepto's Lead - BNN Breaking - March 6th, 2024 [March 6th, 2024]
- What If...? Star Jeffrey Wright Addreses the Watcher's Evolution and 'Epic' Season 2 Finale - CBR - Comic Book Resources - December 31st, 2023 [December 31st, 2023]
- Evolution of the Connected Autonomous Vehicle - Ward's Auto - December 31st, 2023 [December 31st, 2023]
- A project to capture the evolution of human culture. - Psychology Today - December 31st, 2023 [December 31st, 2023]
- The Evolution of a Digital Soul. Beyond Code: A Journey of Heart and | by Mark Randall Havens | Dec, 2023 - Medium - December 31st, 2023 [December 31st, 2023]
- 4 Clues That Reid Is Finally Returning In Criminal Minds: Evolution Season 2 - Screen Rant - December 31st, 2023 [December 31st, 2023]
- Evolution of Samoyed and Kitten's Friendship Delights Internet: 'Wholesome' - Newsweek - December 31st, 2023 [December 31st, 2023]
- Crypto Evolution: Pullix (PLX) vs OKB (OKB) & KuCoin (KCS) - Crypto Reporter - December 31st, 2023 [December 31st, 2023]
- Alfa Romeos mediocre F1 season heralded its era of evolution: Prime Tire - The Athletic - December 31st, 2023 [December 31st, 2023]
- Beyond The Uniform: 10 Years of Evolution in SYNC Performance's Custom Program - SkiRacing.com - December 31st, 2023 [December 31st, 2023]
- Why SZA's evolution into a popstar has earned her recognition as artist of the year - Salon - December 31st, 2023 [December 31st, 2023]
- AI in 2023 Rises, Falls and Evolution - Finance Magnates - December 31st, 2023 [December 31st, 2023]
- Indonesia's Indosat pursues evolution from telecom to tech company - Nikkei Asia - December 31st, 2023 [December 31st, 2023]
- EdTech Evolution: 3 Stocks Educating the Next Generation - InvestorPlace - December 31st, 2023 [December 31st, 2023]
- Informa Tech Interview with Huawei about voice evolution and innovations at 5G Core Summit 2023 - Informa Tech ... - Light Reading - December 31st, 2023 [December 31st, 2023]
- Looking ahead: What will the DeFi evolution look like in 2024? - Ledger Insights - Ledger Insights - December 31st, 2023 [December 31st, 2023]
- Why Cat Bohannon wrote 'Eve, How the Female Body Drove 200 Million Years of Human Evolution' | India News ... - IndiaTimes - December 31st, 2023 [December 31st, 2023]
- The smart-design evolution of the laboratory space - pharmaphorum - December 31st, 2023 [December 31st, 2023]
- The WILD Evolution of Teenage Mutant Ninja Turtles TMNT (VIDEO) - FandomWire - December 31st, 2023 [December 31st, 2023]
- The supernatural invades American museums via indigenous artifacts - Why Evolution Is True - December 31st, 2023 [December 31st, 2023]
- Baleen Whales First Evolved Large Body Size in Cold Southern Waters, New Fossil Shows - Sci.News - December 31st, 2023 [December 31st, 2023]
- The Evolution of Identity in Taiwan The Diplomat - The Diplomat - December 31st, 2023 [December 31st, 2023]
- From the Archive: The Evolution Of Hockey Pools - The Hockey News - December 31st, 2023 [December 31st, 2023]
- 'X-Men: Evolution' Is Better Than 'X-Men: The Animated Series' - Collider - December 31st, 2023 [December 31st, 2023]
- Unveiling the Silver Screen: The Evolution of Celebrity Nudity in Cinema - The Hype Magazine - December 31st, 2023 [December 31st, 2023]
- Are Humans Still Evolving? 'Maybe More Rapidly Than Ever,' Says Scientist - Newsweek - December 31st, 2023 [December 31st, 2023]
- The Intersection of Real Estate and Fintech: Evolution, Impact of Policies, and Global Dynamics - CXOToday.com - December 31st, 2023 [December 31st, 2023]
- Kyle Richards' Style Evolution: Her Best Looks - Us Weekly - December 31st, 2023 [December 31st, 2023]
- Criminal Minds: Evolution Season 2's "Deeper Secrets" Teased By Aisha Tyler - Screen Rant - December 31st, 2023 [December 31st, 2023]
- Saturday: Hili dialogue Why Evolution Is True - Why Evolution Is True - December 31st, 2023 [December 31st, 2023]
- NBA 2K24 MyTEAM New Year Resolution Adds 14 Evolution Cards - ClutchPoints - December 31st, 2023 [December 31st, 2023]
- dive into the history of NASA's logo evolution from the space ... - Designboom - November 8th, 2023 [November 8th, 2023]
- Resolving the puzzle of same-sex sexual interactions - Nature.com - November 8th, 2023 [November 8th, 2023]
- The History and Evolution of Black Friday And How It Got Its Name - Yahoo Life - November 8th, 2023 [November 8th, 2023]
- Evolution of Terran R, with Tim Ellis (Relativity Space) - Payload - November 8th, 2023 [November 8th, 2023]
- Brownell Raves About Breakout Junior's Evolution - The Clemson Insider - November 8th, 2023 [November 8th, 2023]