How voice technology is transforming computing

Composition of human head, key symbol and fractal design elements on the subject of encryption, security, digital communications, science and technology

ANY sufficiently advanced technology, noted Arthur C. Clarke, a British science-fiction writer, is indistinguishable from magic. The fast-emerging technology of voice computing proves his point. Using it is just like casting a spell: say a few words into the air, and a nearby device can grant your wish.

The Amazon Echo, a voice-driven cylindrical computer that sits on a table top and answers to the name Alexa, can call up music tracks and radio stations, tell jokes, answer trivia questions and control smart appliances; even before Christmas it was already resident in about 4% of American households. Voice assistants are proliferating in smartphones, too: Apple’s Siri handles over 2bn commands a week, and 20% of Google searches on Android-powered handsets in America are input by voice. Dictating e-mails and text messages now works reliably enough to be useful. Why type when you can talk?

This is a huge shift. Simple though it may seem, voice has the power to transform computing, by providing a natural means of interaction. Windows, icons and menus, and then touchscreens, were welcomed as more intuitive ways to deal with computers than entering complex keyboard commands. But being able to talk to computers abolishes the need for the abstraction of a “user interface” at all. Just as mobile phones were more than existing phones without wires, and cars were more than carriages without horses, so computers without screens and keyboards have the potential to be more useful, powerful and ubiquitous than people can imagine today.

Voice will not wholly replace other forms of input and output. Sometimes it will remain more convenient to converse with a machine by typing rather than talking (Amazon is said to be working on an Echo device with a built-in screen). But voice is destined to account for a growing share of people’s interactions with the technology around them, from washing machines that tell you how much of the cycle they have left to virtual assistants in corporate call-centres. However, to reach its full potential, the technology requires further breakthroughs—and a resolution of the tricky questions it raises around the trade-off between convenience and privacy.

Alexa, what is deep learning?

Computer-dictation systems have been around for years. But they were unreliable and required lengthy training to learn a specific user’s voice. Computers’ new ability to recognise almost anyone’s speech dependably without training is the latest manifestation of the power of “deep learning”, an artificial-intelligence technique in which a software system is trained using millions of examples, usually culled from the internet. Thanks to deep learning, machines now nearly equal humans in transcription accuracy, computerised translation systems are improving rapidly and text-to-speech systems are becoming less robotic and more natural-sounding. Computers are, in short, getting much better at handling natural language in all its forms (see Technology Quarterly).

Although deep learning means that machines can recognise speech more reliably and talk in a less stilted manner, they still don’t understand the meaning of language. That is the most difficult aspect of the problem and, if voice-driven computing is truly to flourish, one that must be overcome. Computers must be able to understand context in order to maintain a coherent conversation about something, rather than just responding to simple, one-off voice commands, as they mostly do today (“Hey, Siri, set a timer for ten minutes”). Researchers in universities and at companies large and small are working on this very problem, building “bots” that can hold more elaborate conversations about more complex tasks, from retrieving information to advising on mortgages to making travel arrangements. (Amazon is offering a $1m prize for a bot that can converse “coherently and engagingly” for 20 minutes.)

When spells replace spelling

Consumers and regulators also have a role to play in determining how voice computing develops. Even in its current, relatively primitive form, the technology poses a dilemma: voice-driven systems are most useful when they are personalised, and are granted wide access to sources of data such as calendars, e-mails and other sensitive information. That raises privacy and security concerns.

To further complicate matters, many voice-driven devices are always listening, waiting to be activated. Some people are already concerned about the implications of internet-connected microphones listening in every room and from every smartphone. Not all audio is sent to the cloud—devices wait for a trigger phrase (“Alexa”, “OK, Google”, “Hey, Cortana”, or “Hey, Siri”) before they start relaying the user’s voice to the servers that actually handle the requests—but when it comes to storing audio, it is unclear who keeps what and when.

Police investigating a murder in Arkansas, which may have been overheard by an Amazon Echo, have asked the company for access to any audio that might have been captured. Amazon has refused to co-operate, arguing (with the backing of privacy advocates) that the legal status of such requests is unclear. The situation is analogous to Apple’s refusal in 2016 to help FBI investigators unlock a terrorist’s iPhone; both cases highlight the need for rules that specify when and what intrusions into personal privacy are justified in the interests of security.

Consumers will adopt voice computing even if such issues remain unresolved. In many situations voice is far more convenient and natural than any other means of communication. Uniquely, it can also be used while doing something else (driving, working out or walking down the street). It can extend the power of computing to people unable, for one reason or another, to use screens and keyboards. And it could have a dramatic impact not just on computing, but on the use of language itself. Computerised simultaneous translation could render the need to speak a foreign language irrelevant for many people; and in a world where machines can talk, minor languages may be more likely to survive. The arrival of the touchscreen was the last big shift in the way humans interact with computers. The leap to speech matters more. – The Economist

Related Posts
VimpelCom announces $40mln sale of Telecel stake to Zimbabwe govt
HARARE,– Amsterdam-headquartered telecoms giant VimpelCom on Wednesday announced that it had agreed to sell its controlling stake in Telecel Zimbabwe to the government of Zimbabwe for $40 million. VimpelCom, which owns ...
READ MORE
Facebook opens up Messenger app to developers, businesses
Facebook is opening up its Messenger chat application so that other software developers can add new features that will let people include photos, videos and other enhancements to their online ...
READ MORE
Econet’s Liquid Telecom expands FTTH in Zimbabwe
HARARE - Liquid Telecom has revealed the latest expansion of its FTTH network in Zimbabwe. Liquid Telecom has already laid fibre rings around Victoria Falls, which provide multiple routes of ...
READ MORE
Apple customers report devices crashing on iOS 9 update
SAN FRANCISCO/SYDNEY - A significant number of Apple Inc customers are reporting their mobile devices have crashed after attempting to upload the new iOS 9 operating system, the latest in ...
READ MORE
U.S., Apple ratchet up rhetoric in fight over encryption
WASHINGTON - The U.S. Department of Justice filed a motion on Friday seeking to compel Apple Inc (AAPL.O) to comply with a judge's order to unlock the encrypted iPhone belonging ...
READ MORE
‘Telecel should not have been allowed to operate without licence’ – Chinamasa
VICTORIA FALLS– Mobile operator Telecel Zimbabwe, whose operating licence was cancelled last month over alleged non-compliance with the country’s regulations, should not have been allowed to operate without a licence ...
READ MORE
Apple Results to Show How Far iPhone Sales Have Fallen
Apple Inc. has warned investors that its quarterly earnings report will bear bad news about iPhone sales. The question now is just how bad. The world’s most valuable company forecast in ...
READ MORE
Telecel pays licence fee installment for June
HARARE,– Telecel Zimbabwe, which is currently embroiled in a legal battle with government over the cancellation of its operating licence on Friday said it had paid its licence fee for ...
READ MORE
Telecel teeters on brink of insolvency
Zimbabwe’s beleaguered mobile network operator Telecel Zimbabwe’s finances are in a sorry state amid indications the company is bankrupt. The company reportedly has management inefficiencies, high director fees and is ...
READ MORE
A mobile phone shows a Facebook page   promoting Hillary Clinton for president in 2016, in this photo illustration taken April 13, 2015. REUTERS/Mike Segar
As the U.S. presidential campaign heats up, Facebook Inc is going out of its way to show its neutrality - an increasingly urgent matter for the social network as evidence ...
READ MORE
VimpelCom announces $40mln sale of Telecel stake to
Facebook opens up Messenger app to developers, businesses
Econet’s Liquid Telecom expands FTTH in Zimbabwe
Apple customers report devices crashing on iOS 9
U.S., Apple ratchet up rhetoric in fight over
‘Telecel should not have been allowed to operate
Apple Results to Show How Far iPhone Sales
Telecel pays licence fee installment for June
Telecel teeters on brink of insolvency
Facebook’s political influence under a microscope

Arts & Entertainment

Arts & Entertainment

Jennifer Lopez coyly addresses Drake relationship amid romance rumours

20th January 2017 Staff Reporter 0

Jennifer Lopez claims her relationship with rapper Drake is strictly professional. In a new interview, the Shades Of Blue actress avoided questions about their rumoured romance and instead, confirmed they have collaborated on a new […]

Arts & Entertainment

Paris Jackson ‘wants to vomit’ seeing white actor as ‘King of Pop’

12th January 2017 Staff Reporter 0

Michael Jackson’s daughter Paris says she is “incredibly offended” by a new comic film in which a white British actor, Joseph Fiennes, portrays the late “King of Pop.” Paris Jackson, 18, also called the tongue-in-cheek […]