UPDATED 22:45 EDT / SEPTEMBER 27 2016

NEWS

Google says it has almost cracked language translation

When Google launched its Translate service a decade ago, the Internet was pretty much devoid of digital language translation tools. Google Translate became more useful over time, but it could also be catastrophic if one had serious work to do, leading many people to doubt these tools would ever work well.

That was then. On Tuesday, Google announced that it has created an AI translation tool, called the Google Neural Machine Translation system (GNMT), that is 60 percent more accurate than the Google Translate we all know. The new tool, which uses a so-called Phrase-Based Machine Translation algorithm, is embarrassing its former Google Translate tool in a translation stand-off and, more important, just about equaling the score of human translators.

image00

Tests show that when translating Spanish to English, Google Translate scored 3.6 in an accuracy test. 6 is the perfect translation while 0 is equal to “total nonsense.” Google’s new translation tool scored a 5 in the test, while human translators, according to the report, generally score around 5.1.  We must remember these are cognate languages, meaning almost 40 percent of words in both languages are related. But Google is so confident that its first release of GNMT will be English-Mandarin Chinese.

Google explains in a technical report just how it works:

Our model consists of a deep LSTM network with 8 encoder and 8 decoder layers using attention and residual connections. To improve parallelism and therefore decrease training time, our attention mechanism connects the bottom layer of the decoder to the top layer of the encoder. To accelerate the final translation speed, we employ low-precision arithmetic during inference computations. To improve handling of rare words, we divide words into a limited set of common sub-word units (“wordpieces”) for both input and output.

In laymen’s terms, LSTM, for long short-term memory, works as our own memories work when translating language. Short-term memory can find the best translation for one word, but long-term memory will remember complex or unusual word orders. The old Google Translate method just broke down sentences by words or phrases, but didn’t remember entire complex sentences. That’s why some translations, especially using idiomatic expressions, might end up looking hilarious, profound or insulting.

“What we have now is not perfect. But you can tell that it is much, much better,” Google engineer Mike Schuster told Wired. As Wired pointed out, there’s now a big race in which tech giants are competing to create the perfect translation machine with the aid of neural networks. Peter Lee, a corporate vice president at Microsoft Research who helps lead some AI research at the company, said Microsoft is “racing against everyone… We’re all on the verge.”

Photo of Tower of Babel: photopin cc

Since you’re here …

… We’d like to tell you about our mission and how you can help us fulfill it. SiliconANGLE Media Inc.’s business model is based on the intrinsic value of the content, not advertising. Unlike many online publications, we don’t have a paywall or run banner advertising, because we want to keep our journalism open, without influence or the need to chase traffic.The journalism, reporting and commentary on SiliconANGLE — along with live, unscripted video from our Silicon Valley studio and globe-trotting video teams at theCUBE — take a lot of hard work, time and money. Keeping the quality high requires the support of sponsors who are aligned with our vision of ad-free journalism content.

If you like the reporting, video interviews and other ad-free content here, please take a moment to check out a sample of the video content supported by our sponsors, tweet your support, and keep coming back to SiliconANGLE.