Wednesday,  December 11 , 2024

Linkedin Pinterest
News / Business

Translating tool getting really, really accurate

By Karen Turner, Special To The Washington Post
Published: October 9, 2016, 4:00pm

Any students using Google Translate to cheat on their Spanish homework can rejoice. The foreign-language translation tool is about to get a whole lot more accurate.

Last week, Google launched an updated translation tool that utilizes sophisticated artificial intelligence to produce startlingly accurate language translations. While the tool has been used to successfully translate between English and Spanish, French and Chinese in a research setting, it’s only available currently to everyday users for Chinese to English translations. The new system, which uses deep machine learning to mimic the functioning of a human brain, is called the Google Neural Machine Translation system, or GNMT.

To test the system, Google had human raters evaluate translations on a scale from 0 to 6. Translating from English to Spanish, the new Google tool’s translation was rated an average of 5.43; human translators earned an average of 5.5. For Chinese to English, the only public-facing option that currently utilizes the new system, Google Translate was rated an average of 4.3 while human translators got 4.6.

Overall, across all three languages, Google said its new tool is 60 percent more accurate than the old Google Translate tool, which used phrase-based machine translation, or PBMT. “With the previous PBMT model, when we translate a sentence from one language to another, we would translate one word or a phrase in the source sentence at a time, then re-order the words in the correct grammar of the target language,” said Quoc Le, a Google researcher who worked on the project.

“The complication is language has a lot of ambiguity. In our new GNMT system, we treat a whole sentence as a unit, and translate (the words) in a group.”

The complexity of a translation machine that can digest entire phrases rather than rely on word-by-word translation are somewhat lost even on the researchers themselves.

Despite its improved accuracy, the GNMT model still mistranslates rare terms and occasionally drops words. And it hasn’t acquired any common sense. Given the sentence “the trophy cannot fit the cabinet because it’s too big,” the model could mistranslate because it doesn’t know which “it” is the one that’s too big. “GNMT doesn’t have a model of the how the world actually works yet,” said Le.

Support local journalism

Your tax-deductible donation to The Columbian’s Community Funded Journalism program will contribute to better local reporting on key issues, including homelessness, housing, transportation and the environment. Reporters will focus on narrative, investigative and data-driven storytelling.

Local journalism needs your help. It’s an essential part of a healthy community and a healthy democracy.

Community Funded Journalism logo
Loading...