From Feature To Paradigm: Deep Learning In Machine Translation

Marta R. Costa-jussà

doi:10.1613/jair.1.11198

PDF

Published: Apr 30, 2018

DOI: https://doi.org/10.1613/jair.1.11198

Marta R. Costa-jussà

Abstract

In the last years, deep learning algorithms have highly revolutionized several areas including speech, image and natural language processing. The speciﬁc ﬁeld of Machine Translation (MT) has not remained invariant. Integration of deep learning in MT varies from re-modeling existing features into standard statistical systems to the development of a new architecture. Among the diﬀerent neural networks, research works use feed-forward neural networks, recurrent neural networks and the encoder-decoder schema. These architectures are able to tackle challenges as having low-resources or morphology variations.

This manuscript focuses on describing how these neural networks have been integrated to enhance diﬀerent aspects and models from statistical MT, including language modeling, word alignment, translation, reordering, and rescoring. Then, we report the new neural MT approach together with a description of the foundational related works and recent approaches on using subword, characters and training with multilingual languages, among others. Finally, we include an analysis of the corresponding challenges and future work in using deep learning in MT.

Issue

Vol. 61 (2018)

Section

Articles

Article Sidebar

Main Article Content

Abstract

Article Details