Enhancing Neural Arabic Machine Translation using Character-Level CNN-BILSTM and Hybrid Attention


  • Dhaya Eddine Messaoudi ICOSI Laboratory, Abbes Laghrour University, Khenchela, Algeria
  • Djamel Nessah ICOSI Laboratory, Abbes Laghrour University, Khenchela, Algeria
Volume: 14 | Issue: 5 | Pages: 17029-17034 | October 2024 | https://doi.org/10.48084/etasr.8383


Neural Machine Translation (NMT) has made significant strides in recent years, especially with the advent of deep learning, which has greatly enhanced performance across various Natural Language Processing (NLP) tasks. Despite these advances, NMT still falls short of perfect translation, facing ongoing challenges such as limited training data, handling rare words, and managing syntactic and semantic dependencies. This study introduces a multichannel character-level NMT model with hybrid attention for Arabic-English translation. The proposed approach addresses issues such as rare words and word alignment by encoding characters and by incorporating Arabic word segmentation and part-of-speech tagging as handcrafted features in a multichannel CNN-BiLSTM encoder. The model then uses a BiLSTM decoder with hybrid attention to generate target language sentences. The proposed model was tested on a subset of the OPUS-100 dataset, achieving promising results.
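
As a rough illustration of the multichannel character-level input described above, the following minimal sketch aligns three channels per character: the character itself, a segmentation tag, and the part-of-speech tag of the containing word. The function name `build_channels`, the B/I/O tagging scheme, the example segmentation, and the tag labels are all assumptions made for illustration; the abstract does not specify how the channels are encoded.

```python
from typing import List, Tuple


def build_channels(
    segmented_words: List[List[str]],
    pos_tags: List[str],
) -> Tuple[List[str], List[str], List[str]]:
    """Return three aligned character-level channels for one sentence.

    segmented_words: each word given as a list of its segments (hypothetical
                     output of an external Arabic word segmenter).
    pos_tags:        one part-of-speech tag per word (hypothetical tagger output).
    """
    if len(segmented_words) != len(pos_tags):
        raise ValueError("need exactly one PoS tag per word")

    chars: List[str] = []
    seg: List[str] = []
    pos: List[str] = []
    for i, (segments, tag) in enumerate(zip(segmented_words, pos_tags)):
        if i > 0:
            # Word boundary: a space character with neutral tags (assumed scheme).
            chars.append(" ")
            seg.append("O")
            pos.append("O")
        for segment in segments:
            for j, ch in enumerate(segment):
                chars.append(ch)
                # "B" marks the first character of a segment, "I" the rest.
                seg.append("B" if j == 0 else "I")
                # Every character inherits the tag of its word.
                pos.append(tag)
    return chars, seg, pos


# Illustrative sentence: "وكتبها الطالب", with an assumed segmentation and tags.
sentence = [["و", "كتب", "ها"], ["ال", "طالب"]]
tags = ["VERB", "NOUN"]
chars, seg, pos = build_channels(sentence, tags)
for row in zip(chars, seg, pos):
    print(row)
```

In a full model, each channel would presumably be mapped to integer ids and embedded separately before being passed to the CNN-BiLSTM encoder; that step is omitted here because the abstract gives no details about it.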


Arabic natural language processing, deep-learning, machine translation, deep CNN Bi-LSTM, hybrid attention, PoS-tagging, Arabic word segmentation


