Skip to main navigation Skip to search Skip to main content

Transfer Learning and Custom Loss Applied to Transformer-Based Text Translation for Sign Language Animated Subtitles

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

Online learning's rise presents unique challenges for the deaf community, particularly in understanding educational videos. This research addresses the problem by proposing a solution to generate animated subtitles in the Indonesian Sign Language System, namely SIBI (Sistem Isyarat Bahasa Indonesia). The existing method produces word-by-word animations from directly recognized spoken text that are too fast and difficult to follow. We developed a text translation model for the Moodle application to shorten the original spoken text of the educational videos into SIBI text without losing the crucial information. We propose transfer learning to train our transformer-based models and use a custom loss function to ensure alignment with the SIBI dictionary. Pre-trained models mBART50 and NLLB200 were fine-tuned on the SIBIVID-MP12 dataset, which was created in collaboration with Special Education teachers. Experiment results show that the proposed method improves translation metrics significantly, with the best performance observed for the NLLB200 model fine-tuned with our proposed custom loss, achieving sacreBLEU, chrF++, METEOR, and ROUGE-L improvements of 71%, 9.79%, 22.92%, and 14.55%, respectively. This research demonstrates the potential for enhanced inclusivity in online learning for the deaf community.

Original languageEnglish
Pages (from-to)36858-36876
Number of pages19
JournalIEEE Access
Volume13
DOIs
Publication statusPublished - 2025

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 4 - Quality Education
    SDG 4 Quality Education

Keywords

  • Machine translation
  • NLLB200
  • SIBI
  • custom loss
  • mBART50
  • subtitle generation
  • transfer learning

Fingerprint

Dive into the research topics of 'Transfer Learning and Custom Loss Applied to Transformer-Based Text Translation for Sign Language Animated Subtitles'. Together they form a unique fingerprint.

Cite this