Abstract
While structure learning achieves remarkable performance in high-resource languages, the situation differs for under-represented languages due to the scarcity of annotated data.This study focuses on assessing the efficacyof transfer learning in enhancing dependency parsing for Javanese—a language spoken by 80million individuals but characterized by limited representation in natural language processing. We utilized the Universal Dependencies dataset consisting of dependency treebanks from more than 100 languages, including Javanese. We propose two learning strategies to train the model: transfer learning (TL) and hierarchical transfer learning (HTL). While TL onlyuses a source language to pre-train the model, the HTL method uses a source language and an intermediate language in the learning process. The results show that our best model usesthe HTL method, which improves performance with an increase of 10 % for both UAS and LAS evaluations compared to the baseline model
Original language | English |
---|---|
Title of host publication | Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics: Student Research Workshop |
Pages | 1-9 |
DOIs | |
Publication status | Published - 2023 |
Event | Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics: Student Research Workshop - Nusa Dua, Bali Duration: 1 Nov 2023 → 1 Nov 2023 |
Conference
Conference | Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics: Student Research Workshop |
---|---|
Period | 1/11/23 → 1/11/23 |