Publication Date:
2023
Short description:
Towards a Unified Digital Resource for Tunisian Arabic Lexicography / Gugliotta, Elisa; Mallia, Michele; Panascì, Livia. - (2023), pp. 579-590. [10.34619/srmk-injj]
abstract:
This paper presents our work on linking language tools for Tunisian Arabic, focusing on a lexicographic database and a corpus of informal written texts. This work on Tunisian Arabic is an ongoing pilot study, while our wider goal is
to create resources for various under-resourced
languages. We outline a methodology that
emphasises open science principles, leveraging existing language resources and NLP tools for standardisation and annotation. Our approach ensures reproducibility and benefits other researchers. We share annotated data on a
digital platform and release NLP tools on a dedicated repository. Our work aligns with FAIR principles, facilitating open and effective research on under-resourced languages.
to create resources for various under-resourced
languages. We outline a methodology that
emphasises open science principles, leveraging existing language resources and NLP tools for standardisation and annotation. Our approach ensures reproducibility and benefits other researchers. We share annotated data on a
digital platform and release NLP tools on a dedicated repository. Our work aligns with FAIR principles, facilitating open and effective research on under-resourced languages.
Iris type:
2.1 Contributo in volume (Capitolo o Saggio)
Keywords:
Tunisian Arabic, lexicographic database, informal written texts, under-resourced languages, language resources, NLP tools, standardisation, annotation, open science, reproducibility, FAIR principles, digital platform
List of contributors:
Gugliotta, Elisa; Mallia, Michele; Panascì, Livia
Book title:
Proceedings of the 4th Conference on Language, Data and Knowledge