Lahajat: A rule-based converter of standard Arabic lexical databases into spoken Arabic forms

Chapter
Publication Date:
2016
Short description:
Lahajat: A rule-based converter of standard Arabic lexical databases into spoken Arabic forms / Lancioni, Giuliano; Gugliotta, Elisa; Pettinari, Valeria. - (2016), pp. 395-399.
abstract:
Lexical resources for Arabic tend to focus on the standard variety of the language (Modern Standard Arabic, MSA), which is used mostly in written and formal sources. However, the spread of informal genres has made it increasingly necessary to produce broader resources that encompass the features of spoken varieties commonly found in written texts. The Lahajat project addresses this need by providing a series of rule-based transformations that extend existing MSA lexical resources to cover typical morphonological features of spoken varieties. In particular, two case studies are presented, applying the approach to two widely diverging varieties: Egyptian Arabic and Tunisian Arabish.
Iris type:
2.1 Contributo in volume (Capitolo o Saggio)
Keywords:
Lexical Database, Negation, Morphological Analysis, French-speaking, Postage, Standard Language, Code-switching, Rule-based Approach, Personal Pronouns, Transliteration, Loanwords, Parallel Corpus, Social Media Text
List of contributors:
Lancioni, Giuliano; Gugliotta, Elisa; Pettinari, Valeria
Authors of the University:
GUGLIOTTA Elisa
Handle:
https://iris.uniss.it/handle/11388/362150
Book title:
4th IEEE International Colloquium on Information Science and Technology (CiSt)