Crosslingual automatic diacritization for Egyptian Colloquial Dialect

Zayyan, Ayman A.; Elmahdy, Mohamed; Husni, Husniza Binti; Yousf, Shahrul Azmi

Author	Zayyan, Ayman A.
Author	Elmahdy, Mohamed
Author	Husni, Husniza Binti
Author	Yousf, Shahrul Azmi
Available date	2021-09-01T10:02:46Z
Publication Date	2016
Publication Name	Proceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA
Resource	Scopus
URI	http://dx.doi.org/10.1109/AICCSA.2016.7945665
URI	http://hdl.handle.net/10576/22397
Abstract	In this paper, the problem of missing diacritic marks in most of dialectal Arabic written resources is addressed. Our aim is to implement a scalable and extensible platform for automatically retrieving the diacritic marks for undiacritized dialectal Arabic texts. Different rule-based and statistical techniques are proposed. These include: morphological analyzer-based, maximum likelihood estimate, and statistical n-gram models. The proposed platform includes helper tools for text preprocessing and encoding conversion. Diacritization accuracy of each technique is evaluated in terms of Diacritic Error Rate (DER) and Word Error Rate (WER). The approach trains several n-gram models on different lexical units. A data pool of both Modern Standard Arabic (MSA) data along with Dialectal Arabic data was used to train the models. 2016 IEEE.
Language	en
Publisher	IEEE Computer Society
Subject	Text processing Diacritization Dialectal arabics Maximum likelihood estimate Modern standards Morphological analyzer Statistical techniques Text preprocessing Vowelization Maximum likelihood estimation
Title	Crosslingual automatic diacritization for Egyptian Colloquial Dialect
Type	Conference Paper
dc.accessType	Abstract Only

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Computer Science & Engineering [‎2402‎ items ]

Show simple item record

Crosslingual automatic diacritization for Egyptian Colloquial Dialect

Files in this item

This item appears in the following Collection(s)

Video