Show simple item record

AuthorZayyan, Ayman A.
AuthorElmahdy, Mohamed
AuthorHusni, Husniza binti
AuthorAl Ja'am, Jihad M.
Available date2021-07-05T10:58:31Z
Publication Date2016
Publication NameISCAIE 2016 - 2016 IEEE Symposium on Computer Applications and Industrial Electronics
ResourceScopus
URIhttp://dx.doi.org/10.1109/ISCAIE.2016.7575067
URIhttp://hdl.handle.net/10576/21085
AbstractIn this paper, the problem of missing diacritic marks in most of Arabic written resources is investigated. Our aim is to implement a scalable and extensible platform to automatically restore missing diacritic marks for Modern Standard Arabic text. Different rule-based and statistical techniques are proposed. These include: morphological analyzer-based, maximum likelihood estimate, and statistical n-gram models. Diacritization accuracy of each technique was evaluated based on Diacritic Error Rate (DER) and Word Error Rate (WER). The proposed platform includes helper tools for text preprocessing and encoding conversion. It yielded a WER of 7.1% and DER of 3.9%. When the case ending was ignored, the platform yielded a WER and DER of 5.1% and 2.7%, respectively. 2016 IEEE.
Languageen
PublisherInstitute of Electrical and Electronics Engineers Inc.
SubjectArabic
diacritization
text processing
vowelization
TitleAutomatic diacritics restoration for modern standard Arabic text
TypeConference Paper
Pagination221-225


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record