عرض بسيط للتسجيلة

المؤلفZayyan, Ayman A.
المؤلفElmahdy, Mohamed
المؤلفHusni, Husniza Binti
المؤلفYousf, Shahrul Azmi
تاريخ الإتاحة2021-09-01T10:02:46Z
تاريخ النشر2016
اسم المنشورProceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA
المصدرScopus
معرّف المصادر الموحدhttp://dx.doi.org/10.1109/AICCSA.2016.7945665
معرّف المصادر الموحدhttp://hdl.handle.net/10576/22397
الملخصIn this paper, the problem of missing diacritic marks in most of dialectal Arabic written resources is addressed. Our aim is to implement a scalable and extensible platform for automatically retrieving the diacritic marks for undiacritized dialectal Arabic texts. Different rule-based and statistical techniques are proposed. These include: morphological analyzer-based, maximum likelihood estimate, and statistical n-gram models. The proposed platform includes helper tools for text preprocessing and encoding conversion. Diacritization accuracy of each technique is evaluated in terms of Diacritic Error Rate (DER) and Word Error Rate (WER). The approach trains several n-gram models on different lexical units. A data pool of both Modern Standard Arabic (MSA) data along with Dialectal Arabic data was used to train the models. 2016 IEEE.
اللغةen
الناشرIEEE Computer Society
الموضوعText processing
Diacritization
Dialectal arabics
Maximum likelihood estimate
Modern standards
Morphological analyzer
Statistical techniques
Text preprocessing
Vowelization
Maximum likelihood estimation
العنوانCrosslingual automatic diacritization for Egyptian Colloquial Dialect
النوعConference Paper
dc.accessType Abstract Only


الملفات في هذه التسجيلة

الملفاتالحجمالصيغةالعرض

لا توجد ملفات لها صلة بهذه التسجيلة.

هذه التسجيلة تظهر في المجموعات التالية

عرض بسيط للتسجيلة