Author: Zayyan, Ayman A.
Author: Elmahdy, Mohamed
Author: Husni, Husniza binti
Author: Al Ja'am, Jihad M.
Date available: 2021-07-05T10:58:31Z
Date issued: 2016
Publication name: ISCAIE 2016 - 2016 IEEE Symposium on Computer Applications and Industrial Electronics
Source: Scopus
URI: http://dx.doi.org/10.1109/ISCAIE.2016.7575067
URI: http://hdl.handle.net/10576/21085
Abstract: In this paper, the problem of missing diacritic marks in most written Arabic resources is investigated. Our aim is to implement a scalable and extensible platform that automatically restores missing diacritic marks in Modern Standard Arabic text. Different rule-based and statistical techniques are proposed, including morphological analyzer-based, maximum likelihood estimate, and statistical n-gram models. The diacritization accuracy of each technique was evaluated using Diacritic Error Rate (DER) and Word Error Rate (WER). The proposed platform includes helper tools for text preprocessing and encoding conversion. It yielded a WER of 7.1% and a DER of 3.9%. When the case ending was ignored, the platform yielded a WER of 5.1% and a DER of 2.7%. © 2016 IEEE.
Language: en
Publisher: Institute of Electrical and Electronics Engineers Inc.
Subject: Arabic; diacritization; text processing; vowelization
Title: Automatic diacritics restoration for modern standard Arabic text
Type: Conference
Pages: 221-225
dc.accessType: Abstract Only


Files in this item

There are no files associated with this item.
