Enabling indexing and retrieval of historical Arabic manuscripts through template matching based word spotting
المؤلف | Faisal T. |
المؤلف | Al-Maadeed, Somaya |
تاريخ الإتاحة | 2022-05-19T10:23:12Z |
تاريخ النشر | 2017 |
اسم المنشور | 1st IEEE International Workshop on Arabic Script Analysis and Recognition, ASAR 2017 |
المصدر | Scopus |
المعرّف | http://dx.doi.org/10.1109/ASAR.2017.8067760 |
الملخص | We present a holistic segmentation-free query by example word spotting technique based on template matching. We have applied this technique to a dataset of historical Arabic handwritten manuscript images. First, the documents as well as query word images are pre-processed for separating text from the noisy background and converting to their binary equivalents. Then a pixel based approach is used for computing the similarity between the pre-processed template query word and document images by using the Correlation similarity measure. Slight variations in font sizes are tolerated by adjusting the threshold of similarity. Our robust pre-processing algorithm significantly enhances the performance of the learning-free template matching based word spotting approach. The proposed technique is simple as well as efficient as it does not involve any time-consuming learning steps. Experiments with a historical Arabic dataset yield promising results. This technique can generate locations of occurrences of query word images which is the fundamental step towards building searchable indexes for historical manuscripts. |
راعي المشروع | ACKNOWLEDGMENT This publication was made possible by QUCP grant number QUCP-CENG-CSE-15\16-1 from Qatar University. The statements made herein are solely the responsibility of the authors. |
اللغة | en |
الناشر | Institute of Electrical and Electronics Engineers Inc. |
الموضوع | Image processing Document images Indexing and retrieval Pixel based approach Pre-processing algorithms Query words Query-by example Similarity measure Word Spotting Template matching |
النوع | Conference |
الصفحات | 57-63 |
الملفات في هذه التسجيلة
الملفات | الحجم | الصيغة | العرض |
---|---|---|---|
لا توجد ملفات لها صلة بهذه التسجيلة. |
هذه التسجيلة تظهر في المجموعات التالية
-
علوم وهندسة الحاسب [2426 items ]