Binarization of degraded document images using convolutional neural networks based on predicted two-channel images
Author | Akbari Y. |
Author | Britto A.S. |
Author | Al-Maadeed, Somaya |
Author | Oliveira L.S. |
Date available | 2022-05-19T10:23:11Z |
Publication date | 2019 |
Publication name | Proceedings of the International Conference on Document Analysis and Recognition, ICDAR |
Source | Scopus |
Identifier | http://dx.doi.org/10.1109/ICDAR.2019.00160 |
Abstract | Because most historical documents are in poor condition, binarization struggles to separate background pixels from foreground pixels in document images. This paper proposes Convolutional Neural Networks (CNNs) that take predicted two-channel images as input and are trained to classify the foreground pixels. Promising results from the use of multispectral images for semantic segmentation inspired our efforts to create a novel prediction-based two-channel image. In our method, the original image is binarized by the structural symmetric pixels (SSPs) method, and the two-channel image is constructed from the original image and its binarized image. To explore the impact of the proposed two-channel images as network inputs, we use two popular CNN architectures, namely SegNet and U-net. The results presented in this work show that our approach fully outperforms SegNet and U-net trained on the original images, and that it is competitive and robust compared with state-of-the-art results on the DIBCO database. |
Sponsor | This publication was made possible by NPRP grant # NPRP8-140-2-065 from Qatar National Research Fund (a member of Qatar Foundation). The statements made herein are solely the responsibility of the authors. |
Language | en |
Publisher | IEEE Computer Society |
Subject | Convolution; Image segmentation; Pixels; Semantics; Background pixels; Degraded document images; Document image binarization; Historical documents; Multispectral images; Prediction-based; SegNet; Semantic segmentation; Convolutional neural networks |
Type | Conference |
Pages | 973-978 |
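
The two-channel input described in the abstract (the original image paired with a binarized version of it) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not specify the SSP algorithm, so the hypothetical `placeholder_binarize` below uses a plain global mean threshold as a stand-in, and the channel order, the [0, 1] scaling, and the function names are all assumptions made for illustration.

```python
import numpy as np


def placeholder_binarize(gray: np.ndarray) -> np.ndarray:
    """Stand-in binarizer (NOT the SSP method): global mean threshold.

    Returns 1.0 for bright (background) pixels and 0.0 for dark (ink) pixels.
    """
    return (gray > gray.mean()).astype(np.float32)


def make_two_channel(gray_u8: np.ndarray) -> np.ndarray:
    """Stack a grayscale page and its binarized prediction into an H x W x 2 array.

    Channel 0: grayscale intensities scaled to [0, 1] (assumed normalization).
    Channel 1: predicted binary map from the placeholder binarizer.
    """
    gray = gray_u8.astype(np.float32) / 255.0
    binary = placeholder_binarize(gray)
    return np.stack([gray, binary], axis=-1)


# Tiny synthetic "page": bright background with three dark ink pixels.
page = np.array([[250, 240, 30],
                 [245, 20, 235]], dtype=np.uint8)
two_channel = make_two_channel(page)
print(two_channel.shape)
```

An array of this shape could then be fed to a segmentation network whose first convolution expects two input channels; how the paper configures SegNet and U-net for this input is not stated in this record.
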
Files in this item
Files | Size | Format | View |
---|---|---|---|
There are no files associated with this item. |
This item appears in the following collections
- Computer Science & Engineering [2409 items]