Binarization of Degraded Document Images Using Convolutional Neural Networks and Wavelet-Based Multichannel Images
Author | Akbari Y. |
Author | Al-Maadeed, Somaya |
Author | Adam K. |
Available date | 2022-05-19T10:23:11Z |
Publication Date | 2020 |
Publication Name | IEEE Access |
Resource | Scopus |
Identifier | http://dx.doi.org/10.1109/ACCESS.2020.3017783 |
Abstract | Convolutional neural networks (CNNs) have previously been broadly utilized to binarize document images. These methods have problems when faced with degraded historical documents. This paper proposes the utilization of CNNs to identify foreground pixels using novel input-generated multichannel images. To create the images, the original source image is decomposed into wavelet subbands. Then, the original image is approximated by each subband separately, and finally, the multichannel image is constituted by arranging the original source image (grayscale image) as the first channel and the approximated image by each subband as the remaining channels. To achieve the best results, two scenarios are considered, that is, two-channel and four-channel images, and then fed into two types of CNN architectures, namely, single and multiple streams. To investigate the effect of the multichannel images proposed as network inputs, the CNNs used in the architectures are three popular networks, namely, U-net, SegNet, and DeepLabv3+. The experimental results of the scenarios demonstrate that our method is more successful than the three CNNs when trained by the original source images and proves competitive performance in comparison with state-of-the-art results using the DIBCO database. |
Sponsor | This work was supported by the Qatar National Research Fund (a member of Qatar Foundation) through the National Priority Research Program (NPRP) under Grant NPRP 7-442-1-082. |
Language | en |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Subject | Convolution Network architecture Competitive performance Degraded document images Gray-scale images Historical documents Multichannel images Multiple streams State of the art Wavelet subbands Convolutional neural networks |
Type | Article |
Pagination | 153517-153534 |
Volume Number | 8 |
Files in this item
Files | Size | Format | View |
---|---|---|---|
There are no files associated with this item. |
This item appears in the following Collection(s)
-
Computer Science & Engineering [2408 items ]