An efficient approach for textual data classification using deep learning
Author | Alqahtani, Abdullah |
Author | Ullah Khan, Habib |
Author | Alsubai, Shtwai |
Author | Sha, Mohemmed |
Author | Almadhor, Ahmad |
Author | Iqbal, Tayyab |
Author | Abbas, Sidra |
Available date | 2022-12-29T07:18:04Z |
Publication Date | 2022-09-15 |
Publication Name | Frontiers in Computational Neuroscience |
Identifier | http://dx.doi.org/10.3389/fncom.2022.992296 |
Citation | Alqahtani, A., Khan, H. U., Alsubai, S., Sha, M., Almadhor, A., Iqbal, T., & Abbas, S. (2022). An efficient approach for textual data classification using deep learning. Frontiers in Computational Neuroscience, 16. |
ISSN | 1662-5188 |
Abstract | Text categorization is an effective activity that can be accomplished using a variety of classification algorithms. In machine learning, the classifier is built by learning the features of categories from a set of preset training data. Similarly, deep learning offers enormous benefits for text classification since they execute highly accurately with lower-level engineering and processing. This paper employs machine and deep learning techniques to classify textual data. Textual data contains much useless information that must be pre-processed. We clean the data, impute missing values, and eliminate the repeated columns. Next, we employ machine learning algorithms: logistic regression, random forest, K-nearest neighbors (KNN), and deep learning algorithms: long short-term memory (LSTM), artificial neural network (ANN), and gated recurrent unit (GRU) for classification. Results reveal that LSTM achieves 92% accuracy outperforming all other model and baseline studies. |
Sponsor | Qatar University [IRCC-2021-010]. |
Language | en |
Publisher | Frontiers |
Subject | deep learning machine learning text categorization text classification text data |
Type | Article |
Volume Number | 16 |
ESSN | 1662-5188 |
Files in this item
This item appears in the following Collection(s)
-
Accounting & Information Systems [527 items ]