A semantic approach for document clustering

Shaban, Khaled B.

Author	Shaban, Khaled B.
Available date	2022-12-21T10:01:45Z
Publication Date	2009
Publication Name	Journal of Software
Resource	Scopus
URI	http://dx.doi.org/10.4304/jsw.4.5.391-404
URI	http://hdl.handle.net/10576/37487
Abstract	Conventional document mining systems mainly use the presence or absence of keywords to mine texts. However, simple word counting and frequency distributions of term appearances do not capture the meaning behind the words, which results in limiting the ability to mine the texts. In this paper, the application of a semantic understandingbased approach to mine documents is presented. The approach is based on semantic notions to represent text, and to measure similarity between text documents. The representation scheme reflects existing relations between concepts and facilitates accurate similarity measurements that result in better mining performance. A document mining process, namely semantic document clustering, is investigated and tackled in various ways. The proposed representation scheme along with the proposed similarity measure were implemented as vital components of a mining system. The approach has enabled more effective document clustering than what conventional techniques would provide. The experimental work is reported, and its results are presented and analyzed. 2009 ACADEMY PUBLISHER.
Language	en
Publisher	Academy Publisher
Subject	Document clustering Document mining Semantic understanding Similarity measure Text representation
Title	A semantic approach for document clustering
Type	Article
Pagination	391-404
Issue Number	5
Volume Number	4

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Computer Science & Engineering [‎2202‎ items ]

Show simple item record

A semantic approach for document clustering

Files in this item

This item appears in the following Collection(s)

Video