Document clustering analysis based on hybrid cuckoo search and K-means algorithm

dc.contributor.authorBoushaki, Saida Ishak
dc.contributor.authorBendjeghaba, Omar
dc.contributor.authorBrakta, Noureddine
dc.date.accessioned2022-02-15T08:21:22Z
dc.date.available2022-02-15T08:21:22Z
dc.date.issued2021
dc.description.abstractThe clustering is an interesting technique for unsupervised document organization in the World Wide Web (WWW). The most widely used partitioning clustering algorithm is K-means. However, it has an issue with random initialization, which might lead to local optimum situations. In fact, metaheuristics-based clustering has demonstrated their efficiency to reach a global solution instead of local one. The Cuckoo search (CS) has been widely used for the clustering problem. However, the number of iterations grows dramatically when the dataset is high dimensional like the documents. In this study, the hybridization cuckoo search and K-means algorithms for the document clustering are analyzed. So, three hybrid algorithms are investigated and compared. The performance and the efficiency of the proposed algorithms are evaluated using Reuters 21578 Text Categorization Benchmark Dataset. The obtained results show the capability of the new approaches to generate more compact clustering and enhancing purity and F-measure clustering qualitiesen_US
dc.identifier.urihttps://dspace.univ-boumerdes.dz/handle/123456789/7611
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.relation.ispartofseries2021 IEEE 12th Annual Information Technology, Electronics and Mobile Communication Conference, IEMCON 2021;pp. 58-62
dc.subjectCuckoo Searchen_US
dc.subjectK-meansen_US
dc.subjectDocument Clusteringen_US
dc.subjectOptimizationen_US
dc.subjectMetaheuristicen_US
dc.subjectF-measureen_US
dc.subjectPurityen_US
dc.subjectVector Spaceen_US
dc.titleDocument clustering analysis based on hybrid cuckoo search and K-means algorithmen_US
dc.typeOtheren_US

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: