Big data clustering based on spark chaotic improved particle swarm optimization

dc.contributor.authorBoushaki, Saida Ishak
dc.contributor.authorMahammed, Brahim Hadj
dc.contributor.authorBendjeghaba, Omar
dc.contributor.authorMosbah, Messaoud
dc.date.accessioned2024-03-17T09:29:41Z
dc.date.available2024-03-17T09:29:41Z
dc.date.issued2024
dc.description.abstractIn recent years, the surge in continuously accelerating data generation has given rise to the prominence of big data technology. The MapReduce architecture, situated at the core of this technology, provides a robust parallel environment. Spark, a leading framework in the big data landscape, extends the capabilities of the traditional MapReduce model. Coping with big data, especially in the realm of clustering, requires more efficient techniques. Meta-heuristic-based clustering, known for offering global solutions within reasonable time frames, emerges as a promising approach. This paper introduces a parallel-distributed clustering algorithm for big data within the Spark Framework, named Spark, chaotic improved PSO (S-CIPSO). Centered on particle swarm optimization (PSO), the proposed algorithm is enhanced with a chaotic map and an efficient procedure. Test results, conducted on both real and artificial datasets, establish the superior performance and quality of clustering results achieved by the proposed approach. Additionally, the scalability and robustness of S-CIPSO are validated, demonstrating its effectiveness in handling large-scale datasets.en_US
dc.identifier.issn2502-4752
dc.identifier.urihttps://ijeecs.iaescore.com/index.php/IJEECS/article/view/35388
dc.identifier.urihttp://doi.org/10.11591/ijeecs.v34.i1.pp419-429
dc.identifier.urihttps://dspace.univ-boumerdes.dz/handle/123456789/13705
dc.language.isoenen_US
dc.publisherInstitute of Advanced Engineering and Science (IAES)en_US
dc.relation.ispartofseriesIndonesian Journal of Electrical Engineering and Computer ScienceOpen Access/ Vol. 34, N°1(2024);PP. 419 - 429
dc.subjectBig data clusteringen_US
dc.subjectChaotic mapen_US
dc.subjectMapReduceen_US
dc.subjectParticle swarm optimizationen_US
dc.subjectSparken_US
dc.titleBig data clustering based on spark chaotic improved particle swarm optimizationen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Big data clustering based on spark chaotic improved particle swarm optimization.pdf
Size:
504.67 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: