Open Journal of Applied Sciences

Volume 13, Issue 7 (July 2023)

ISSN Print: 2165-3917   ISSN Online: 2165-3925

Google-based Impact Factor: 0.92

Study on the Development and Implementation of Different Big Data Clustering Methods

PP. 1163-1177
DOI: 10.4236/ojapps.2023.137092

ABSTRACT

Clustering is an unsupervised learning method that organizes raw data so that items with similar characteristics fall into the same class and dissimilar items fall into different classes. The rapid growth in the volume of data being produced brings new challenges for its analysis and storage. Growing interest in areas such as real-time data mining reveals an urgent need to process very large data sets under strict performance constraints. This paper surveys four clustering algorithms, K-Means, Fuzzy C-Means (FCM), Expectation-Maximization (EM), and BIRCH, and describes their strengths and weaknesses. It then compares the results obtained by applying each algorithm to the same data and draws a conclusion from these results.
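To make the difference between hard and soft clustering concrete, the following is a minimal sketch in pure Python, not the authors' implementation. It runs a basic K-Means (Lloyd's algorithm) on a small toy data set and then computes Fuzzy C-Means membership degrees for the resulting centroids. The function names `kmeans` and `fcm_memberships`, the toy points, the starting centroids, and the fuzzifier `m = 2.0` are all assumptions chosen for illustration.

```python
import math


def kmeans(points, init_centroids, max_iter=100):
    """Basic K-Means (Lloyd's algorithm) with caller-supplied starting centroids."""
    centroids = list(init_centroids)  # copy so the caller's list is not mutated
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid (hard label).
        new_labels = [
            min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:  # converged: no label changed
            break
        labels = new_labels
        # Update step: each centroid becomes the mean of its assigned points.
        for j in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centroids


def fcm_memberships(points, centroids, m=2.0):
    """Fuzzy C-Means membership degrees for fixed centroids (one row per point)."""
    exponent = 2.0 / (m - 1.0)
    rows = []
    for p in points:
        d = [math.dist(p, c) for c in centroids]
        if 0.0 in d:
            # A point lying exactly on a centroid belongs fully to that centroid.
            row = [0.0] * len(d)
            row[d.index(0.0)] = 1.0
        else:
            row = [1.0 / sum((di / dk) ** exponent for dk in d) for di in d]
        rows.append(row)
    return rows


# Toy data (invented for this sketch): two visibly separated groups in 2-D.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
labels, centers = kmeans(points, [(0.0, 0.0), (10.0, 10.0)])
memberships = fcm_memberships(points, centers)
```

K-Means gives each point exactly one label, while the FCM memberships in each row sum to 1 and express how strongly a point belongs to every cluster. A full FCM run would also re-estimate the centroids from these memberships on each iteration; that step is omitted here to keep the sketch short.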

Share and Cite:

Ntayagabiri, J. , Ndikumagenge, J. , Ndayisaba, L. and Philippe, B. (2023) Study on the Development and Implementation of Different Big Data Clustering Methods. Open Journal of Applied Sciences, 13, 1163-1177. doi: 10.4236/ojapps.2023.137092.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.