Menyu
ITI
əməkdaslarının elmi isləri
Elektron kitabxana
Konfranslar İnformasiya Sistemi
Qəzetlər
UOT 004
|
ITI əməkdaşlarının elmi işləri - məqalə |
Biblioqrafik təsvir | Alguliyev , R.M. Efficient algorithm for big data clustering on single machine / R.M. Alguliyev , R.M. Aliguliyev , L.V. Sukhostat // CAAI Transactions on Intelligence Technology. - 2020. - N: vol.5, iss.1.- P. 9-14. | Annotasiya | Big data analysis requires the presence of large computing powers, which is not always feasible. And so, it
became necessary to develop new clustering algorithms capable of such data processing. This study proposes a new
parallel clustering algorithm based on the k-means algorithm. It significantly reduces the exponential growth of
computations. The proposed algorithm splits a dataset into batches while preserving the characteristics of the initial
dataset and increasing the clustering speed. The idea is to define cluster centroids, which are also clustered, for each
batch. According to the obtained centroids, the data points belong to the cluster with the nearest centroid. Real large
datasets are used to conduct the experiments to evaluate the effectiveness of the proposed approach. The proposed
approach is compared with k-means and its modification. The experiments show that the proposed algorithm is a
promising tool for clustering large datasets in comparison with the k-means algorithm.
| Elektron variant | Elektron variant |
|
________
© ict.az http://ict.az/az
|
|