Statistical Analytic Tool for Dimensionalities in Big Data: The role of Hierarchical Cluster Analysis

No Thumbnail Available
Date
2016
Journal Title
Journal ISSN
Volume Title
Publisher
Research India Publications
Abstract
An interest for presenting this paper rose because of massive increase information with a very high dimensional from different sources in this era of globalization. Data are produced continuously and are unstructured (1). This paper is confined to literature review search for big data issue and challenges of several scopes in data. It brings a detailed discussion on the problem on these data and analysis done using the effective multivariate statistical tool namely clustering analysis technique as a data reduction technique. It is used as a base for discussion for existing challenge of multi-dimensionalities of data. The findings indicated that, the world is noisy due to massive flow of information continuously. Findings revealed that data emanating from face book, you tube and twitter can be used to predict the epidemic of influenza and even market trend (2 and 3). With face book data is used to predict the people`s interest. However, data from different sources have been proved to be useful in decision making efficiently and effectively for public as well as private sector. Cluster analysis technique sorts data/alike things into groups, to see if there a high natural degree association among members of the same group and low between members of different groups. Finally, this technique has proved failure to handle such heap of data with varied sources. With regards to data stored, it remains to be a challenge in terms of analysis among researchers and scientists. Therefore, it calls for advanced statistical software to cater for such an existing challenges.
Description
Article published by Research India Publications in the International Journal of Statistics and Systems Volume 11, Number 1 (2016), pp. 19-26
Keywords
Big data, Dimensionalities, Clustering analytics
Citation
APA