Application of Multivariate-Rank-Based Techniques in Clustering of Big Data | Zendy

Pritha Guha | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Application of Multivariate-Rank-Based Techniques in Clustering of Big Data

Author(s) -

Pritha Guha

Publication year - 2018

Publication title -

vikalpa the journal for decision makers

Language(s) - English

Resource type - Journals

eISSN - 2395-3799

pISSN - 0256-0909

DOI - 10.1177/0256090918804385

Subject(s) - petabyte , big data , data science , volume (thermodynamics) , rank (graph theory) , computer science , byte , cluster analysis , multivariate statistics , data mining , process (computing) , variety (cybernetics) , artificial intelligence , machine learning , mathematics , physics , quantum mechanics , combinatorics , operating system

Executive Summary Very large or complex data sets, which are difficult to process or analyse using traditional data handling techniques, are usually referred to as big data. The idea of big data is characterized by the three ‘v’s which are volume, velocity, and variety ( Liu, McGree, Ge, & Xie, 2015 ) referring respectively to the volume of data, the velocity at which the data are processed and the wide varieties in which big data are available. Every single day, different sectors such as credit risk management, healthcare, media, retail, retail banking, climate prediction, DNA analysis and, sports generate petabytes of data (1 petabyte = 250 bytes). Even basic handling of big data, therefore, poses significant challenges, one of them being organizing the data in such a way that it can give better insights into analysing and decision-making. With the explosion of data in our life, it has become very important to use statistical tools to analyse them.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research