
Recent trends in big data using hadoop
Author(s) -
Chetna Kaushal,
Deepika Koundal
Publication year - 2019
Publication title -
international journal of informatics and communication technology/international journal of informatics and communication technology (ij-ict)
Language(s) - English
Resource type - Journals
eISSN - 2722-2616
pISSN - 2252-8776
DOI - 10.11591/ijict.v8i1.pp39-49
Subject(s) - big data , computer science , cluster analysis , data mining , the internet , data science , set (abstract data type) , data set , world wide web , artificial intelligence , programming language
Big data refers to huge set of data which is very common these days due to the increase of internet utilities. Data generated from social media is a very common example for the same. This paper depicts the summary on big data and ways in which it has been utilized in all aspects. Data mining is radically a mode of deriving the indispensable knowledge from extensively vast fractions of data which is quite challenging to be interpreted by conventional methods. The paper mainly focuses on the issues related to the clustering techniques in big data. For the classification purpose of the big data, the existing classification algorithms are concisely acknowledged and after that, k-nearest neighbor algorithm is discreetly chosen among them and described along with an example.