Overview of Big Data platforms


Abstract

The primary purpose of this paper is to present and provide main advantages and disadvantages of most popular big data platforms as well as their comparison in terms of ease of installation, work, performance and price, in order to find the most suitable solution to work with big sets of data. Nowadays, the data is largely analyzed by scientists not related to IT, so the ease of use and presentation of data is extremely important. The purpose of the assessment was to indicate the best IT tool for analyzing data from the point of view of a young analyst or scientist graduating and entering the labor market.


Keywords

big data, data analysis, platform, tool comparison

[1] MapR Producent website https://mapr.com/docs/51/MapROverview/c_overview_intro.html (website) [05/2019]
[2] Cloudera Producent website https://www.cloudera.com/documentation/enterprise/5-13-x/topics/introduction.html (website) [05/2019]
[3] Hortonworks Producent website
https://hortonworks.com/products/data-platforms/hdp/ (website) [05/2019]
[4] Microsoft Azure Producent website
https://docs.microsoft.com/pl-pl/azure/hdinsight/hadoop/apache-hadoop-introduction (website) [05/2019]
[5] M. Siudziński, Hadoop article
https://itwiz.pl/hadoop-czyli-przetwarzanie-rozproszone-open-source/ (website)[05/2019]
[6] R.Wasiluk, P.Muryjas: The assessment of usefulness modern IT tools of data analysis Big Data, Institute of Computer Science, Lublin University of Technology, 2017
Download

Published : 2019-12-30


Wróbel, G., & Wikira, M. D. (2019). Overview of Big Data platforms. Journal of Computer Sciences Institute, 13, 283-287. https://doi.org/10.35784/jcsi.1296

Gabriel Wróbel  g.wrobel@pollub.edu.pl
Lublin University of Technology  Poland
Maciej Daniel Wikira 
University of Oulu, Finland  Poland