IVML  
  about | r&d | publications | courses | people | links
   

S. Sioutas, Ph. Mylonas, A. Panaretos, P. Gerolymatos, D. Vogiatzis, E. Karavaras, T. Spitieris, A. Kanavos
Survey of Machine Learning Algorithms on Spark Over DHT-based Structures
Algorithmic Aspects of Cloud Computing, pp.146-156, LNCS, April 2017
ABSTRACT
Several solutions have been proposed over the past few years on data storage, data management as well as data retrieval systems. These solutions can process massive amount of data stored in relational or distributed database management systems. In addition, decision making analytics and predictive computational statistics are some of the most common and well studied fields in computer science. In this paper, we demonstrate the implementation of machine learning algorithms over an open-source distributed database management system that can run in parallel on a cluster. In order to accomplish that, a system architecture scheme (e.g. Apache Spark) over Apache Cassandra is proposed. This paper also presents a survey of the most common machine learning algorithms and the results of the experiments performed over a Point-Of-Sales (POS) data set.
01 April , 2017
S. Sioutas, Ph. Mylonas, A. Panaretos, P. Gerolymatos, D. Vogiatzis, E. Karavaras, T. Spitieris, A. Kanavos, "Survey of Machine Learning Algorithms on Spark Over DHT-based Structures", Algorithmic Aspects of Cloud Computing, pp.146-156, LNCS, April 2017
[ BibTex] [ Print] [ Back]

© 00 The Image, Video and Multimedia Systems Laboratory - v1.12