2016 IEEE International Congress on Big Data

Open Source Big Data Analytics Frameworks Written in Scala

John A. Miller, Casey Bowman, Vishnu Gowda Harish and Shannon Quinn

Department of Computer Science
University of Georgia
Athens, GA, USA

Abstract

Frameworks for big data arguably began with Google’s use of MapReduce. Since then, a huge amount of progress has been made in the development of big data frameworks, many of which have been released as open source. Further to increase portability and ease of set-up, many are coded in a Java Virtual Machine (JVM) based language, e.g., Java or Scala. In addition, processing of big data involves the flow of data, and of course, the processing of data as it flows. This computational paradigm is a natural for functional programming. Furthermore, the map, reduce and combiner have analogs in functional programming. There has been a trend in the last few years toward developing open source big data frameworks written in Scala to support big data analytics. Scala is a modern JVM language that supports both object- oriented and functional programming paradigms.

Keywords - big data; analytics; frameworks; functional programming; scala