2016 IEEE International Congress on Big Data
Open Source Big Data Analytics Frameworks Written in Scala
John A. Miller, Casey Bowman, Vishnu Gowda Harish and Shannon Quinn
Department of Computer Science
University of Georgia
Athens, GA, USA
Abstract
Frameworks for big data arguably began with
Google’s use of MapReduce. Since then, a huge amount of
progress has been made in the development of big data
frameworks, many of which have been released as open source.
Further to increase portability and ease of set-up, many are
coded in a Java Virtual Machine (JVM) based language, e.g.,
Java or Scala. In addition, processing of big data involves
the flow of data, and of course, the processing of data as it
flows. This computational paradigm is a natural for functional
programming. Furthermore, the map, reduce and combiner
have analogs in functional programming. There has been a
trend in the last few years toward developing open source big
data frameworks written in Scala to support big data analytics.
Scala is a modern JVM language that supports both object-
oriented and functional programming paradigms.
Keywords - big data; analytics; frameworks; functional programming; scala