Sustained Innovation in Apache Spark: DataFrames, Spark SQL, and MLlib
NOVEMBER 30, 2015
The post Sustained Innovation in Apache Spark: DataFrames, Spark SQL, and MLlib appeared first on Cloudera Engineering Blog. Cloudera has announced support for Spark SQL/DataFrame API and MLlib. This post explains their benefits for app developers, data analysts, data engineers, and data scientists. In July 2015, Cloudera re-affirmed its position since 2013 : that Apache Spark is on course to replace MapReduce as the default general-purpose data processing engine for Apache Hadoop.