Python Versus R in Apache Spark

The June update to Apache Spark brought support for R, a significant enhancement that opens the big data platform to a large audience of potential new users. Support for R in Spark 1.4 also gives users an alternative to Python. But which language will emerge as the winner for doing data science in Spark? We spoke to Databricks’ Ali Ghodsi for answers.

According to Ghodsi, who is Databricks’ vice president of engineering and product management, the company has been bombarded with requests over the past year or so to add support for R in Apache Spark. While the software is open source, about three-quarters of the framework was written by people who work for Databricks, so the company effectively controls the direction of Spark.

Source: datanami.com
Author: Alex Woodie
