MapReduce vs. Spark Smackdown

Apache Spark, you may have heard, performs faster than Hadoop MapReduce in Big Data analytics. An open source technology commercially stewarded by Databricks Inc., Spark can “run programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk,” its main project site states.

Those claims aren’t readily attributed on the main site, but the Spark FAQ further states that “it has been used to sort 100TB of data 3X faster than Hadoop MapReduce on 1/10th of the machines, winning the 2014 Daytona GraySort Benchmark.”

Source: adtmag.com
Author: David Ramel
