
Spark isn’t perfect, but what is these days? Learn about the good and the bad that Spark has to offer. If you’re looking for a solution for processing huge chucks of data, […]
Spark isn’t perfect, but what is these days? Learn about the good and the bad that Spark has to offer. If you’re looking for a solution for processing huge chucks of data, […]
Large or frequent file dumps can slow the ingest pipeline down. See how Cloudera combated this to achieve a 300% speedup instead. A common design pattern often emerges when teams begin to […]
This week’s round-up of interesting big data technologies from Spark to NiFi with some microservices thrown in for modern data application development. Learn about how to rapidly iterate data applications, while reusing […]
The Hadoop distribution war comes down to a final battle between Cloudera’s CDH and Hortonworks’ HDP. That wasn’t always the case. At the peak of the market’s fragmentation, many companies offered Hadoop […]
The Spark Summit heads to San Francisco. Tableau is bringing its mobile client to Android. Microsoft Ventures eyes machine learning startups. We have all this and more in our Big Data Roundup […]
There’s no doubt that the Apache Spark phenomena has taken the big data world by storm. But can the technology actually deliver according to the tremendous hype that is accompanying it, or […]
The combination of Spark and Hadoop has supercharged big data analysis across many industries and use cases by lowering the barrier of entry to advanced analytics and thereby enabling data scientists to […]
Big Data consultancy Mammoth Data today published a benchmark study that shows Google’s Cloud Dataflow service outperforms the extremely popular open source data processing engine, Apache Spark. Google hired the company to […]
Thanks to an impressive grab bag of improvements in version 2.0, Spark’s quasi-streaming solution has become more powerful and easier to manage Last year was a banner year for Spark. Big names […]
One would obviously expect Hadoop to dominate the discussions at the recent Strata & Hadoop World conference in San Jose, CA. But much of the buzz this year was around Apache Spark, […]