To handle big data, shrink it

By Derrick Martins on June 2, 2015 • ( Leave a comment )

As anyone who’s ever used a spreadsheet can attest, it’s often convenient to organize data into tables. But in the age of big data, those tables can be enormous, with millions or even hundreds of millions of rows.

One way to make big-data analysis computationally practical is to reduce the size of data tables — or matrices, to use the mathematical term — by leaving out a bunch of rows. The trick is that the remaining rows have to be in some sense representative of the ones that were omitted, in order for computations performed on them to yield approximately the right results.

Source: newsoffice.mit.edu
Author: Larry Hardesty

Categories: Articles

Tagged as: Best Practices, Big Data, Business Analytics, Business Intelligence

BI Corner

At the intersection of Business Intelligence & Analytics

BI Corner Newsletter

Most Viewed Posts

Tag Cloud

To handle big data, shrink it

Leave a comment Cancel reply

Follow BI Corner

Like BI Corner on Facebook

Follow me on Twitter

Follow BI Corner by RSS

BI Corner Newsletter

Most Viewed Posts

Tag Cloud

To handle big data, shrink it

Share this:

Related

Leave a comment Cancel reply

Follow BI Corner

Like BI Corner on Facebook

Follow me on Twitter

Follow BI Corner by RSS