One of the longstanding challenges in analytics is data unification. While federated approaches are gaining some favor, the vast majority of analytic practitioners want the data to be present in one place before analyzing it. This means that data from different entities must be unified, and that’s a problem.
In the new white paper “The Seven Tenets of Scalable Data Unification,” renowned computer scientist and Tamr cofounder and CTO Michael Stonebraker lays out the challenge in his unique and plainspoken style. Stonebraker, who is also a Turing Award winner, an MIT professor, and the creator of Vertica, starts the white paper by describing data unification, which he says consists of seven steps: ingesting, cleansing, transformation, schema integration, deduplication, classification, and exporting (but don’t confuse these seven steps with the tenets).
Author: Alex Woodie