Federal agencies are collecting and sharing data at an unprecedented rate. Between July 2010 and August 2017, the number of data centers grew by 476 percent—from 2,094 to 12,0621. Meanwhile, the Data Accountability Transparency Act has made an unprecedented amount of this information open and accessible.
For many agencies, “data lakes”—centralized repositories of structured and unstructured data—have been the next step toward leveraging this information for analysis and decision-making. But compiling information into one place is only the first step. These data lakes need to become fully functional data platforms capable of ingesting, storing, and supporting data of any format or type, in a way that enables analysts to make connections and develop insights.