On-Demand Webinar – Hosted by O’Reilly Media
You want to create a Data Lake and store large volumes of disparate data on a Hadoop cluster for advanced analytics. You may have completed a small scope proof of concept (POC) and are ready to take the next step and bring in more data. But what about Data Governance – what role does it play in implementing a Data Lake? And as the data silos come down, how do you deal with the politics and compliance issues that come with data ownership and responsibility? Does Big Data Governance exist?
In this webcast, we will discuss the following:
- The integral role the four pillars of Data Governance play in your strategy: data privacy, data quality, process integration, data lifecycle management
- Key factors in co-mingling disparate data sets; such as adapting governance policies that were written when data was silo’ed and identifying when and when not to link sources
- Software and architectural best practice considerations for better integration of a Data Lake into an existing Data Governance framework
Recorded June 18, 2015