The failure rate for traditional big data and data lake projects is reported to be between 60% and 80%. Hadoop was billed as the platform for storing and analyzing data, and it promised to minimize data silos through the concept of a data lake. But Hadoop has largely failed to account for the operational side of data and what it takes to actually run a business. Big data systems like Hadoop are complex and require teams of expensive engineers to build and maintain them. Snowflake is the complete, modern, cloud-based data platform that can integrate with, or completely replace, your data lake. This course shows how that is made possible.
We discuss in depth the history of data lakes and the architecture of the modern data lake, whether on-premises or cloud-based. We cover what's good about data lakes, what's bad about them, and how data lakes become data swamps. Most importantly, we discuss how Snowflake can completely replace the traditional data lake and serve as your single repository for corporate data.
Finally, we present three case studies of companies that tried to make the traditional on-premises (or cloud-based) data lake architecture work for them, ultimately scrapped it, and settled on Snowflake.
We wrap up the course with some questions to test your knowledge of the material.