This is the code repository for Snowflake Cookbook, published by Packt.
Techniques for building modern cloud data warehousing solutions
Snowflake is a unique cloud-based data warehousing platform built from scratch to perform data management on the cloud. This book introduces you to Snowflake's unique architecture, which places it at the forefront of cloud data warehouses. You'll explore the compute model available with Snowflake, and find out how Snowflake allows extensive scaling through the virtual warehouses. You will then learn how to configure a virtual warehouse for optimizing cost and performance. Moving on, you'll get to grips with the data ecosystem and discover how Snowflake integrates with other technologies for staging and loading data.
This book covers the following exciting features:
- Get to grips with data warehousing techniques aligned with Snowflake's cloud architecture
- Broaden your skills as a data warehouse designer to cover the Snowflake ecosystem
- Transfer skills from on-premise data warehousing to the Snowflake cloud analytics platform
- Optimize performance and costs associated with a Snowflake solution
- Stage data on object stores and load it into Snowflake
- If you feel this book is for you, get your copy today!
All of the code is organized into folders. For example, Chapter02.
The code will look like the following:
CREATE TRANSIENT SCHEMA temporary_data
DATA_RETENTION_TIME_IN_DAYS = 0
COMMENT = 'Schema containing temporary data used by ETL
processes';
Following is what you need for this book: This book is for data warehouse developers, data analysts, database administrators, and anyone involved in designing, implementing, and optimizing a Snowflake data warehouse. Knowledge of data warehousing and database and cloud concepts will be useful. Basic familiarity with Snowflake is beneficial, but not necessary.
With the following software and hardware list you can run all code files present in the book (Chapter 1-10).
Chapter | Software required | OS required |
---|---|---|
1 - 10 | Web Browser (Chrome, Edge, Firefox) | Windows, Mac OS X, and Linux (Any UI Based) |
1 - 10 | Snowflake Web UI, SnowSQL, SnowPipe | Windows, Mac OS X, and Linux (Any UI Based) |
1 - 10 | Apahe Spark | Windows, Mac OS X, and Linux (Any UI Based) |
We also provide a PDF file that has color images of the screenshots/diagrams used in this book. Click here to download it.
Hamid Qureshi is a senior cloud and data warehouse professional with almost two decades of total experience, having architected, designed, and led the implementation of several data warehouse and business intelligence solutions. He has extensive experience and certifications across various data analytics platforms, ranging from Teradata, Oracle, and Hadoop to modern, cloud-based tools such as Snowflake. Having worked extensively with traditional technologies, combined with his knowledge of modern platforms, he has accumulated substantial practical expertise in data warehousing and analytics in Snowflake, which he has subsequently captured in his publications.
Hammad Sharif is an experienced data architect with more than a decade of experience in the information domain, covering governance, warehousing, data lakes, streaming data, and machine learning. He has worked with a leading data warehouse vendor for a decade as part of a professional services organization, advising customers in telco, retail, life sciences, and financial industries located in Asia, Europe, and Australia during presales and post-sales implementation cycles. Hammad holds an MSc. in computer science and has published conference papers in the domains of machine learning, sensor networks, software engineering, and remote sensing.