Subscribe to Read
Sign up today to enjoy a complimentary trial and begin exploring the world of books! You have the freedom to cancel at your convenience.
Data Science on the Google Cloud Platform: Implementing End-to-End Real-Time Data Pipelines: From Ingest to Machine Learning
Title | Data Science on the Google Cloud Platform: Implementing End-to-End Real-Time Data Pipelines: From Ingest to Machine Learning |
Writer | |
Date | 2024-11-25 00:00:48 |
Type | |
Link | Listen Read |
Desciption
Learn how easy it is to apply sophisticated statistical and machine learning methods to real-world problems when you build on top of the Google Cloud Platform (GCP). This hands-on guide shows developers entering the data science field how to implement an end-to-end data pipeline, using statistical and machine learning methods and tools on GCP. Through the course of the book, youâ??ll work through a sample business decision by employing a variety of data science approaches. Follow along by implementing these statistical and machine learning solutions in your own project on GCP, and discover how this platform provides a transformative and more collaborative way of doing data science. Youâ??ll learn how to: Automate and schedule data ingest, using an App Engine application Create and populate a dashboard in Google Data Studio Build a real-time analysis pipeline to carry out streaming analytics Conduct interactive data exploration with Google BigQuery Create a Bayesian model on a Cloud Dataproc cluster Build a logistic regression machine-learning model with Spark Compute time-aggregate features with a Cloud Dataflow pipeline Create a high-performing prediction model with TensorFlow Use your deployed model as a microservice you can access from both batch and real-time pipelines Read more
Review
I knew this book for me just a few pages into the first chapter. This book by Lake is unlike many other books of data science and particular technology that just enumerate the how-to's of the particular technology. Lak starts with a concrete user problem strongly anchored in probabilistic outcomes, and then steps through a typical data science process of discovery, refinement, and then converting to a production pipeline. While teaching about GCP technologies along the way, the book stays strongly anchored in the original user-problem. There is not a corner of GCP that is needed for a full production data science product that goes untouched in this book. The material is well covered, with pointers to deeper material and user manuals.I received the first edition. As GCP technology evolved, Lak was posting updates to his blog on Medium so that everyone could take understand the updates to GCP and how to use them. I was pleasantly surprised by getting these updates and made having the book that much more valuable.