Whitepaper
4 min read

eBook: Power Up Machine Learning Process. Build Feature Stores Faster - an Introduction to Vertex AI, Snowflake and dbt Cloud

Recently we published the first ebook in the area of MLOps: "Power Up Machine Learning Process. Build Feature Stores Faster - an Introduction to Vertex AI, Snowflake and dbt Cloud". In this short article, we will tell you what you can find inside the ebook, what questions and problems it addresses and share the first opinions. 

The Business Perspective and Technical Perspective on Feature Store and MLOps 

In this ebook, we discuss in detail the Feature Store as one of the components of MLOps. From the general concept of MLOps and Feature Store as a solution, to specific problems that cause inefficiencies in ML processes to the technical details of building a Feature Store and how to integrate GCP with Snowflake using terraform. We demonstrate using the example of the Feature Store what risks are involved in not optimizing ML processes. This provides an easy-to-understand explanation of what MLOps is on the one hand, and on the other hand provides the technical details needed to adopt the solution right away. 

That's why we've divided the ebook into two parts: the business perspective and the technical perspective, so everyone can easily find the content that interests them. 

What will you find inside this MLOps ebook?

  • What MLOps is and the MLOps Platform
  • What impact MLOps implementation can have on business
  • 5 Machine Learning issues that can cause the ineffective use of data (along with a solution)
  • A step-by-step guide to building a Feature Store
  • Comparison of the most popular Feature Stores
  • Insight into Machine Learning - MLOps correlations
  • Example of MLOps architecture and workflow
  • How to integrate GCP with Snowflake using terraform
  • Vertex.ai platform - how it works in practice

See what the experts are saying about the ebook

This e-book describes the real-life challenges of running Machine Learning on production systems from both business and technical perspectives. Moreover, it guides you through the practical solutions, based on our expert's experience.  Check it out especially if you're struggling with latency, data silos, data drift or data skew!

Michał Bryś, Senior ML Engineer and Technical Product Owner

A very interesting position where the author describes the MLOps world and takes a look at it from a Feature Store perspective. This eBook contains solid theoretical information on what MLOps is and how Feature Stores can address its problems. The second part describes the first steps in Data Engineering work that will lead to creating the first features. This eBook provides very detailed information on how to start with dbt, Snowflake and Vertex AI tools.

I recommend it for everyone taking their first steps in the Data Engineering world or anyone who wants to extend his/her knowledge about Feature Stores.

Piotr Pękala, Project Architects Lead

Ideas are worthless without execution”. This popular adage can be also applied perfectly to Data Science and Machine Learning results, where so many companies jump at the idea of using ML, but are struggling to execute it by delivering ML apps to production. That’s why the MLOps movement was shaped, to gather all the engineering practices in one place to deploy and maintain ML models in production efficiently and reliably. A lot has been developed, especially recently around Feature Stores - how data is prepared, stored and fed to ML models in training and production.

In this e-book, Getindata engineers introduce the reader to the MLOps concept and then focus on demonstrating how to build a Feature Store using managed cloud services and open-source technologies. Unlike many vendor e-books, these guys spare no technical details of the architecture and the solution. While they show the solution based on particular technologies like Vertex AI, Snowflake or dbt, the knowledge and design are presented abstractly with replaceable components.

If you are responsible for the data & AI strategy in your company, this source of information can help you accelerate and grasp the concepts of MLOps and Feature Stores without a vendor selling you anything.

If you’re an engineer, this very juicy tutorial will show you in detail how you can implement a Feature Store at your company. It will not only introduce you to the problem and show how to do a PoC - from configuring cloud services to running your first pipeline, but will also help you productionize the solution with infrastructure as a code, CICD and so on. All you need from start to finish in a concise writeup!

Krzysztof Zarzycki, GetInData CTO

big data
machine learning
MLOps
Feature Store
27 October 2022

Want more? Check our articles

getindata cover nifi lego notext
Tutorial

NiFi Ingestion Blog Series. PART I - Advantages and Pitfalls of Lego Driven Development

Apache NiFi, big data processing engine with graphical WebUI, was created to give non-programmers the ability to swiftly and codelessly create data…

Read more
radiodataquantum
Radio DaTa Podcast

Data Journey with Yetunde Dada & Ivan Danov (QuantumBlack) – Kedro (an open-source MLOps framework) – introduction, benefits, use-cases, data & insights used for its development

In this episode of the RadioData Podcast, Adam Kawa talks with Yetunde Dada & Ivan Danov  about QuantumBlack, Kedro, trends in the MLOps landscape e.g…

Read more
big data for e commerce

Big Data for E-commerce.

The year 2020 was full of challenges in many areas, and in many companies and organizations.  Often, it was necessary to introduce radical changes or…

Read more
radiodatawilla
Radio DaTa Podcast

Data Journey with Arunabh Singh (Willa) – Building robust ML & Analytics capability very early with FinTech, skills & competencies for data scientists with ML/AI predictions for the next decades.

In this episode of the RadioData Podcast, Adama Kawa talks with Arunabh Singh about Willa use cases (​ FinTech): the most important ML models…

Read more
1RiTD99ILqsAaSQqY1GaLMw
Big Data Event

Five big ideas to learn at Big Data Tech Warsaw 2020

Hello again in 2020. It’s a new year and the new, 6th edition of Big Data Tech Warsaw is coming soon! Save the date: 27th of February. We have put…

Read more
data enrichtment flink sql using http connector flink getindata big data blog notext
Tutorial

Data Enrichment in Flink SQL using HTTP Connector For Flink - Part Two

In part one of this blog post series, we have presented a business use case which inspired us to create an HTTP connector for Flink SQL. The use case…

Read more

Contact us

Interested in our solutions?
Contact us!

Together, we will select the best Big Data solutions for your organization and build a project that will have a real impact on your organization.


What did you find most impressive about GetInData?

They did a very good job in finding people that fitted in Acast both technically as well as culturally.
Type the form or send a e-mail: hello@getindata.com
The administrator of your personal data is GetInData Poland Sp. z o.o. with its registered seat in Warsaw (02-508), 39/20 Pulawska St. Your data is processed for the purpose of provision of electronic services in accordance with the Terms & Conditions. For more information on personal data processing and your rights please see Privacy Policy.

By submitting this form, you agree to our Terms & Conditions and Privacy Policy