Success Story: Fintech data platform gets a boost from stream processing

A partnership between iZettle and GetInData originated in the form of a two-day workshop focused on analyzing iZettle’s needs and exploring multiple cloud providers’ offerings. Outcomes of this event led to a year-long collaboration on building a robust, third wave data platform.

The challenge

To ensure undisrupted business growth iZettle was looking for a data platform solution that could meet advanced analytics requirements and address the performance issues caused by rapidly swelling data collection. The platform should minimize the effort spent on maintenance work, allowing specialists to dedicate more time to exploratory data analysis and manufacturing business meaningful insights.

izettle getindata strem processing google cloud platform

The solution

Daily loading jobs were replaced with a streaming ingestion process running on Google DataFlow. Currently, BigQuery takes the role of a central data lake and a query engine. The ingestion process uses an internal message dictionary to validate and route messages to relevant tables. Analytics work is orchestrated with Cloud Composer and utilises BigQuery for SQL and DataFlow for complex scenarios.

The results

The introduction of dynamic streaming ingestion greatly reduced the effort and time required to onboard new sources into a data lake (days instead of weeks).
The new solution ensures that valid and complete information is ready for use ahead of a reporting day.
Thanks to simplified maintenance, teams can focus on data mining and analysis, armed with a wide range of tools that Google Cloud has to offer.

streaming

big data

google cloud platform

google dataflow

stream processing

cloud

BigQuery

Last updated: 27 May 2021

Written by

Mateusz Pytel

Google Cloud Platform Architect

Want more? Check our articles

Tutorial

Data isolation in tenant architecture on the Google Cloud Platform (GCP)

Multi-tenant architecture, also known as multi-tenancy, is a software architecture in which a single instance of software runs on a server and serves…

Tech News

If LLM’s did not exist. ML innovations in 2023 from a data scientist’s perspective

The year 2023 has definitely been dominated by LLM’s (Large Language Models) and generative models. Whether you are a researcher, data scientist, or…

DATA Pill – the blue pill that (accidentally) works!

Ever felt overwhelmed by the flood of news about the latest technologies, tools, and trends in Data, AI, and ML? A new framework here, a revolutionary…

Tutorial

Airflow in a multi-teams / multi-tenant environment. Deployment strategies

This article explores using Airflow 2 in environments with multiple teams (tenants) and concludes with a brief overview of out-of-the-box features to…

Big Data Event

A Review of the Presentations at the Big Data Technology Warsaw Summit 2022!

The 8th edition of the Big Data Tech Summit is already over, and we would like to thank all of the attendees for joining us this year. It was a real…

getindator create a modern tech inspired thumbnail graphic

Tutorial

dbt Semantic Layer - Implementation

Introduction Welcome back to the dbt Semantic Layer series! This article is a continuation of our previous article titled “dbt Semantic Layer - what…

Success Story: Fintech data platform gets a boost from stream processing

The challenge

The solution

The results

Like this post?
Spread the word

Want more? Check our articles

Data isolation in tenant architecture on the Google Cloud Platform (GCP)

If LLM’s did not exist. ML innovations in 2023 from a data scientist’s perspective

DATA Pill – the blue pill that (accidentally) works!

Airflow in a multi-teams / multi-tenant environment. Deployment strategies

A Review of the Presentations at the Big Data Technology Warsaw Summit 2022!

dbt Semantic Layer - Implementation

Contact us

Interested in our solutions?
Contact us!

Success Story: Fintech data platform gets a boost from stream processing

The challenge

The solution

The results

Like this post?Spread the word

Want more? Check our articles

Data isolation in tenant architecture on the Google Cloud Platform (GCP)

If LLM’s did not exist. ML innovations in 2023 from a data scientist’s perspective

DATA Pill – the blue pill that (accidentally) works!

Airflow in a multi-teams / multi-tenant environment. Deployment strategies

A Review of the Presentations at the Big Data Technology Warsaw Summit 2022!

dbt Semantic Layer - Implementation

Contact us

Interested in our solutions?Contact us!

Like this post?
Spread the word

Interested in our solutions?
Contact us!