10 min read

Looking Back at 2024: GetInData’s in Data & AI

Let’s take a moment to look back at 2024 and celebrate everything we’ve achieved. This year has been all about sharing knowledge, creating impactful content, and strengthening connections within the data community. Here’s a quick recap of what we’ve been up to!

  • We continued growing as part of Xebia, contributing to the broader community with insights, innovations, and impactful collaborations. Together, we’re making strides in helping organizations become more data-driven.
  • This year, we published 34 blogs on topics like Big Data, Machine Learning, AI, and streaming. From data modeling with Looker and dbt to exploring the data lakehouse revolution, our content has sparked some great conversations and learning moments.
  • We also shared inspiring client stories—like how Bank Millennium transformed customer engagement with real-time data or how Play migrated a petabyte-scale Hadoop cluster to Kubernetes using open-source tools.
  • Our Lunch DaIT's webinar series continued to tackle hot topics like LLMOps and real-time data strategies, and we kept the knowledge flowing on YouTube with hands-on demos and tutorials.
  • The Data Pill newsletter reached new heights this year, delivering weekly insights on Big Data, AI, and Cloud to a growing global audience. We also released a white paper comparing top data quality tools and announced an upcoming eBook packed with practical tips for no-code automation.
  • We had a strong presence at global events, hosting the Big Data Technology Warsaw Summit and InfoShare 2024. Our experts and clients took the stage to share cutting-edge insights on topics like generative AI, stream processing, and Kubernetes migrations.
  • Inside the company, we focused on learning and collaboration through initiatives like Lunch & Learn sessions, active Guild communities, and The Study Club. These programs helped us stay curious, connected, and constantly evolving.

It’s been an incredible year, and we’re excited to share the highlights with you. Let’s dive in!

Blog posts

In 2024 we posted a lot. You can find 34 published blog posts about data, streaming, machine learning, AI and more here. The top 5 most read blogs are as follows:

We mentioned these blogs in our previous blog: Level Up Your Data Game: 5 Must-Read Blogs You Can’t Miss in 2024

Customer Stories

We also shared our success stories of working with clients:

We also share our content on Medium. Join us here

Webinars and Videos

Our Latest Webinar was:
"LLMops: Streamlining Machine Learning Operations for Large Language Models"

Missed it? No worries! You can still catch the recording. Just fill out the form here to dive into all the insights and tips on optimizing ML workflows for LLMs.

We just wrapped up the third session of our Lunch DaIT series, titled:

  • Real-Time Data to Drive Business Growth and Innovation in 2024

Want access to all three sessions? Sign up here and learn how real-time data can take your business to the next level in 2024.

We’ve uploaded some cool demos and presentations to our YouTube channel this year. Check out:

We were honored to share some game-changing insights at Big Data Technology Warsaw  2024! Here are the top presentations from our team:

And our amazing clients took the stage too:

There’s so much more to explore! Head over to webinars.getindata.com to access past recordings, upcoming webinars, and even more resources.

Data Pill Newsletter

DATA Pill has grown significantly this year. For those unfamiliar, let me introduce it: DATA Pill is a weekly newsletter delivered every Friday morning, featuring a curated selection of the best content in Big Data, Cloud, Machine Learning, and AI.

We’ve released 138 editions of DATA Pill, available in two formats: a traditional email newsletter and a LinkedIn newsletter hosted on Adam Kawa’s profile.

Last year, we reached a combined audience of 2,781 readers—273 subscribers on our mailing list and 2,508 followers on LinkedIn.

You can explore all previous editions and subscribe here.

White Paper and Ebook

In 2024, we dropped a white paper, "Smarter Data, Brighter Decisions: Data Quality Tools Comparison."

It's your go-to guide for choosing the best data quality platform. We break down heavyweights like Monte Carlo, Collibra, Talend Data Fabric, Ataccama One, Dataprep by Trifacta, and AWS Glue DataBrew, showing how they use AI and ML to make data quality simple and workflows smoother.

Whether you're after real-time monitoring, data governance, or a no-code setup, this paper's got the insights to help you level up your data game.

Download this White Paper here.

But wait, there's more. We've also announced an upcoming eBook: "Data Quality No-Code Automation with AWS Glue DataBrew: A Proof of Concept." It's packed with practical tips on creating data quality rules, using real-world examples like HR datasets, and ensuring your data is clean, consistent, and analysis-ready.

Don't miss out—join the waiting list now and be the first to get your hands on this valuable resource!

Our Big Data Experts at conferences and meetups

Last year, we had the privilege of organizing two major conferences: the Big Data Tech Warsaw Summit and InfoShare 2024. We’re thrilled to announce that we’re now officially part of InfoShare, where we host the DataMass stage, bringing even more cutting-edge data expertise to this incredible event!

The 10th edition of the Big Data Technology Warsaw Summit was a hybrid event, combining on-site and online participation. Some of the highlights included:

  • Data Lineage for the Streaming Universe by Maciej Obuchowski
  • Kubernetes Takes the Wheel: A Case Study of Migrating from Hadoop to K8s at Play by Kosma Grochowski, Tomasz Sujkowski, and Radosław Szmit
  • Your Personal LLM and RAG-Backed Data Copilot by Marek Wiewiórka

We also hosted two hands-on training sessions:

  • Building Generative AI-Based Applications with LLMs and Data Augmentation Architectures by Michał Bryś and Marek Wiewiórka
  • Data Streaming: Analyze Your Data in Real-Time with Flink by Adrian Bednarz and Piotr Menclewicz

If you couldn’t make it last time, don’t worry! You can still check out the reviews of past presentations to see what makes this event so unique. Be sure to join us this April for the 11th edition of the BTW conference—it’s an event no data enthusiast should miss! Tickets are available now, and you can use the code getindata10 for a 10% discount.

Don’t forget to check out our DataMass stage at InfoShare, where we’ll continue to deliver insights and showcase the latest innovations in the world of data!

At the InfoShare 2024 Marek Wiewiórka and Bartosz Konieczny gave their presentations:

  • Enhanced Enterprise Intelligence with your personal AI Data Copilot
  • Fallacies of stream processing

That’s not all! Our experts had the opportunity to showcase their knowledge at various fascinating Big Data events, including:

  • Kafka Summit: Paweł Leszczyński and Maciej Obuchowski delivered an insightful presentation on OpenLineage for Stream Processing.
  • Global Azure Torino: Tomasz Kostyrka shared his expertise with a talk on Azure Policy: An Underrated Component of a Scalable Data Platform.
  • Data Platform Next Step and SQLDay: Tomasz also presented on Azure Data Platform as Code at these key events.
  • Airflow Summit 2024: Kacper Muda and Maciej Obuchowski represented our team, with Kacper presenting Activating Operational Metadata with Airflow, Atlan, and OpenLineage, and Maciej discussing OpenLineage: From Operators to Hooks.

The meetups in Stockholm and Warsaw were great opportunities to explore real-time analytics, seamless migrations, and data strategies for 2025. In Stockholm, attendees enjoyed talks on optimizing JOIN performance and analytics best practices, while the Warsaw event focused on building effective data and AI strategies for the new year. The agenda included sessions on planning data strategies that deliver immediate value and discussions on implementing and measuring their success. Both meetups were filled with engaging conversations and fresh ideas, leaving us excited for what’s next in 2025!

Internal Knowledge Sharing

We prioritize internal knowledge-sharing through initiatives such as Lunch & Learn sessions, Guild meetings, and The Study Club. Lunch & Learn sessions provide a space for experts within GetInData | Part of Xebia to share their expertise, fostering collaboration and continuous learning. These online gatherings feature presentations by specialists or teams, followed by interactive discussions where participants can ask questions and exchange insights.

In 2024, some of the topics explored in these sessions included:

  • ChatGPT: Unlocking Its True Potential
  • How Can Public Speaking Help in Your Tech Job? Introduction to Public Speaking
  • Data Access Management in a Data Mesh Architecture

Guilds are communities of like-minded individuals passionate about a shared topic. At GetInData, anyone can voluntarily join a Guild via Slack.

We currently host five active Guilds:

  • MLOps
  • DevOps
  • Streaming (Real-Time Data Processing)
  • Data Engineering
  • Advanced Analytics

The Study Club
The Study Club is our spin on a book club, created to encourage learning and connection. We delve into a variety of materials—books, lectures, articles, and courses—covering topics that align with our work and interests.

Every two weeks, we come together to share insights, exchange ideas, and deepen our understanding of the material. Meetings last about an hour and focus on meaningful discussions, practical takeaways, and real-world applications. Participants review the material independently beforehand, making the sessions both engaging and impactful.

Plans for 2025

We’ve got plenty of exciting new concepts and creative plans in store for 2025, and we can’t wait to share them with you. There’s so much to look forward to—new experiences, opportunities, and ways to connect and learn together.

Stay in the loop by following us on  Linkedin, Facebook, Twitter, and our newsletter. And be sure to subscribe to our YouTube channel for more updates!

streaming
conference
technology
ML
Data
LLM
LLMOps
7 January 2025

Want more? Check our articles

getindator beautiful magi lake with data visualization under th 04d517e5 6cb7 49b2 af1a 77884a44a1eb
Tutorial

Data lakehouse with Snowflake Iceberg tables - introduction

Snowflake has officially entered the world of Data Lakehouses! What is a data lakehouse, where would such solutions be a perfect fit and how could…

Read more
flinkmleapobszar roboczy 1 4
Tutorial

Flink with MLeap

MLOps with Stream Processing In the big data world, more and more companies are discovering the potential in fast data processing using stream…

Read more
3

Data Journey with Michał Wróbel (RenoFi) - Doing more with less with a Modern Data Platform and ML at home

In this episode of the RadioData Podcast, Adam Kawa talks with Michał Wróbel about business use cases at RenoFi (​​a U.S.-based FinTech), the Modern…

Read more
datamass getindata adoption genai
Big Data Event

A Review of the Presentations at the DataMass Gdańsk Summit 2023

The Data Mass Gdańsk Summit is behind us. So, the time has come to review and summarize the 2023 edition. In this blog post, we will give you a review…

Read more
blog7

5 main data-related trends to be covered at Big Data Tech Warsaw 2021 Part II

Trend 4. Larger clouds over the Big Data landscape  A decade ago,  only a few companies ran their Big Data infrastructure and pipelines in the public…

Read more
1 RsDrT5xOpdAcpehomqlOPg
Big Data Event

2³ Reasons To Speak at Big Data Tech Warsaw 2020 (February 27th, 2020)

Big Data Technology Warsaw Summit 2020 is fast approaching. This will be 6th edition of the conference that is jointly organised by Evention and…

Read more

Contact us

Interested in our solutions?
Contact us!

Together, we will select the best Big Data solutions for your organization and build a project that will have a real impact on your organization.


What did you find most impressive about GetInData?

They did a very good job in finding people that fitted in Acast both technically as well as culturally.
Type the form or send a e-mail: hello@getindata.com
The administrator of your personal data is GetInData Poland Sp. z o.o. with its registered seat in Warsaw (02-508), 39/20 Pulawska St. Your data is processed for the purpose of provision of electronic services in accordance with the Terms & Conditions. For more information on personal data processing and your rights please see Privacy Policy.

By submitting this form, you agree to our Terms & Conditions and Privacy Policy