12 min read

GetInData in 2021 - let’s celebrate our achievements in the Big Data world!

The year 2021 passed in the blink of an eye and the time has come to summarize our goals at GetinData and define our challenges for the next year. Today, we would like to share our achievements with you in the fields of knowledge sharing and the Big Data community. Here's a quick summary:

  • Throughout 2021, we published a lot of blogs about our solutions, Big Data technologies and Big Data events
  • We shared interesting facts and news from the world of Big Data on our social media channels - follow our profiles and stay up to date!
  • We took part in one of the biggest events in Poland’s IT industry - the Google Cloud Region launch
  • Our Big Data Experts had the pleasure of performing in many Big Data events around the world
  • We took part in GITEX Global - one of the biggest tech events in the field of #bigdata #ai #cloud #cybersecurity and more
  • We started a new project dedicated to our Polish fans - Big Data Club. This  group focuses on Big Data solutions for business. It is primarily intended for people operating in the interface between business and technology.
  • We were recognised for our expertise in Cloud Analytics, Stream Data Analytics, Enterprise BI solutions and remote monitoring using Google Cloud
  •  This year we have exceeded 100+ people being on board and we are not slowing down! (check our job offers)
  • Organizing internal Lunch & Learn sessions, which is already a tradition of ours 
  • Our experts passed a countless number of assessments in the area of ​​Google Cloud, and the number is constantly growing
  • We launched GetInData Labs, whose mission is to research and produce innovative solutions that develop our business and people (soon we will publish more information about this)
  • We contributed to many open-source projects such as Marquez, Flink HTTP Connector, Amundsen, Kedro-kubeflow plugin

Below you can find a list of our publications, webinars and conference talks so…  sit down comfortably and dive into the topics that interest you the most! 

Our Top 5 blog posts

getindata-big-data-blog-linkedin-technology-knowledge-sharing

In 2021 we published 39 blog posts in the field of Big Data. You can check out the top 5 most viewed blogs (based on data from the 5th of January) below:

That's not all, of course!  We also published lots  of blog posts about some interesting technology:  

More blog posts correlated to MLOps and our Machine Learning Platform such us:

A lot of blog posts correlated to stream-processing and our Complex Event Processing Platform

We also published a series of blog posts dedicated to businesses such as: 

and last but not least some blog posts about our people and company:

Whitepapers and our first E-book

Together with our Big Data expert Paweł Leszczyński, we prepared the whitepaper “Stream Processing Explained”. There you can read about the characteristics of streaming, challenges and the open-source streaming playground such as Apache Flink, Kafka Connect and Spark Streaming. We encourage you to download it here

Did you have the chance to check out “Apache NiFi: A Complete Guide E-book” created by our specialists: Albert Lewandowski, Paweł Leszczyński and Tomasz Nazarewicz? If not, you definitely should.
What will you find there? You'll learn about NiFi architecture, NiFi flow, the management & operations of NiFi and NiFi Registry and some recommendations for using Apache NiFi. If you are interested in NiFi, check it out!

Our Big Data Experts at the conference

getindata-big-data-blog-linkedin-technology-conferences

Last year we organized the 7th edition of the Big Data Technology Warsaw Summit, the first time this was done in a fully online version! In our blog, you can read a review of presentations and of course, we encourage you to join us during this year's edition on the 26-28th April 2022

During the first half of the year, we had the pleasure of taking part and performing  in many interesting Big Data Events:

During the second half of the year at the ClubCloud Conference, Data Science Summit and during Flink Forward Global 2021, our team members Maciej Bryński and Rafał Małanij had the pleasure of presenting   Networks! Project - Real-Time Analytics That Control 50% of Mobile Networks in Poland (slides). 

Our Big Data DevOps Expert Albert Lewandowski also performed at:

  • The Big Data Conference Europe, where he talked about best practices for ETL with Apache NiFi and Kubernetes
  • The Open Source Summit + Embedded Linux Conference 2021 where he presented an Open-source Data Platform in the Cloud
  • DataArt IT Nonstop, where he gave a presentation on creating Real-Time Data Streaming Powered by SQL on Kubernetes

Big Data Webinars and Meetups

In 2021 we organized a few webinars:

Our CTO Krzysztof Zarzycki also had the pleasure of hosting a webinar organized by Polish Executives Switzerland and Polish Professionals in Switzerland. During the meeting, we talked about the latest big data trends, how data processing works nowadays and how companies can benefit from data analytics (check slides). 

That’s not all! We also organized a Lighting Talks in a Warsaw Data Tech Talks meeting, where our team talked about: 

  • Data Discovery with Apache Atlas and Amundsen - Improving the Productivity of Users Interacting with Data, hosted by Dominik Choma and Mariusz Górski (ING)
  • Open-source vs Cloud-Managed - Data Engineers Dilemmas in the Cloud, hosted by Marcin Kaceperek
  • Real-time Monitoring that Controls 50% of Mobile Networks in Poland, hosted by Maciej Bryński and Rafał Małanij
  • Kedro, Data Scientist’s Swiss-Army Knife, hosted by Mariusz Strzelecki 

Our experts Dominik Choma and Mariusz Górski (ING) performed at the Amundsen Community meeting about Pandas profiling with Amundsen and OpenLineage. You can watch the clip here. At another meeting of the Amundsen Community, Mariusz Strzelecki talked about Feast + Amundsen Integration. Feel free to watch the video.

Knowledge Sharing and Big Data for Business

In 2021 we made some changes to our website and published two new subpages.

  • Knowledge Base - this is our library of Big Data Knowledge. This page gives you full access to our whitepapers, recordings of conferences and webinars and the most interesting blog posts or contributions that we have made to open-source software.
  • Big Data for Business - if you would like to know more about the process of cooperation with our company, the model that allows us to successfully deliver even complex Big Data projects for our clients or how Big Data can benefit your business, this is the place dedicated for you!

getindata-big-data-blog-linkedin-technology

Plans for 2022?

We have a lot of new ideas that we want to bring to life in 2021. We also have plans for lots more knowledge sharing in our profile, so if you are interested in MLOps, Stream-processing, DevOps, DataOps, Cloud, Data Science or Big Data at all, please sign up to the newsletter and follow us on Linkedin, Facebook and Twitter, and subscribe to our channel on Youtube.

big data
analytics
conference
technology
google cloud platform
getindata
stream processing
cloud
MLOps
18 January 2022

Want more? Check our articles

getindata big data tech main 1
Big Data Event

A Review of the Presentations at the Big Data Technology Warsaw Summit 2022!

The 8th edition of the Big Data Tech Summit is already over, and we would like to thank all of the attendees for joining us this year. It was a real…

Read more
ml getindataobszar roboczy 1
Use-cases/Project

Real-time Machine Learning: considerations based on Fraud Detection use case

When it comes to machine learning, most products are designed to work in batches, meaning they process data at fixed intervals rather than in real…

Read more
getindata ml innovations 2023
Tech News

If LLM’s did not exist. ML innovations in 2023 from a data scientist’s perspective

The year 2023 has definitely been dominated by LLM’s (Large Language Models) and generative models. Whether you are a researcher, data scientist, or…

Read more
mariusz blogobszar roboczy 1 4x 100
Tutorial

OAuth2-based authentication on Istio-powered Kubernetes clusters

You have just installed your first Kubernetes cluster and installed Istio to get the full advantage of Service Mesh. Thanks to really awesome…

Read more
maximizing personalization11
Tutorial

Maximizing Personalization: Real-Time Context and Persona Drive Better-Suited Products and Customer Experiences

Have you ever searched for something that isn't typical for you? Maybe you were looking for a gift for your grandmother on Amazon or wanted to listen…

Read more
airbyte column selectionobszar roboczy 1 4
Tutorial

Less data, less problems: Airbyte’s column selection is finally here

The Airbyte 0.50 release has brought some exciting changes to the platform: checkpointing (so that you don’t have to start from scratch in case of…

Read more

Contact us

Interested in our solutions?
Contact us!

Together, we will select the best Big Data solutions for your organization and build a project that will have a real impact on your organization.


What did you find most impressive about GetInData?

They did a very good job in finding people that fitted in Acast both technically as well as culturally.
Type the form or send a e-mail: hello@getindata.com
The administrator of your personal data is GetInData Poland Sp. z o.o. with its registered seat in Warsaw (02-508), 39/20 Pulawska St. Your data is processed for the purpose of provision of electronic services in accordance with the Terms & Conditions. For more information on personal data processing and your rights please see Privacy Policy.

By submitting this form, you agree to our Terms & Conditions and Privacy Policy