
Big Data Tech Warsaw Summit 2019 summary


More than a month has passed since Big Data Tech Warsaw Summit 2019, but its spirit is still with us, which is why we decided to prolong it and share a short summary of the event. First of all, a very big thank you to Evention for a great deal of help with organizing BDTWS 2019.

During this year’s edition, 63 Big Data experts delivered 27 presentations and led 27 panel discussion tables, and that is before counting the workshops. Each year we set a new record for the number of participants and panel discussion tables, and this time was no different.

We were extremely glad to receive plenty of positive feedback, both on the spot and later on social media. One review, written by Kamil Szkoda (you can read it here), pointed out the professionalism and purely technical approach of the event.

What were the other advantages of Big Data Tech Warsaw Summit? It was a gathering of Big Data experts from all around the world, which gave BDTWS an objective, multi-perspective view. The topics discussed (AI, ML and cloud-based solutions, to name a few) addressed recent game-changing events such as the Cloudera-Hortonworks merger, as well as ongoing trends like migration to the cloud and the rising popularity and importance of Kubernetes. This helped participants with less Big Data experience catch up with the industry. Last but not least, the talks were rich in substance.

The BDTWS audience chose the five best presentations: "AI applied: filtering RTB traffic at Ad Tech scale" by Paweł Zawistowski (Adform); "The Changing Face of ETL: Event-Driven Architectures for Data Engineers" by Robin Moffatt (Confluent); "Towards next generation, cloud-ready and open-source big data discovery platform" by Krzysztof Zarzycki and Marek Wiewiórka (GetInData); "Streaming Visualization" by Guido Schmutz (Trivadis); and "From legacy to cloud: an end to end data integration journey" by Max Schultze (Zalando). We’re proud to note that the presentation given by our colleagues from GetInData not only received top reviews but also drew a large audience.

Marek (on the left) and Krzysiek during their presentation

Big Data Tech Warsaw Summit 2019 also included two-day workshops conducted by representatives of GetInData, Google and XCaliber. We’re glad that the Hadoop Ecosystem Basics workshop conducted by Piotr Krewski, GetInData’s Co-founder, received positive reviews for its high level and complete overview of the Hadoop ecosystem. The Big Data on Kubernetes training led by Maciej Bryliński of XCaliber was also appreciated by the audience, who highlighted its uniquely substantive content and exemplary delivery.

What are the main findings of BDTWS and the outlook for the near future? The closing panel (and many others) addressed this question. The hottest topic was the shift towards Kubernetes at the expense of the Hadoop ecosystem, a view supported by, among other things, the reasons behind the Cloudera-Hortonworks merger. When two competitors so heavily involved in the Hadoop ecosystem join forces, it is likely a sign of negative trends in the Hadoop market. Meanwhile, IBM is acquiring Red Hat for its commercial Kubernetes distribution (OpenShift), and VMware announced the acquisition of Heptio, a company founded by Kubernetes originators. Although many agree that the Kubernetes era is coming, a definite shift towards the new ecosystem is not a foregone conclusion, despite the advantages of Kubernetes over the Hadoop ecosystem (simplicity, open-source basis, good user experience).

The other megatrends highlighted were the rising popularity of open-source solutions in the cloud (which is associated with the rise of Kubernetes) and the shortage of data scientists. Besides these general observations, the conference talks were full of technical curiosities. One of them was the typo-tolerant search used by Booking.com. I was really surprised when the speakers (after switching the website to the English version) typed the letters kibdib and the search engine returned results for London. How is that possible? If you look at your QWERTY keyboard and compare the positions of the letters in kibdib and London, you’ll find that each typed letter sits (almost!) right next to the corresponding intended one. The feature was designed to tolerate mistyped input in the search box and still return the correct, desired result.
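To make the idea concrete, here is a minimal sketch of keyboard-adjacency matching in Python. It is only an illustration under simplifying assumptions and says nothing about how Booking.com actually implements its search: the names `QWERTY_ROWS`, `build_neighbors`, `could_be_typo_of` and `suggest` are hypothetical, only left and right neighbours on the same keyboard row are considered, and the typed text must have the same length as the candidate.

```python
# Hypothetical sketch of keyboard-adjacency typo matching.
# Assumption: this is NOT Booking.com's implementation, just an illustration.

QWERTY_ROWS = ["qwertyuiop", "asdfghjkl", "zxcvbnm"]


def build_neighbors(rows):
    """Map each key to itself plus its left/right neighbours on the same row."""
    neighbors = {}
    for row in rows:
        for i, key in enumerate(row):
            near = {key}
            if i > 0:
                near.add(row[i - 1])
            if i < len(row) - 1:
                near.add(row[i + 1])
            neighbors[key] = near
    return neighbors


NEIGHBORS = build_neighbors(QWERTY_ROWS)


def could_be_typo_of(typed, intended):
    """True if every typed letter equals, or sits next to, the intended letter."""
    typed, intended = typed.lower(), intended.lower()
    if len(typed) != len(intended):
        return False
    return all(
        t in NEIGHBORS.get(want, {want})
        for t, want in zip(typed, intended)
    )


def suggest(typed, known_places):
    """Return the known place names that the typed text could be a slip for."""
    return [place for place in known_places if could_be_typo_of(typed, place)]


if __name__ == "__main__":
    print(suggest("kibdib", ["London", "Warsaw", "Lisbon"]))
```

With this sketch, kibdib matches London because k, i and b sit directly left of l, o and n, while d is typed correctly. A production search engine would more likely combine such a keyboard-distance signal with edit distance and popularity ranking, but that goes beyond what was shown in the talk.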

All in all, the impressive statistics of Big Data Tech Warsaw Summit (fifth year in a row!) are proof not only of the great organisational skills of Evention and GetInData but, first and foremost, of the growing importance of data and data management in the modern, digitalized world. We really hope that next year’s edition, Big Data Tech Warsaw Summit 2020, will attract an even bigger audience and more speakers. Hope to see you next year!

8 April 2019

