Big Data Event
5 min read

Truecaller, GetInData and Google’s contribution to Big Data Tech Warsaw Summit

GetInData, Google and Truecaller participate in the Big Data Tech Warsaw Summit 2019.

The 5th edition of Big Data Tech Warsaw Summit is now less than two weeks away. How time flies! You can find our initial coverage of the event here, but today we’ll walk you through the presentation that our colleagues from Truecaller will deliver, and shed some light on GetInData’s contribution to the Summit and its joint initiative with Google. Enjoy!

Presentations

Truecaller, a long-standing GetInData client, is a Swedish tech company that is changing how people make phone calls. Truecaller has created one of the most popular caller ID and phone spam protection apps. The app has over 100 million daily active users globally (and growing!), runs on a petabyte-scale infrastructure and contains several features powered by data and ML/AI, such as spam detection. Dhanesh Padmanabhan, their Director of Engineering, will give a talk on Truecaller’s analytics stack and the data-driven use cases, deployed on top of Kafka, Spark and Hadoop, that support the company’s core activities.

Krzysztof Zarzycki and Marek Wiewiórka, two of the most experienced Big Data veterans at GetInData, will discuss a future-proof, cloud-ready and open-source big data discovery platform. As this may sound a little vague, I will briefly walk you through it. Because GetInData constantly works on Big Data platforms with many clients, we are able to spot their bottlenecks and inefficiencies. This time my colleagues will take a look at the Data Lake. At most companies, the Data Lake becomes a controlled environment with strict service level agreements and strong guarantees, leaving no room for the creativity and experiments that drive unique, cutting-edge and innovative data science projects. Krzysztof and Marek will explain how this problem can be addressed by a modern open-source platform built next to the existing Data Lake: the Data Discovery platform. Think of it as a cloud-ready, open-source-based extension to the Data Lake that gives users a free choice of processing power and libraries, and seamless data access, without lock-in.

Krzysztof Zarzycki (GetInData) and Alexey Brodovshuk (Kcell) speaking about their joint project of building a large-scale real-time streaming platform and its telco use cases at Big Data Tech Warsaw Summit 2018.

The panel and roundtable discussions

The Summit agenda also includes a series of roundtable discussions and panels. Their aim is to engage all participants and drive meaningful, inspiring debates on hot topics in the Big Data world.

The main panel, entitled “How current megatrends are changing the Big Data landscape and what it means to us”, will be led by Adam Kawa from GetInData.

Our experts will moderate two roundtable discussions: “Stream processing engines — features, performance, comparison” and “Big Data — the cloud way”. Join these tables if you’d like to discuss these topics with other specialists and our experts.

One of the roundtable discussions at Big Data Tech Warsaw Summit 2017.

The workshops

Besides presentations and roundtable discussions, GetInData will also contribute to the workshop part of the Summit.

  • Mateusz Pytel, our Google Cloud certified professional data engineer with a few years of experience in using Google Cloud Platform, will co-host the Big Data on Google Cloud workshop along with Radosław Stankiewicz, Strategic Cloud Engineer at Google. This one-day training is an excellent opportunity to get a hands-on taste of technologies available on Google Cloud Platform, such as BigQuery, Dataflow, Beam, PubSub and Data Studio.
  • Piotr Krewski will shed some light on the Hadoop ecosystem during the Hadoop Ecosystem Basics workshop.
  • Tomasz Żukowski will discuss Spark functionality in the From small data in Python to big data model in Apache Spark workshop.

If you want more information, take a look at the workshop agenda here.

Call to Action

And if you haven’t registered for the Summit yet, remember that the deadline is looming. We hope to see you there!

conference
big data
cloud
DevOps
open source
19 February 2019
