Big Data Event

Truecaller, GetInData and Google’s contribution to Big Data Tech Warsaw Summit

GetInData, Google and Truecaller participate in the Big Data Tech Warsaw Summit 2019.

Fewer than two weeks remain until the 5th edition of the Big Data Tech Warsaw Summit! How time flies! You can find our initial coverage of the event here, but today we’ll walk you through the presentation that our colleagues from Truecaller will deliver, and shed some light on GetInData’s contribution to the Summit and its joint initiative with Google. Enjoy!

Presentations

Truecaller, a long-standing GetInData client, is a Swedish tech company that innovates how people call over the phone. Truecaller has created one of the most popular caller ID and phone spam protection apps. The app has over 100 million daily active users globally (and growing!), runs on a petabyte-scale infrastructure and contains several features powered by data and ML/AI, such as spam detection. Dhanesh Padmanabhan, their Director of Engineering, will deliver a talk on Truecaller’s analytics stack and the data-driven use cases, deployed on top of Kafka, Spark and Hadoop, that support the company’s core activities.

Krzysztof Zarzycki and Marek Wiewiórka, our most experienced Big Data veterans at GetInData, will discuss a future-proof, cloud-ready and open-source big data discovery platform. As this may sound a little vague, I will briefly walk you through it. Because GetInData constantly works on Big Data platforms with many clients, we are able to spot their bottlenecks and inefficiencies. This time my colleagues will take a look at the Data Lake. At most companies, the Data Lake becomes a controlled environment with strict service level agreements and strong guarantees, leaving no room for the creativity and experiments that drive unique, cutting-edge and innovative data science projects. Krzysztof and Marek will explain how this problem can be addressed by a modern open-source Data Discovery platform built next to the existing Data Lake. Think of it as a cloud-ready, open-source-based extension to the Data Lake that gives freedom of choice of processing power and libraries, as well as seamless data access, without lock-in.

Krzysztof Zarzycki (GetInData) and Alexey Brodovshuk (Kcell) speaking at Big Data Tech Warsaw Summit 2018 about their joint project of building a large-scale real-time streaming platform and its use cases in telco.

The panel and roundtable discussions

The Summit agenda also includes a series of roundtable discussions and panels. Their aim is to engage all participants and drive meaningful and inspiring debates on hot topics in the Big Data world.

The main panel, entitled “How current megatrends are changing the Big Data landscape and what it means to us”, will be led by Adam Kawa from GetInData.

Two roundtable discussions will be moderated by our experts: “Stream processing engines — features, performance, comparison” and “Big Data — the cloud way”. Join these tables if you’d like to discuss these topics with other specialists and our experts.

One of the roundtable discussions at Big Data Tech Warsaw Summit 2017.

The workshops

Besides presentations and roundtable discussions, GetInData will also contribute to the workshop part of the Summit.

  • Mateusz Pytel, our Google Cloud certified professional data engineer with a few years of experience in using Google Cloud Platform, will co-host the Big Data on Google Cloud workshop along with Radosław Stankiewicz, Strategic Cloud Engineer at Google. This one-day training is an excellent opportunity to get a hands-on taste of technologies available on Google Cloud Platform, such as BigQuery, Dataflow, Beam, PubSub and Data Studio.
  • Piotr Krewski will shed some light on the Hadoop ecosystem during the Hadoop Ecosystem Basics workshop.
  • Tomasz Żukowski will discuss Spark’s functionality in the workshop From small data in Python to big data model in Apache Spark.

For more information, take a look at the workshop agenda here.

Call to Action

And if you haven’t registered for the Summit yet, remember that the deadline is looming. Hope to see you there!

19 February 2019
