Truecaller, GetInData and Google’s contribution to Big Data Tech Warsaw Summit

1 6ZTvzJwCviqIJcV5WQC0Sg GetInData, Google and Truecaller participate in the Big Data Tech Warsaw Summit 2019.

It’s already less than two weeks to the 5th edition of Big Data Warsaw Tech Summit! How time flies! The initiating coverage of this event you may find here, but today we’ll walk you through the presentation that will be delivered by our colleagues from Truecaller and will also shed some light on the GetInData’s contribution to the Summit and its joint initiative with Google. Enjoy!

Presentations

Truecaller, GetInData’s long-lasting client, is a Swedish tech company that innovates how people call over phone. Truecaller has created one of the most popular caller ID and phone spam protection app. The app has over 100 million daily active users globally (growing!), runs a petabyte-scale infrastructure and contains several features powered by data & ML/AI such as spam detection. Dhanesh Padmanabhan, their Director of Engineering, will deliver a speech on Truecaller’s analytics stack and their data-driven use-cases deployed on the top of Kafka, Spark, Hadoop that support the company’s core activities.

Krzysztof Zarzycki and Marek Wiewiórka, who are our most experienced Big Data veterans at GetInData, will discuss the future-proof, cloud-ready and open-source big data discovery platform. As this may sound a little bit vague, I will briefly walk you through it. As GetInData works all the time on Big Data platforms with many clients, we are able to spot their bottlenecks and inefficiencies. This time my fellow colleagues will take a look at Data Lake. At most companies, Data Lake becomes a controlled environment with strict service level agreements and strong guarantees where there is no room for creativity and experiments that drives the unique, cutting-edge and innovative data science projects. Krzysztof and Marek will explain how this problem may be addressed by a modern open-source data discovery platform, built next to existing Data Lake — the Data Discovery platform. Consider it as a cloud-ready open-source-based extension to Data Lake, that gives freedom of processing power and libraries choice and seamless data access, without lock-in.

Krzysztof Zarzycki (GetInData) and Alexey Brodovshuk (Kcell) speaking about their joint project of building large-scale real-time streaming platform and its use-cases at telco at Big Data Tech Warsaw Summit 2018. Krzysztof Zarzycki (GetInData) and Alexey Brodovshuk (Kcell) speaking about their joint project of building large-scale real-time streaming platform and its use-cases at telco at Big Data Tech Warsaw Summit 2018.

The panel and roundtable discussions

The Summit agenda assumes also a series of roundtable discussions and panels. Their aim is to engage all the participants and drive meaningful and inspiring debates on hot topics in Big Data world.

The main panel, entitled How current megatrends are changing the Big Data landscape and what it means to us, will be led by Adam Kawa from GetInData,

Two roundtable discussions will be moderated by our experts — they are “Stream processing engines — features, performance, comparison” and “Big Data — the cloud way”. Join these tables if you’d like to discuss these topics with other specialists and our experts.

One of the roundtable discussion at Big Data Tech Warsaw Summit 2017. One of the roundtable discussion at Big Data Tech Warsaw Summit 2017.

The workshops

Besides presentations and roundtable discussions, GetInData will also contribute to the workshop’s part of the Summit.

Mateusz Pytel, our Google Cloud certified professional data engineer with a few years of experience in using Google Cloud Platform will co-host Big Data on Google Cloud workshop along with Radosław Stankiewicz, Strategic Cloud Engineer at Google. This one-day training is an excellent opportunity to get some hands-on taste of such technologies like BigQuery, Dataflow, Beam, PubSub or Data Studio that are available on Google Cloud Platform.
Piotr Krewski and will shed some light on Hadoop ecosystem during Hadoop Ecosystem Basics workshop.
Tomasz Żukowski will discuss Spark system functionalities in From small data in Python to big data model in Apache Spark.

If you want to get some information, take a look at the workshop agenda here.

Call for Action

And if you don’t register for the Summit yet, remember, the deadline is looming. Hope to see you there!

conference

big data

cloud

DevOps

open source

Last updated: 19 February 2019

Written by

Adam Kawa

CEO and Founder

Want more? Check our articles

dynamodb aws jedraszewski getindata big data blog

Tutorial

Amazon DynamoDB - single table design

DynamoDB is a fully-managed NoSQL key-value database which delivers single-digit performance at any scale. However, to achieve this kind of…

Whitepaper

White Paper: Big Data Technologies in the Aviation Industry

About In this White Paper we described use-cases in the aviation industry which are the most prominent examples of Big Data related implementations…

Tutorial

NiFi Scripted Components - the missing link between scripts and fully custom stuff

Custom components As we probably know, the biggest strength of Apache Nifi is the large amount of ready-to-use components. There are, of course…

getindata nifi ingestion universe made out flow files nifi architecture big data

Tutorial

NiFi Ingestion Blog Series. PART IV - Universe made out of flow files - NiFi architecture

Apache NiFi, a big data processing engine with graphical WebUI, was created to give non-programmers the ability to swiftly and codelessly create data…

Tutorial

Maximizing Personalization: Real-Time Context and Persona Drive Better-Suited Products and Customer Experiences

Have you ever searched for something that isn't typical for you? Maybe you were looking for a gift for your grandmother on Amazon or wanted to listen…

Making the Right Choice: Flink or Kafka Streams?

Introduction Many teams may face the question, "Should we use Flink or Kafka Streams when starting a new project with real-time streaming requirements…

Truecaller, GetInData and Google’s contribution to Big Data Tech Warsaw Summit

Presentations

The panel and roundtable discussions

The workshops

Call for Action

Like this post?
Spread the word

Want more? Check our articles

Amazon DynamoDB - single table design

White Paper: Big Data Technologies in the Aviation Industry

NiFi Scripted Components - the missing link between scripts and fully custom stuff

NiFi Ingestion Blog Series. PART IV - Universe made out of flow files - NiFi architecture

Maximizing Personalization: Real-Time Context and Persona Drive Better-Suited Products and Customer Experiences

Making the Right Choice: Flink or Kafka Streams?

Contact us

Interested in our solutions?
Contact us!

Truecaller, GetInData and Google’s contribution to Big Data Tech Warsaw Summit

Presentations

The panel and roundtable discussions

The workshops

Call for Action

Like this post?Spread the word

Want more? Check our articles

Amazon DynamoDB - single table design

White Paper: Big Data Technologies in the Aviation Industry

NiFi Scripted Components - the missing link between scripts and fully custom stuff

NiFi Ingestion Blog Series. PART IV - Universe made out of flow files - NiFi architecture

Maximizing Personalization: Real-Time Context and Persona Drive Better-Suited Products and Customer Experiences

Making the Right Choice: Flink or Kafka Streams?

Contact us

Interested in our solutions?Contact us!

Like this post?
Spread the word

Interested in our solutions?
Contact us!