Big Data Event
5 min read

Truecaller, GetInData and Google’s contribution to Big Data Tech Warsaw Summit

1 6ZTvzJwCviqIJcV5WQC0Sg GetInData, Google and Truecaller participate in the Big Data Tech Warsaw Summit 2019.

It’s already less than two weeks to the 5th edition of Big Data Warsaw Tech Summit! How time flies! The initiating coverage of this event you may find here, but today we’ll walk you through the presentation that will be delivered by our colleagues from Truecaller and will also shed some light on the GetInData’s contribution to the Summit and its joint initiative with Google. Enjoy!

Presentations

Truecaller, GetInData’s long-lasting client, is a Swedish tech company that innovates how people call over phone. Truecaller has created one of the most popular caller ID and phone spam protection app. The app has over 100 million daily active users globally (growing!), runs a petabyte-scale infrastructure and contains several features powered by data & ML/AI such as spam detection. Dhanesh Padmanabhan, their Director of Engineering, will deliver a speech on Truecaller’s analytics stack and their data-driven use-cases deployed on the top of Kafka, Spark, Hadoop that support the company’s core activities.

Krzysztof Zarzycki and Marek Wiewiórka, who are our most experienced Big Data veterans at GetInData, will discuss the future-proof, cloud-ready and open-source big data discovery platform. As this may sound a little bit vague, I will briefly walk you through it. As GetInData works all the time on Big Data platforms with many clients, we are able to spot their bottlenecks and inefficiencies. This time my fellow colleagues will take a look at Data Lake. At most companies, Data Lake becomes a controlled environment with strict service level agreements and strong guarantees where there is no room for creativity and experiments that drives the unique, cutting-edge and innovative data science projects. Krzysztof and Marek will explain how this problem may be addressed by a modern open-source data discovery platform, built next to existing Data Lake — the Data Discovery platform. Consider it as a cloud-ready open-source-based extension to Data Lake, that gives freedom of processing power and libraries choice and seamless data access, without lock-in.

Krzysztof Zarzycki (GetInData) and Alexey Brodovshuk (Kcell) speaking about their joint project of building large-scale real-time streaming platform and its use-cases at telco at Big Data Tech Warsaw Summit 2018. Krzysztof Zarzycki (GetInData) and Alexey Brodovshuk (Kcell) speaking about their joint project of building large-scale real-time streaming platform and its use-cases at telco at Big Data Tech Warsaw Summit 2018.

The panel and roundtable discussions

The Summit agenda assumes also a series of roundtable discussions and panels. Their aim is to engage all the participants and drive meaningful and inspiring debates on hot topics in Big Data world.

The main panel, entitled How current megatrends are changing the Big Data landscape and what it means to us, will be led by Adam Kawa from GetInData,

Two roundtable discussions will be moderated by our experts — they are “Stream processing engines — features, performance, comparison” and “Big Data — the cloud way”. Join these tables if you’d like to discuss these topics with other specialists and our experts.

One of the roundtable discussion at Big Data Tech Warsaw Summit 2017. One of the roundtable discussion at Big Data Tech Warsaw Summit 2017.

The workshops

Besides presentations and roundtable discussions, GetInData will also contribute to the workshop’s part of the Summit.

  • Mateusz Pytel, our Google Cloud certified professional data engineer with a few years of experience in using Google Cloud Platform will co-host Big Data on Google Cloud workshop along with Radosław Stankiewicz, Strategic Cloud Engineer at Google. This one-day training is an excellent opportunity to get some hands-on taste of such technologies like BigQuery, Dataflow, Beam, PubSub or Data Studio that are available on Google Cloud Platform.
  • Piotr Krewski and will shed some light on Hadoop ecosystem during Hadoop Ecosystem Basics workshop.
  • Tomasz Żukowski will discuss Spark system functionalities in From small data in Python to big data model in Apache Spark.

If you want to get some information, take a look at the workshop agenda here.

Call for Action

And if you don’t register for the Summit yet, remember, the deadline is looming. Hope to see you there!

conference
big data
cloud
DevOps
open source
19 February 2019

Want more? Check our articles

llm reading assistant getindataobszar roboczy 1 4
Tech News

Combining Kedro and Streamlit to build a simple LLM-based Reading Assistant

Generative AI and Large Language Models are taking applied machine learning by storm, and there is no sign of a weakening of this trend. While it is…

Read more
data pipelines dbt bigquery getindata
Tutorial

Up & Running: data pipeline with BigQuery and dbt

Nowadays, companies need to deal with the processing of data collected in the organization data lake. As a result, data pipelines are becoming more…

Read more
deploying serverless mlflow google cloud platform using cloud run machine learning getindata notext
Tutorial

Deploying serverless MLFlow on Google Cloud Platform using Cloud Run

At GetInData, we build elastic MLOps platforms to fit our customer’s needs. One of the key functionalities of the MLOps platform is the ability to…

Read more
big data blog getindata data enrichment flink sql http connector
Tutorial

Data Enrichment in Flink SQL using HTTP Connector For Flink - Part One

HTTP Connector For Flink SQL  In our projects at GetInData, we work a lot on scaling out our client's data engineering capabilities by enabling more…

Read more
datagenerationobszar roboczy 1 4
Tutorial

Data online generation for event stream processing

In a lot of business cases that we solve at Getindata when working with our clients, we need to analyze sessions: a series of related events of actors…

Read more
hfobszar roboczy 1 4
Tutorial

Automated Machine Learning (AutoML) with BigQuery ML. Start Machine Learning easily and validate if ML is worth investing in or not.

Machine learning is becoming increasingly popular in many industries, from finance to marketing to healthcare. But let's face it, that doesn't mean ML…

Read more

Contact us

Interested in our solutions?
Contact us!

Together, we will select the best Big Data solutions for your organization and build a project that will have a real impact on your organization.


What did you find most impressive about GetInData?

They did a very good job in finding people that fitted in Acast both technically as well as culturally.
Type the form or send a e-mail: hello@getindata.com
The administrator of your personal data is GetInData Poland Sp. z o.o. with its registered seat in Warsaw (02-508), 39/20 Pulawska St. Your data is processed for the purpose of provision of electronic services in accordance with the Terms & Conditions. For more information on personal data processing and your rights please see Privacy Policy.

By submitting this form, you agree to our Terms & Conditions and Privacy Policy