Big Data Event
5 min read

Truecaller, GetInData and Google’s contribution to Big Data Tech Warsaw Summit

1 6ZTvzJwCviqIJcV5WQC0Sg GetInData, Google and Truecaller participate in the Big Data Tech Warsaw Summit 2019.

It’s already less than two weeks to the 5th edition of Big Data Warsaw Tech Summit! How time flies! The initiating coverage of this event you may find here, but today we’ll walk you through the presentation that will be delivered by our colleagues from Truecaller and will also shed some light on the GetInData’s contribution to the Summit and its joint initiative with Google. Enjoy!

Presentations

Truecaller, GetInData’s long-lasting client, is a Swedish tech company that innovates how people call over phone. Truecaller has created one of the most popular caller ID and phone spam protection app. The app has over 100 million daily active users globally (growing!), runs a petabyte-scale infrastructure and contains several features powered by data & ML/AI such as spam detection. Dhanesh Padmanabhan, their Director of Engineering, will deliver a speech on Truecaller’s analytics stack and their data-driven use-cases deployed on the top of Kafka, Spark, Hadoop that support the company’s core activities.

Krzysztof Zarzycki and Marek Wiewiórka, who are our most experienced Big Data veterans at GetInData, will discuss the future-proof, cloud-ready and open-source big data discovery platform. As this may sound a little bit vague, I will briefly walk you through it. As GetInData works all the time on Big Data platforms with many clients, we are able to spot their bottlenecks and inefficiencies. This time my fellow colleagues will take a look at Data Lake. At most companies, Data Lake becomes a controlled environment with strict service level agreements and strong guarantees where there is no room for creativity and experiments that drives the unique, cutting-edge and innovative data science projects. Krzysztof and Marek will explain how this problem may be addressed by a modern open-source data discovery platform, built next to existing Data Lake — the Data Discovery platform. Consider it as a cloud-ready open-source-based extension to Data Lake, that gives freedom of processing power and libraries choice and seamless data access, without lock-in.

Krzysztof Zarzycki (GetInData) and Alexey Brodovshuk (Kcell) speaking about their joint project of building large-scale real-time streaming platform and its use-cases at telco at Big Data Tech Warsaw Summit 2018. Krzysztof Zarzycki (GetInData) and Alexey Brodovshuk (Kcell) speaking about their joint project of building large-scale real-time streaming platform and its use-cases at telco at Big Data Tech Warsaw Summit 2018.

The panel and roundtable discussions

The Summit agenda assumes also a series of roundtable discussions and panels. Their aim is to engage all the participants and drive meaningful and inspiring debates on hot topics in Big Data world.

The main panel, entitled How current megatrends are changing the Big Data landscape and what it means to us, will be led by Adam Kawa from GetInData,

Two roundtable discussions will be moderated by our experts — they are “Stream processing engines — features, performance, comparison” and “Big Data — the cloud way”. Join these tables if you’d like to discuss these topics with other specialists and our experts.

One of the roundtable discussion at Big Data Tech Warsaw Summit 2017. One of the roundtable discussion at Big Data Tech Warsaw Summit 2017.

The workshops

Besides presentations and roundtable discussions, GetInData will also contribute to the workshop’s part of the Summit.

  • Mateusz Pytel, our Google Cloud certified professional data engineer with a few years of experience in using Google Cloud Platform will co-host Big Data on Google Cloud workshop along with Radosław Stankiewicz, Strategic Cloud Engineer at Google. This one-day training is an excellent opportunity to get some hands-on taste of such technologies like BigQuery, Dataflow, Beam, PubSub or Data Studio that are available on Google Cloud Platform.
  • Piotr Krewski and will shed some light on Hadoop ecosystem during Hadoop Ecosystem Basics workshop.
  • Tomasz Żukowski will discuss Spark system functionalities in From small data in Python to big data model in Apache Spark.

If you want to get some information, take a look at the workshop agenda here.

Call for Action

And if you don’t register for the Summit yet, remember, the deadline is looming. Hope to see you there!

conference
big data
cloud
DevOps
open source
19 February 2019

Want more? Check our articles

xobszar roboczy 5blog
Success Stories

From concept to production in 2 months: sales forecasting Machine Learning model for dema.ai

Sales forecasting is a critical aspect of any business, especially in the fast-paced and competitive world of e-commerce. Accurately predicting future…

Read more
getindata flink kafka audio spectrum analyzer smalltext
Use-cases/Project

Puzzles in the time of plague: truly over-engineered audio spectrum analyzer

Quarantaine project Staying at home is not my particular strong point. But tough times have arrived and everybody needs to change their habits and re…

Read more
big data blog getindata data enrichment flink sql http connector
Tutorial

Data Enrichment in Flink SQL using HTTP Connector For Flink - Part One

HTTP Connector For Flink SQL  In our projects at GetInData, we work a lot on scaling out our client's data engineering capabilities by enabling more…

Read more
bloghfobszar roboczy 1 4
Tutorial

Airbyte is in the air - data ingestion with Airbyte

One of our internal initiatives are GetInData Labs, the projects where we discover and work with different data tools. In the DataOps Labs, we’ve been…

Read more
getindata cover nifi ingestion kafka poc notext
Tutorial

NiFi Ingestion Blog Series. PART V - It’s fast and easy, what could possibly go wrong - one year history of certain nifi flow

Apache NiFi, a big data processing engine with graphical WebUI, was created to give non-programmers the ability to swiftly and codelessly create data…

Read more
lean big data 1
Tutorial

Lean Big Data - How to avoid wasting money with Big Data technologies and get some ROI

During my 6-year Hadoop adventure, I had an opportunity to work with Big Data technologies at several companies ranging from fast-growing startups (e…

Read more

Contact us

Interested in our solutions?
Contact us!

Together, we will select the best Big Data solutions for your organization and build a project that will have a real impact on your organization.


What did you find most impressive about GetInData?

They did a very good job in finding people that fitted in Acast both technically as well as culturally.
Type the form or send a e-mail: hello@getindata.com
The administrator of your personal data is GetInData Poland Sp. z o.o. with its registered seat in Warsaw (02-508), 39/20 Pulawska St. Your data is processed for the purpose of provision of electronic services in accordance with the Terms & Conditions. For more information on personal data processing and your rights please see Privacy Policy.

By submitting this form, you agree to our Terms & Conditions and Privacy Policy