Archive for the ‘Blog’ Category

We share our knowledge happily

On June 1st, the Apache Flink community announced the 1.3.0 release that introduced a few very important features. It is also a special release for us at GetInData because we had the pleasure of being an important part of it. Our work focused on improving the Flink CEP library which got boosted amongst other cool […]

In the first part of this blog post we described a number of challenges that need to be addressed when implementing data pipelines with technologies like Pig, Scalding, Spark, Spark Streaming or Storm. For instance, if we want to sessionize our events correctly with these technologies, we have to implement complex custom code to deal […]

While a lot of problems can be solved in batch, the stream-processing approach can give you even more benefits. In this blog post series we’ll discuss a real-world example of user session analytics to give you a use-case-driven overview of business and technical problems that modern stream processing technologies like Apache Flink help you [...]

Big Data Tech Warsaw 2017 was successfully carried out and is now over! We are pleased to say that the conference brought together over 55 speakers from 16 different countries and over 350 participants interested in practical aspects of Big Data technologies. It was definitely the greatest technical Big Data event in Poland and at GetInData we [...]

Dec 17, 2016

Adam Kawa



As the Big Data Tech Warsaw 2017 conference is getting closer, we’d like to highlight the most interesting topics that will be covered during this exciting event. This year the event will feature more than 25 technical talks given in four parallel tracks.

Nov 08, 2016

Adam Kawa



Schema evolution of a Hive table backed by the Avro file format allows you to modify the table schema in several “schema-compatible” ways without the need to rewrite all existing data. Thanks to that, your HiveQL queries can read old and new Avro files uniformly using the current table schema. In this blog post I briefly […]

In this short post I will focus on the user management aspects of HUE, something that every administrator needs to tackle. Intro: When it comes to production setups, HUE provides ways to integrate an existing user base (e.g. LDAP) with the service itself. That pretty much solves the problem for production. The situation looks a bit different […]
