Our contribution to Apache Flink 1.3.0

On June 1st, the Apache Flink community announced the 1.3.0 release that introduced a few very important features. It is also a special release for us at GeInData because we had the pleasure to be an important part of it. Our work focused on improving the Flink CEP library which got boosted amongst other cool features with this release.

Flink CEP

The list of features we contributed that we are most proud of are includes:

  • Quantifiers [FLINK-3318] – it allows you to specify pattern sequences consisting of flexible number of events matched into single parts. This feature is well known in the world of regex as such operators like: *, +, ? or count(#times)

Quantifiers

  • Support for different continuity requirements ([FLINK-6165], [FLINK-6208]) – this features enables combining simple patterns into larger sequence in many ways, which enables to model large variety of use cases

  • NOT pattern [FLINK-3320] – this quite important feature allows you to specify not only positive patterns that also ones that breaks sequence on occurrence of certain events.

Flink Core

Of course, as a responsible member of community, beside implementing new cool features we also tried and will still try to make Apache Flink as stable and reliable as possible. This resulted in number of other smaller patches such as FLINK-6137, FLINK-6609, FLINK-5781, FLINK-6290, FLINK-6080, FLINK-6000, FLINK-5824, FLINK-5864.

Kudos

At this point we would like to thank every member of the Flink community, especially Kostas Kloudas from dataArtisans who shepherds the Flink CEP library, for a very warm welcome and all the help we got. We would also like to congratulate the whole Apache Flink community for another great release! As GetInData we promise to be an active member of the community in the future.

SHARE THIS!
Post by Dawid Wysakowicz

Data Engineer at GetInData working to help people and companies succeed with Apache Flink. Has already started to participate in the Flink community with first patch contributions behind. First interested with Big Data technologies in 2015 while writing Master Thesis on Distributed Genomic Datawarehouse. Recently had helped to extract value from large datasets at mBank

Leave a Reply

Your email address will not be published. Required fields are marked *

Blue Captcha Image
Refresh

*