Big Data Training at Big Data Technology Warsaw Summit 2018
On the 20th and 21st of February, just before the 4th edition of the Big Data Technology Summit, Big Data experts from our team hosted one-day workshops: Introduction to Big Data, Real-time Stream Processing and Large scale text mining with Spark.
100 participants in two days
We received so many registration requests for all courses that we had to split the participants into two groups, one on each day. Over the two days, we trained around 100 people from diverse companies across multiple industries.
The most popular choice was Introduction to Big Data, a great training for beginners. The participants got to know Big Data technologies and tools such as Hadoop, Hive, Spark, Sqoop, Kafka, HUE, and Jupyter.
If you would like to learn about the most useful Big Data tools, sign up for our Big Data Workshop – the next edition will take place on the 22nd of May 2018 in Warsaw.
Growing interest in stream processing
Technologies used to process unbounded streams of data in real time are gaining a lot of traction. In this workshop, we focused on Apache Flink, the most promising and complete framework for stream processing. The attendees also had a chance to explore technologies such as Kafka and Spark Streaming.
Large-scale text processing with Apache Spark
The goal of the course was to teach participants how to apply data science methods to large amounts of text data. The toolset we used included Apache Spark (MLlib), a number of Python libraries, and Jupyter Notebook.
Feedback from attendees
We received a lot of positive reviews after the training:
“Great atmosphere, well-prepared lecturer, well-prepared infrastructure”
“The exercises were very well supported by the hosts”
“Well prepared, interesting exercise material. The level was good but with a challenge for me. I would like to come back to some of the exercises in the coming days”
“A good balance between the amount of theoretical data and exercises. It was OK”