Slides: Introduction to Apache Flume

Aug 05, 2014

Adam Kawa




We would like to share slides about Apache Flume that come from Hadoop Administrator Training delivered by GetInData.

Apache Flume is a distributed, reliable, and available service for collecting, aggregating, and moving large amounts of log data. By reading these slides, you learn about Apache Flume, its motivation, the most important features, an architecture of Flume, its reliability guarantees, Agent’s configuration, integration with the Apache Hadoop Ecosystem and more.

Post by Adam Kawa

Adam became a fan of Big Data after implementing his first Hadoop job in 2010. Since then he has been working with Hadoop at Spotify (where he had proudly operated one of the largest and fastest-growing Hadoop clusters in Europe for two years), Truecaller, Authorized Cloudera Training Partner and finally now at GetInData. He works with technologies like Hadoop, Hive, Spark, Flink, Kafka, HBase and more. He has helped a number of companies ranging from fast-growing startups to global corporations. Adam regularly blogs about Big Data and he also is a frequent speaker at major Big Data conferences and meetups. He is the co-founder of Stockholm HUG and the co-organizer of Warsaw HUG.

Leave a Reply

Your email address will not be published. Required fields are marked *

Blue Captcha Image