Introduction To YARN

Jun 06, 2014

Adam Kawa

hadoop, yarn



We would like to recommend to read “Introduction To YARN” written by our consultant for IBM developerWorks. Please find the abstract of the article below:

Apache Hadoop is currently one of the most popular tools for big data processing. It has been successfully deployed in production by many companies for several years. Though Hadoop is considered as a reliable, scalable, and cost-effective solution, it is constantly being improved by a large community of developers. As a result, the 2.0 version offers several revolutionary features including YARN, HDFS Federation, and a highly-available NameNode which make the Hadoop cluster much more efficient,powerful, and reliable. In this article, learn about the advantages YARN provides over the previous version of the distributed processing layer in Hadoop.

Read more at IBM developerWorks.

Post by Adam Kawa

Adam became a fan of Big Data after implementing his first Hadoop job in 2010. Since then he has been working with Hadoop at Spotify (where he had proudly operated one of the largest and fastest-growing Hadoop clusters in Europe for two years), Truecaller, Authorized Cloudera Training Partner and finally now at GetInData. He works with technologies like Hadoop, Hive, Spark, Flink, Kafka, HBase and more. He has helped a number of companies ranging from fast-growing startups to global corporations. Adam regularly blogs about Big Data and he also is a frequent speaker at major Big Data conferences and meetups. He is the co-founder of Stockholm HUG and the co-organizer of Warsaw HUG.

Leave a Reply

Your email address will not be published. Required fields are marked *

Blue Captcha Image