This Refcard presents Apache Hadoop, a software framework that enables distributed storage and processing of large datasets using simple high-level programming models. The card covers the most important concepts of Hadoop, describes its architecture, and explains how to get started with it and how to write and execute various applications on Hadoop.
Piotr Krewski has extensive practical experience writing applications that run on Hadoop clusters, as well as maintaining, managing, and expanding Hadoop clusters.
At Spotify, he was part of the team operating arguably the biggest Hadoop cluster in Europe.
He is a co-founder of GetInData, where he currently works as an architect and engineer, helping companies build scalable, distributed architectures for storing and processing big data. Piotr also serves as a Hadoop instructor, delivering GetInData's proprietary training courses for administrators, developers, and analysts working with big data solutions.