Camus, a MapReduce job that loads data from Kafka into HDFS, has a number of time-related configuration settings and assumptions. They control how many messages are consumed from Kafka in each Camus run and where the data is stored in HDFS. I summarize them in this blog post.
Subscribe our Newsletter
Subscribe to our newsletter to stay up to date!
GetInData s.c. with registered address Artura Grottgera 15/1, 00-785 Warsaw, Poland, company registration no. PL5252603485.
It doesn't matter if you need our service or you want to learn from us. Do not hesitate to contact or visit us directly!
Adam Kawa: (+48) 537 334 606
Piotr Krewski: (+48) 888 185 137
Krzysztof Zarzycki: (+41) 775 366 867
Klaudia Zdunczyk: (+48) 663 422 641
EMAIL or FIND US