Job Description :
In-depth knowledge on the various components and technologies under? Cloudera? distribution - HDFS, Hive, Impala, Spark, YARN, Sentry, Oozie, etc – should be strong with configurations and DevOps skills - Contribute to the Design by understanding the architectural trade-offs, partner with developers & vendors in building the best practices, help developers with the optimal configurations - Very good knowledge on analyzing and fixing the bottlenecks on the cluster - performance tuning, effective resource usage, capacity planning, investigating/implementing emerging tech advancements - Perform daily performance monitoring of the cluster - Implement best practices, ensure cluster stability and create/analyze performance metrics - Good understanding of Linux and system fundamentals, design and implement custom scripts in Bash, Ksh, Python, etc.. for automation