Job Description :
Overall 9+ years of work experience with 3-5 years of working experience in hadoop administration.
Responsible for setup, administration, monitoring, tuning, optimizing, governing Hadoop Cluster and Hadoop components
Design and implement new components and various emerging technologies in Hadoop Echo System and successful execution of various Poof-Of-Technology (PoT)
Design and implement high availability options for critical component like Kerberos, Ranger, Amabari, Resource Manager, MySQL repositories
Collaborate with various cross functional teams; infrastructure, network, database and application for various activities: deployment new hardware/software, environment, capacity, uplift, etc.
Work with various teams to setup new Hadoop users, security and platform governance
Create and execute capacity planning strategy process for the Hadoop platform
Work on cluster maintenance as well as creation and removal of nodes using tools like Ganglia, Nagios, Cloudera Manager Enterprise, Ambari, etc.
Performance tuning of Hadoop clusters and various Hadoop components and routines
Monitor job performances, file system/disk-space management, cluster and database connectivity, log files, management of backup/security and troubleshooting various user issues
Hadoop cluster performance monitoring and tuning, disk space management
Harden the cluster to support use cases and self services in 24x7 model and apply advanced troubleshooting techniques to critical, highly complex customer problems
Contribute to the evolving Hadoop architecture of our services to meet changing requirements for scaling, reliability, performance, manageability and price
Setup monitoring and alerts for the Hadoop cluster, creation of dashboards, alerts and weekly status report for uptime, usage, issues, etc
Design, implement, test and document performance benchmarking strategy for platform as well or each use case
Act as liaison between the Hadoop cluster administrators and the Hadoop application development team to identify and resolve issues impacting application availability, scalabilty, performance and data throughput
Automate deployment and management of Hadoop services including implementing monitoring