We are looking for an experienced SRE / DevOps Engineer with strong troubleshooting expertise and hands-on experience in CI/CD, Kubernetes, and the ELK Stack. The selected candidate will play a key role in designing, implementing, and improving DevOps processes, automation pipelines, and monitoring systems that support efficient and scalable software delivery.
- Lead the development and optimization of DevOps practices across the organization.
- Build, maintain, and enhance CI/CD pipelines using Jenkins and Groovy.
- Manage and support Kubernetes clusters and related infrastructure.
- Implement, manage, and optimize ELK Stack (Elasticsearch, Logstash, Kibana) for monitoring and logging.
- Establish and maintain continuous build, integration, and deployment environments.
- Review, verify, and troubleshoot software code to improve reliability and performance.
- Set up tools, systems, and infrastructure for development, testing, and deployment.
- Monitor product and operational processes across the full lifecycle and improve reliability.
- Build and encourage automation to enhance delivery efficiency.
- Perform vulnerability assessments and implement security and risk controls.
- Drive incident management and conduct root cause analysis.
- Select, integrate, and deploy DevOps and CI/CD tools.
- Lead continuous improvement initiatives for deployment and release operations.
- Mentor and guide junior DevOps and engineering team members.
- Communicate and coordinate with internal teams and external stakeholders.
- Track KPIs and customer experience metrics to support decision-making.
- Provide periodic reporting to leadership and customers on progress and performance.
- 12+ years of overall IT experience with a strong background in DevOps or SRE.
- Hands-on experience in Jenkins and Groovy scripting for CI/CD pipelines.
- Strong troubleshooting and problem-resolution capabilities.
- Proven experience managing Kubernetes platforms.
- Practical experience with Elasticsearch, Logstash, and Kibana.
- Experience with infrastructure automation and monitoring tools.
- Strong knowledge of secure DevOps practices and vulnerability management.
- Demonstrated ability to perform root cause analysis in incident situations.
- Excellent communication, collaboration, and team mentoring skills.