Job Description :

Job Title: Site Reliability Engineer II

Location: Remote anywhere in US

IV: Phone and Video


Looking for candidates with a background in networking and Istio

Top skills that are REQUIRED:
- Service Mesh exp (Istio)
-
Exp with cloud service provider GCP
- Kubernetes
- CI/CD exp (Gitlab)
- DevOps exp


Site Reliability Engineers (SREs) at Client are responsible for maintaining the availability and uptime of infrastructure. SREs use software engineering principles to solve operational challenges to create reliable infrastructure. We try to reduce the toil from our everyday work using as much automation as possible.
Responsibilities:
* Implement SRE principles and practices across organization to improve performance and efficiency
* Research and implement solutions to build an always-up, always-available, resilient services
* Integrate and automate existing manual solutions and processes
* Participate in an on-call rotation for availability incidents
* Plan for growth and capacity of the infrastructure
* Troubleshoot and support productions issues
* Participates on cross functional company project teams responsible for implementing technology.
* Investigates anomalies/outages and determines steps to reproduce, root cause, and solutions options.
* Monitors environment performance and provides all necessary reporting analysis.
* Attends relevant conference/seminars to remain current on new and upcoming technology.
Skills Requirements:
* Hands-on experience with cloud service providers(at least one of GCP, AWS)
* Hands-on experience with at least one configuration management software (Ansible/Chef/Puppet)
* Experience with setting up Logging (e.g. ELK) and Monitoring(e.g. Prometheus) solutions
* Working knowledge of containers and any one container orchestration platform(Kubernetes/Nomad/Mesos/Swarm)
* Understanding and experience in at least one CI/CD pipeline (Jenkins/Travis/CircleCI/Gitlab etc.)
* Good understanding of Unix/Linux operating systems and its internals
* Well-versed with Linux CLI
* Apart from shell scripting(sh/bash), proficient with one other programming language(Python/Ruby/Go/Perl)
* Working knowledge of any one distributed version control systems (git/bzr/hg)
* Ability to write good technical user document
* Exposure to managing Infrastructure as Code with tools like Terraform/CloudFormation or using Cloud Provider SDKs
Experience Requirement:
* At least 4-6 years of hands-on DevOps/SRE experience
* At least 1-2 years of experience developing code, either maintaining scripts or applications
* Bachelor's degree in Engineering or related technical or quantitative degree (e.g. mathematics

             

Similar Jobs you may be interested in ..