Job Description :
Position : Site Reliability Engineer (SRE)
Location : Atlanta GA
Duration : 6+ months

Description :
Proficient in production monitoring concepts and implementation including real user, application performance, system, log, time-series, and dash boarding. Includes tools like app dynamics, newrelic, splunk, grafana, ELK, etc
Proficient in production systems design including High Availability, Performance, Efficiency, and Security
Proficient in a modern scripting language like shell.
Proficient in a modern infrastructure automation toolkit such as Puppet or Chef or Ansible.
Proficient in a Linux or Unix based environment
Deep understanding of modern micro service based architectures and operations.
Experience in destructive testing methodologies and tools.
Experience in CI/CD automation.
Experience in a version control systems such as Git or SVN.
Experience in a cloud computing platform and the associated automation patterns it provides
Experience in defensive coding practices and patterns for high-availability
Systematic approach to solving problems.
Strong communication skills, ownership, and drive.
Ability to debug and optimize code and automate routine tasks.
In catastrophic situations available 24x7 to quickly respond and resolve critical service outages severely impacting consumers.
             

Similar Jobs you may be interested in ..