Job Description :
Role :Site Reliability Engineer Location:Tempe, AZ Duration: 12 months Have managed production infrastructure sites for front and back-end services Good knowledge of Linux internals and administration Experience in systems software development (java go, python, bash, . ) Deep knowledge of infrastructure as code principles, knowledge of Terraform is a must to have. Deep experience with AWS (Cloud Computing: Ec2, S3, RDS, VPC, Security Groups, ELB, ElastiCache, Beanstalk, Redshift, knowledge with SQL and noSQL database administration Able to define actionable monitoring and alerting for systems On-call experience dealing with production incident management and resolution Cloud Expert: Well versed in AWS services for monitoring, logging, metrics, high availability, and automation Strong organizational skills with extremely high level of attention to detail Highly motivated, quality conscious self-starter that requires little to no supervision, able to own tasks from start to finish Customer focused - Investigates and resolves customer issues and inquiries (i.e., emergency and non-emergency) Identify, receive, triage and act upon events and incidents coming from various SaaS services Consistently meets or exceeds established Command Center key performance indicators (KPI's) Work per escalation, notification and incident practices Monitor the availability or the CI/CD environments Working under pressure in production environments running production customer workloads and services 5+ Year of AWS Cloud Experience and 8+ years in IT Excellent written and verbal communication skills Experience implementing fault detection, and automating fixes Experience designing scalable services Experience designing distributed, fault-tolerant systems Experience with Micro-Services Architecture Experience managing services in AWS At least one AWS certification (Associates level) Thanks& Regards Supriya Pathak Direct EXT-100 Phone E-mail: Web:
             

Similar Jobs you may be interested in ..