Job Description :
Job Description:, ALL CAPS, NO SPACES B/T UNDERSCORES, , Bill Rate: $72, , PTN_US_GBAMSREQID_CandidateBeelineID, i.e. PTN_US_9999999_SKIPJOHNSON0413, , MSP Owner: Bader Almubarak, Location: Austin, Duration: 3 Months, GBaMS ReqID: 10187054, , , Job Title: Site Reliability Engineer (SRE), ________________________________________, , Role Summary:, Support live services by ensuring high availability of infrastructure, primary services, and studio services., Enable rapid game development through on-demand infrastructure services and cloud-based workflows., Engage across the full product lifecycle from architecture and delivery to production deployment and incident response., Manage both on-premises and cloud resources with a strong grasp of cloud infrastructure fundamentals., ________________________________________, , Key Responsibilities:, Design and architect distributed systems in the cloud; assist in migrating systems from on-prem data centers to the cloud., Develop monitoring, alerting, and dashboarding solutions to enhance visibility into application performance and business metrics., Troubleshoot and maintain large-scale distributed production systems across on-prem and cloud environments., Conduct root cause analysis and post-mortems to prevent future incidents., Leverage automation to reduce toil, improve detection and resolution times (MTTD & MTTR), and repair services., Design and implement CI/CD pipelines., Create documentation and support tooling for online support teams., ________________________________________, , Qualifications & Skills:, Experience monitoring infrastructure and application availability to meet SLI and SLO targets., Proficiency in virtualization, containerization, and cloud computing (AWS preferred)., Familiarity with VMWare ecosystems, Kubernetes, and Docker., Knowledge of tools and technologies such as:, ElasticSearch, Prometheus, Graphite, Kafka, Strong systems administration skills, particularly in *nix environments., Solid understanding of networking protocols and components., Experience with automation and orchestration tools like:, Chef, Puppet, Terraform, Packer, Jenkins, Programming experience in Python, Golang, and/or Java., Background in working with distributed systems., Comments for Suppliers:, Rate Details