Job Description :

Lead Site Reliability Engineer 100% Remote
Must-Haves:

  • Previous SRE experience in enterprise or large-scale environments
  • Proficient in .NET (C#) for debugging, application performance monitoring, and infrastructure automation
  • Microsoft Azure (App Services, Azure DevOps, Azure Monitor, Key Vault, etc.)
  • Infrastructure as Code: ARM templates / Bicep / Terraform
  • CI/CD pipelines using Azure DevOps or GitHub Actions
  • Monitoring & Alerting: App Insights, Azure Monitor, Prometheus, Grafana
  • Strong understanding of SLA/SLO/SLI principles
  • Incident management, root cause analysis, and postmortems
  • Scripting in PowerShell / Bash / Python
  • Configuration management with Ansible / Chef / Puppet
  • Experience working with Containers (Docker, AKS)
             

Similar Jobs you may be interested in ..