Job Description :
DevOps Engineer III
Observability/SRE
Location: Hybrid - Denver, CO. Preference for local candidates; must be on-site approximately 4 days per week.
KEY RESPONSIBILITIES
  • Platform Stability: Maintain and troubleshoot a Kubernetes-based microservices architecture across 41+ data centers.
  • SRE & Operations: Lead change management, software rollouts, and platform upgrades while participating in an on-call rotation.
  • Observability: Investigate recording and playback failures using tools like Elastic, Prometheus, and Grafana.
  • Network Integration: Identify and resolve connectivity issues between the Cloud DVR platform and the Content Delivery Network (CDN).
  • Automation: Develop scripts to automate investigations and routine maintenance tasks.
EXPERIENCE: 6 8 years of professional experience in DevOps or SRE roles.
TOP 3 CORE REQUIREMENTS
  • Database Management: Proficiency in managing large-scale metadata databases.
  • Large-Scale Storage: Hands-on experience with object storage or distributed storage platforms.
  • Linux & Analytics: Strong Linux systems administration, bash/python scripting, and monitoring/alerting.
TECHNICAL ENVIRONMENT
  • Orchestration: Kubernetes, Docker.
  • Languages: Python, Bash, Go-lang.
  • Data/Storage: MySQL, SingleStore (MemSQL), IBM Cleversafe (Object Storage).
  • Monitoring: Grafana, Prometheus, Elasticsearch.
  • Cloud/DevOps: Azure DevOps, GitHub, Ansible.
             

Similar Jobs you may be interested in ..