Location: Remote
Duration: 12+ months
Interview: 1-2 video rounds
Note - They are looking for a Senior Reliability/Systems Engineer who has solid experience working with Azure, PaaS/IaaS, and cloud technologies, plus experience with Terraform, Ansible, or Chef.
Client would prefer that this candidate hold at least a few cloud certifications, such as Azure Solutions Architect, Azure Developer, or Azure Security Engineer.
Position Overview
The Client Strategic Platforms Infrastructure Operations team is looking for a passionate Site Reliability Engineer with the ability to solve complex problems. As a Site Reliability Engineer, you will be part of a team helping Client transform our existing consumer technology platform, migrate it to a modernized, cloud-native platform hosted in a hybrid/multi-cloud environment, and operate it there.
Key Roles and Responsibilities
- Implement and manage the lifecycle of cloud solutions that are secure, performant, scalable, resilient, monitored, auditable, and cost-optimized
- Migrate existing platforms and applications to Azure
- Automate cloud-based infrastructure deployments and maintenance
- Manage and maintain tools for deployment, monitoring, and operations
- Focus on scalability, security, and availability of all infrastructure and processes
- Identify and address infrastructure deficiencies, availability gaps, and performance bottlenecks
- Serve as Tier 3 (T3) support for cloud infrastructure operations
- Collaborate with peer organizations, the Cloud infrastructure platform team, product delivery teams, and support organizations on technical issues, and provide guidance
Required Qualifications
- Bachelor’s degree in Computer Science, Information Systems or related field.
- 8+ years of experience working in Systems Engineering roles
- 5+ years of experience working with core cloud technologies, both PaaS and IaaS offerings
- 4+ years of in-depth experience with core Azure cloud technologies in a high-traffic production setting
- 3+ years of experience migrating applications to the cloud using cloud-native patterns
- Recognized cloud certification(s) such as Azure Solutions Architect, Azure Security Engineer, Azure Developer, AWS Solutions Architect, AWS Networking, AWS Security, or other recognized technology certifications in this space
Preferred Qualifications
- 4+ years of in-depth experience in core Azure cloud technologies such as Azure DevOps, VMSS, VNet, Azure Load Balancer, Azure Application Gateway, Azure Private Link, Cosmos DB, Azure Monitor/Application Insights, AKS, Azure Cache, Event Hubs, and Azure Functions
- 3+ years of experience building cloud automation/orchestration solutions with technologies such as Terraform, CloudFormation, Ansible, Chef, Puppet, or similar
- 2+ years of experience implementing highly available cloud/hybrid-cloud network solutions
- 2+ years of experience with build and CI/CD technologies such as GitHub, Maven, Jenkins, Nexus, or similar
- Deep understanding of cloud security, including preventative and retrospective controls
- Deep experience with performance tuning in a cloud environment
- Experience implementing and managing monitoring solutions for production cloud environments
- 1+ years of MuleSoft architecture, development, and administration experience
- Knowledge of or experience with core AWS technologies
- Experience with open-source, cloud-agnostic technologies such as Docker, Kubernetes, OpenStack, Ansible, and Terraform
- Experience with monitoring tools such as Prometheus/Grafana, Dynatrace, Nagios, Splunk, EFK, Azure Monitor, and Application Insights
- Knowledge of and demonstrated experience in Agile methodologies and practices
- Ability to adapt to a rapidly changing environment and technologies
- Ability to work in a highly collaborative environment.
Technologies
- Azure: Azure DevOps, VMSS, VNet, Azure Load Balancer, Azure Application Gateway, Azure Private Link, Cosmos DB, Azure Monitor/Application Insights, AKS, Azure Cache, Event Hubs, Azure Functions
- AWS: EC2, ALB/ELB, RDS, S3, Lambda, API Gateway, CloudFront, SNS, SQS, DynamoDB, CloudWatch, ElastiCache, EKS
- Automation and platforms: Ansible, Terraform, shell scripting, Kubernetes, Docker, OpenStack, Git, Jenkins, Linux administration (RHEL/CentOS/Ubuntu)
- Messaging, data, and web: Kafka, RabbitMQ, Redis, Cassandra, MongoDB, NGINX, MuleSoft
- Monitoring: Splunk, ELK, Dynatrace, New Relic, Grafana, Prometheus