Job Description:
Role: Site Reliability Engineer
Location: Irvine, CA
This will be remote for 2020 and part of 2021.
Duration: 6 months, with likely long-term extensions
Interview process: Phone and webcam

Candidates MUST HAVE eCommerce application experience

This is a remote position during COVID-19; the role will return to the office in 2021 when it is safe to do so.

As a Site Reliability Engineer (SRE), you will build mission-critical platforms, tools, and processes that ensure the highest levels of availability and reliability across all our applications and services. You will have plenty of opportunity to build tools, frameworks, and cloud platforms that will support our company’s growth over the next decade. If you are a self-starter who jumps on new ideas to make the platform more stable, secure, and feature-rich, this is your opportunity.

What you’ll do:
Design, deploy, and configure customer-facing infrastructure, applications, and services
Design and manage highly resilient cloud infrastructure and services that meet enterprise-grade SLA standards
Resolve customer escalations and help prevent recurrence of those incidents by creating processes, procedures, and automation
Monitor, diagnose, and resolve urgent production issues, potentially outside normal business hours
Create and deploy scalable monitoring systems for a rapidly growing global infrastructure
Design, implement, and deploy cloud and application management automation
Write, extend, and maintain Ops documentation

REQUIRED SKILLS:
5 years of experience in a DevOps or SRE role
Bachelor's degree in Computer Science, a related technical field involving computer systems engineering, or equivalent practical experience.
Problem solver with strong customer focus and ability to engage and influence challenging audiences
Comfort and experience with an Ops environment that is scaling rapidly
Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
Ability to debug and optimize code and to automate routine tasks
Hands-on experience managing cloud infrastructure and using infrastructure-as-code tools such as Puppet, Chef, Ansible, or similar
Knowledge of virtualization, cloud architecture and services, automated deployments, APIs, Docker, and Kubernetes
Strong background in Linux/Unix system administration
Excellent scripting skills and experience (Bash & Python, Python preferred)
Experience maintaining and deploying systems and software in diverse environments
Strong understanding of web, security, and network protocols and technologies, including HTTP, SSL/TLS, DNS, subnetting, NACLs, VPCs, load balancers, reverse proxies, firewalls, etc.
Rich DevOps skills across CI/CD, SCM, static code analysis, builds and releases, and continuous integration tools and frameworks (e.g. Git, Jenkins, etc.)
Ability to deliver results and work cross-functionally.
             
