Job Description:

Site Reliability Engineer
Work Location
Houston, TX – Onsite (with remote option during COVID)
Duration
6+ Months
Must Have Skills / Addl. Info.
Linux, Bash command line and scripting, networking, Python 3
Understanding of Docker
Job Description
Essential Responsibilities and Duties:
Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
Maintain and improve services once they are live by measuring and monitoring availability, latency, resource usage and overall system health.
Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.
Gauge the effectiveness and efficiency of existing systems and infrastructure, and implement strategies for improving them.
Collaborate with network and security staff to ensure smooth, secure and reliable operation of application software and systems.
Develop, implement and document best-practice policies and procedures for new projects or initiatives.
Use the service management systems effectively, ensuring that best practices and lessons learned are made available to the wider technical community.
Engage in incident response and blameless postmortems.
Maintain a broad knowledge of state-of-the-art computer technology, equipment, and systems; participate in professional development activities as appropriate.
Support SRE tooling and processes such as Rundeck, PagerDuty, Stackdriver, PAM access (CyberArk), Operational Readiness (internal process), DR/incident drills, incident reports, cost dashboards, billing exports, AgoraCore SLI dashboards and certificates.
Standard incident response and postmortems.
 
Technology:
· Linux, Bash command line and scripting, networking, Python 3
· Understanding of Docker
 
Previous Experience and Competencies:
Bachelor’s degree in an IT-related discipline
Strong computer literacy with aptitude and readiness for multidisciplinary training
4–6 years of experience (senior level, hands-on)
 
Preferred Qualifications
Strong software engineering skills.
Interest in designing, analyzing and troubleshooting large-scale distributed systems.
Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
Ability to debug and optimize code and automate routine tasks.
Good to have: Azure IoT Edge, Azure Cloud.
 
Behavior:
Fosters and maintains excellent internal, client and third-party relationships
Possesses a high degree of initiative
Adaptable and willing to learn new technologies; keeps abreast of key developments in relevant technologies
Able to work under pressure
Excellent oral and written communication and interpersonal skills
Practices effective listening techniques
Able to work independently or as part of a team
Effectively analyzes and solves problems with attention to the root cause

 
             
