Job Description:
Position: DevOps/SRE

Location: Chicago, IL

Interview Process: Webex

Contract: 6+ Months



Job Description:


Good infrastructure knowledge with application support background.
Ability to independently troubleshoot complex application issues that involve application code, servers, and the network.
Good background in Java or .NET based applications.
Strong scripting skills in Unix (shell), Windows (PowerShell), and Python.
Automation background in Puppet and Ansible playbooks.
Good knowledge of monitoring tools (Apica, AppDynamics (AppD), and Splunk).
Good knowledge of CI/CD tools.
Service improvement background with proven business value.



This organization within CTC provides access control governance and Identity Services for all lines of business (LOBs) globally, giving the right people the right access at the right time across all technology platforms and applications supported by CTC. It also provides a comprehensive set of applications, tools, and staff to implement, monitor, and manage technology risk solutions globally.


As SRE/DevOps Lead, you will be responsible for the day-to-day operations of infrastructure and products, including their reliability and performance in production and testing environments. You will work with the development support teams to provide design guidelines and share knowledge of technical trends and solutions. You will lead technical projects to install new applications and expand existing ones, identify the hardware and software components the applications will use, and provide instruction on how to implement those components appropriately. You will instrument end-to-end monitoring of the infrastructure so that SLOs and SLIs are tracked and potential issues trigger timely alerts, and ensure the infrastructure has adequate capacity and is refreshed periodically. You will also develop and maintain system documentation, runbooks, and production metrics reporting.


Provide support, guidance, and training to both internal and external clients during the analysis, development and testing processes. Work effectively within the team to identify and resolve issues. Communicate effectively with both technical and non-technical individuals at all levels. Provide expert knowledge of the Identity Management and Security Tools architecture and serve as Subject Matter Expert to IT Risk Management teams on those topics.


Responsibilities:


Develop and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets.
Communicate and present effectively to management, peers, and staff.
Train the team on DevOps/SRE skills.
Develop software to automate manual operational tasks.
Run, maintain, and improve services against established Service Level Objectives by applying software engineering principles.
Take responsibility for the availability, performance, change management, monitoring, and capacity management of those services.
Engage with the development team throughout the life cycle to help build for reliability.
Troubleshoot priority incidents, conduct blameless post-mortems, and ensure permanent closure of the incidents.
Analyze patterns of production incidents, develop permanent remediation plans, and apply software engineering to automate the prevention of future incidents.
Facilitate maximum speed of delivery by objectively adhering to the error budgets of the service.
Manage the split of effort between manual operational work and engineering work.
Participate in the 24x7x365 support coverage.
             
