Job Description:
Core Responsibilities:
Build, maintain and support IT infrastructure (OpenStack and Cloud Foundry)
Experience deploying, maintaining and troubleshooting OpenStack in a cloud environment
5+ years' experience working in high-transaction production environments following industry best practices for change, problem and incident management
Experience supporting multiple applications and meeting SLAs for uptime and availability
Troubleshooting cross-platform transactions in a distributed multi-datacenter environment
Ability to troubleshoot transaction failures within an application environment and determine root cause (e.g. network, database, SAN, application, connectivity)
Drives issues through to closure, engaging all appropriate resources. Leads technical bridges and provides troubleshooting direction. Provides guidance and recommended solutions for complex technical issues
Acts as an advocate for Engineering Operations procedures, policies, and processes. Ensures projects are fully integrated into the operations environment including lifecycle problem management from front line CARE through Engineering
Creates data and metric systems to track operational workflows; maintains records of results and feedback. Analyzes data and metrics, identifies problem areas, and provides actionable insight to management
Provides input to Engineering and vendors on defects and required enhancements. Attains all relevant industry standard technical certifications
Performs complex and routine maintenance tests for designated areas of engineering. Identifies, isolates, and escalates issues to appropriate personnel. Ensures that all maintenance is properly validated to minimize subscriber impact to (ideally) zero
Contributes to design considerations for new products or architectural changes to existing products. Assists with or leads efforts to build new application infrastructure, coordinating efforts across teams
Analyzes problems in design, configuration, data flow, and data state within a highly complex multi-product provisioning system

Technical Expertise:

Experience with scripting for application support using Puppet, Python, Shell, Perl or similar languages
Working knowledge of several continuous integration/automation tools – Ansible, Chef, Jenkins and GoCD
Experience deploying applications on servers using a configuration management tool such as Puppet, Chef, Fabric or Cobbler
Deep knowledge of OpenStack, Cloud Foundry or other cloud computing and virtualization concepts
Experience with system monitoring – Nagios, Wily, AppDynamics
             
