Job Description :
Job Title: Infrastructure Engineer with 2-3 years experience
Location:  Chicago, IL
Duration: 6 months

Position Overview:
Participate in a 24x7/365 NOC team within a SaaS business responsible for monitoring, troubleshooting, remediation operations, and escalation of compute room environment, applications, networking, systems, messaging, developmental LAB, and telecommunications resolving incidents and outage events utilizing documented processes and run books.
Drive incident management during outages ensuring resolution, escalation, and communication processes are followed. 

Position Goals:
1st tier monitoring of all hosted environments including applications, networking, systems, messaging, developmental LAB, telecommunications incidents and outage events.
Decrease the signal-to-noise ratio of pages and alerts within the Global Hosting Services IT organization through proactive measures.
Triage outage events: identify the issue and its user/customer impact, communicate clearly and effectively and engage any required teams to resolve the issue as efficiently as possible.
Develop Global Hosting IT infrastructure knowledge base documentation and run books.
Experience with various monitoring applications and other data sources including Scom, AppDynamics, Nagios and Zenoss.
Work with other IT teams to coordinate change control of scheduled and unscheduled maintenance activities and outages.
Maintain event process/run books for accuracy. 
Provide 24x7/365 rotational support of production systems, applications, and services.

Principal Duties & Responsibilities:
Outage Communication
Create and manage outage events based upon service level monitors 
Drive IT communications and focus during outages 
Serve as the point-person for escalated outages. Responsible for managing communication, timelines, information gathering, etc.
1st tier Production Monitoring and Alerting 
Provide clear and concise reporting to the internal stakeholders regarding production uptime and availability and outage resolution.
Creation and maintenance of Run-book/Knowledgebase 
Tier 1 support of the production applications and infrastructure 
Release/data fix support as needed by the business 
Ownership of the execution of business related jobs and tasks.
Establish processes for introducing new jobs and monitoring in to the production environment
             

Similar Jobs you may be interested in ..