Job Description :
Responsibilities
Lead the design for acquiring state-of-the-art monitoring capabilities for critical Digital customer experiences across our Retail, PBM, Specialty business units.
Partner closely with product teams, customer engagement teams, application technical teams, business analysts to identify core business/technical metrics indicative of KPIs and system health.
Elaborate metrics requirements for acquiring the necessary monitoring capabilities
Partner closely with application technical teams in technical design for ensuring data logging and availability for monitoring and alerting
Develop & Build new monitoring dashboards and alerts through various tools such as Splunk, Analytics, etc.
Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
Lead the development of the processes and software necessary to maintain services post-deployment through data collection and monitoring ensuring overall health of the services provided.
Monitor and continuously improve the availability and performance of infrastructure, systems and applications
Capture and analyze data on Systems Availability, MTBF, and MTTR across all Digital channels; identify patterns and drive changes to both systems and processes to provide sustained improvements.
Document automation and the interaction of software and system as necessary to enable in others; ensure that other members of the team meet the same high standard of documentation.

Required Qualifications

5+ years experience with software development and/or systems engineering
5+ years experience troubleshooting software.
Strong analytical and troubleshooting skills
Strong collaboration skills and ability to communicate all aspects of the requirements, including the creation of formal documentation
Experience with large customer facing properties, preferably e-commerce, health care.
Usage and configuration of monitoring tools and dashboards such as Splunk, Logstash, Graphite, and StatsD
Understanding of monitoring tools landscape such as real user monitoring, synthetic monitoring, performance monitoring
Be able to write solid code in one or more of the following: Java, Javacript, C#, Python, Perl or Ruby.
Hands on experience with web servers and middleware such as ATG, JBoss, Weblogic, Websphere, Spring, NodeJS, Apache, and NGINX.
Good knowledge of database technologies, writing queries using SQL, and basic performance tuning
             

Similar Jobs you may be interested in ..