Job Description :
Position Purpose:
Provides operational support for the production High Performance Computing (HPC) environment in the Advanced Computing Technologies (ACT) department.
Drawing upon the operating plan, design specifications and technical oversight, leverage enabling technologies to meet the desired goals, objectives and strategies of the Computational Fluid Dynamics, Simulation, and modeling engineering business areas.
Responsible for the optimum integration of scientific applications to high performance computing technology.

Principle Duties and Responsibilities:
Essential Functions:
Assists with the day-to-day operations of production HPC clusters.
Troubleshoots and maintains the Infiniband network.
Assists end users running applications on the HPC cluster(s
Manage, maintain, monitor, and control interactive and batch processes, both scheduled and unscheduled (including on-request processing
Complete engineering-defined batch processing and backups in the correct sequence and within the established time periods.
Perform proactive failure trend analysis and root cause analysis for all system failures.
Produce trend reports to highlight production issues and follow predetermined action and escalation procedures when issues are encountered.
Monitor, verify, and suggest appropriate adjustments to support proper application executions.
Provide technical solutions that meet the performance and processing objectives of the business areas.
Follow upgrade plans to ensure compliance with corporate policies and industry best practices.
Provide support during data center upgrades and outages.
Assist with performance tuning and benchmarking activities.