Job Description

This position is 100% on-site.

Green Card holders and US citizens ONLY

Location: Rio Rancho, NM

The Intel Extreme Storage Architecture and Development division is seeking an experienced Linux System Administrator to work on-site in Rio Rancho, NM, supporting a software development infrastructure that consists of physical servers, VMs, Omni-Path and InfiniBand networks, and applications used for continuous integration (CI).

Responsibilities
Install, configure, and support a wide variety of hardware such as servers, switches, and disk storage devices. This involves racking equipment, running cable, and configuring server BIOS settings in the data center.
Install, configure, and support Omni-Path and InfiniBand high-speed networks (switches, cabling, Fabric Manager).
Perform BIOS and firmware upgrades.
Troubleshoot and resolve issues with bare metal provisioning via PXE and disk imaging.
Troubleshoot and resolve issues with KVM VMs, disk imaging, networking, etc.
Develop and test Ansible code to install, configure and upgrade applications and services.
Write Bash and Python scripts for ad-hoc automation and monitoring.
Work closely with the CI team to quickly troubleshoot and resolve infrastructure issues.
Create and maintain infrastructure documentation and how-to articles.

Required Skills
Strong Linux system administration experience, preferably with Red Hat-based distros
Experience with TCP/IP (IPv4) networking
Experience with configuring and troubleshooting SSH, DNS, DHCP, NFS
Experience with Git, GitHub and/or GitLab
Experience with Ansible
Experience with Bash and Python scripting
Must be capable of lifting 1U and 2U rack-mounted servers weighing up to 35 pounds
Ability to work independently and as part of a distributed team.

Desired Skills
Experience working in very large data centers
Experience supporting multiple Linux distros - Fedora, CentOS 8.x, openSUSE 15.x, and Ubuntu 20.04
Experience with installing and configuring Omni-Path and InfiniBand networks
Experience with KVM/QEMU, libvirt
Experience with High Performance Computing (HPC) tools - Slurm, Powerman, Conman, ClusterShell, pdsh, Open MPI
Experience with bare metal provisioning via PXE with Cobbler, or via cloud-init
Experience with remote consoles and power management via IPMI
Experience with centralized user authentication using FreeIPA, SSSD, and autofs
Experience testing Ansible roles with Molecule
Experience with Ansible AWX
Experience installing and configuring Jenkins
Experience with Zabbix
Experience with Nexus and Artifactory
Experience with HTTP load balancing using Corosync, Pacemaker, HAProxy
Experience using Jira for tracking work requests and Confluence for documentation

Minimum educational requirement: BS degree in CS or a closely related field.