Job Description :

Hi

Hope you are doing well!

I have an urgent opening. Please review the job description below and let me know if it would be of interest to you.

Title : Principal Network Engineer (Hybrid)

Duration : 6+ Months

Location : Santa Clara, CA

About the job

  • We are seeking a highly skilled Principal Network Engineer to join our team and build the next generation of IT AI clusters. You will help lead the team through a major technology transformation toward running AI on-prem, building infrastructure by integrating enterprise-ready platforms on a solid foundation of automation. We are looking for a passionate engineer who will solve networking problems for scalable AI clusters.
  • This is a hands-on network engineering position focused on the architecture, design, development, and deployment of ultra-high-speed, resilient, and scalable DC AI clusters and interconnects for GPU-accelerated data centers and compute clusters. Outstanding problem-solving abilities, a comprehensive understanding of network security protocols and standards, routing, switching, and automation, and a deep understanding of fundamental network theory are also critical to your success at NVIDIA.

What You Will Be Doing

    • Lead the architecture, design, and deployment of global-scale DC interconnects and fabrics for HPC, AI, and GPU computing clusters.
    • Develop high-performance data center fabric using InfiniBand, Ultra Ethernet and related technologies.
    • Optimize carrier interconnects, intra- and inter-DC routing, and dark fiber deployments to ensure low latency and high reliability.
    • Partner with system, OS, GPU, and HPC teams to deliver scalable, highly available networks for extreme-performance workloads.
    • Implement network monitoring, telemetry, problem-solving, and continuous performance improvement processes.
    • Drive technology selection, vendor engagement, and lifecycle management for Data Center hardware and software.
    • Collaborate with internal product managers to develop NVIDIA-on-NVIDIA solutions.

What We Need To See

  • MS or PhD in Electrical Engineering, Computer Science, Computer Engineering, Artificial Intelligence, Data Science, Mathematics, Statistics, or equivalent experience.
  • 12+ years of experience building, managing, and supporting large-scale hybrid networks and developing automation pipelines with Python, Ruby, Go, or other languages used in infrastructure automation.
  • Expert in networking technologies: InfiniBand, Ultra Ethernet, RoCEv2, DCQCN, TCP/UDP, IPv4/IPv6, BGP/MP-BGP, VPN, L2 switching, EVPN, VXLAN, Segment Routing, MPLS.
  • Experience automating network infrastructure.
  • Experience using an automated configuration management system (Python, Terraform, Chef, Puppet, Ansible, Salt, etc.).

If you are interested, please share your updated resume and suggest the best number and time to reach you.

Himanshu Gupta
US IT RECRUITER, DMS VISIONS INC

Ext-104 |

4645 Avon Lane, Suite 210, Frisco, TX 75033

             
