Argentina

Lead HPC Network Engineer (General Rodríguez)

Description
We are looking for a Lead HPC Network Engineer to drive the strategy, architecture, and engineering excellence behind advanced AI, research, and Kubernetes-based GPU infrastructure for a major technology client.

The role focuses on defining the technical vision, leading architecture decisions, and setting engineering standards for high-performance network fabrics supporting large-scale LLM and distributed AI workloads, including InfiniBand/RDMA, high-speed Ethernet, Kubernetes networking, host‑side GPU networking, SmartNIC/DPU technologies, and deep network observability. As a technical leader, you will mentor senior engineers, influence client roadmaps, and own end‑to‑end delivery of mission‑critical network platforms.

The ideal candidate combines deep expertise across InfiniBand NDR/HDR and next‑generation fabrics, RDMA/RoCE, NVIDIA/Mellanox networking, NCCL/MSCCL communication patterns, Linux host networking, PCIe/GPU/NIC topology, and Kubernetes networking for GPU clusters, with a proven track record of leading engineering teams and shaping large‑scale HPC/AI network platforms.

Responsibilities
- Own the architectural vision and long‑term roadmap for high‑performance InfiniBand/RDMA and Ethernet fabrics supporting large‑scale GPU clusters and distributed AI/LLM workloads
- Lead the design, evaluation, and selection of cluster network topologies, including fat‑tree, Clos, rail‑optimized, and Dragonfly, and define decision frameworks aligned with workload scale, performance, and cost constraints
- Establish engineering standards and best practices for host‑side networking, including NIC configuration, drivers, firmware, IRQ affinity, NUMA placement, PCIe topology, and GPU‑to‑NIC communication paths
- Drive performance engineering initiatives for RDMA/RoCE, NCCL/MSCCL, and collective communication across multi‑node GPU training workloads, and lead complex root‑cause investigations
- Define the reference architecture for Kubernetes networking on GPU clusters …

Apply at Kit Empleo: kitempleo.com.ar/empleo/pm3xc
More information about this listing

The listing Lead HPC Network Engineer (General Rodríguez) was published in Locanto's General Rodríguez IT and telecommunications category.
