Must have:
• 8+ years of professional ML engineering experience at an AI/ML-focused organization.
• Familiarity with the state-of-the-art in behavior learning, language, and/or computer vision.
• Experience training large-scale foundation models (VLMs, text-to-video models, etc.) using distributed training and high-performance optimization techniques such as quantization, mixed precision, model parallelism, data parallelism, or FSDP.
• Extensive practical experience with PyTorch.
• Strong proficiency in Python and software development best practices such as unit testing, documentation, code review, continuous integration, and dependency management.
• Familiarity with data pipelines, model serving and optimization, cloud training, and dataset management.
• An ability to move fast and switch between modes of rapid prototyping and robust implementation as required.
• Experience deploying models on embodied systems/robots.
• Experience working in mixed teams of research scientists and engineers.
• Experience with Amazon EC2, S3, and/or SageMaker.
• Experience with Bazel.