Tesla Motors, Inc.
AI Infrastructure Engineer, Model Optimization & Deployment, Optimus
Tesla Motors, Inc., Palo Alto, California, United States, 94306
What to Expect
Tesla AIissolvingrobust, real-world AI through humanoid robots.As a Software Engineer for the Optimus team, you will build the tools and infrastructure to make and measure improvements to neural network architecture, visualize data,assistwith exporting and deploying neural networks toTesla'sneural network chip with real-time latency constraints on Optimus, and evaluate experimental results. You will help us automate the entire workflows of training, validation, and production ofOptimus. Most importantly, you will see your work repeatedly shipped to andutilizedby thousands of Humanoid Robots in real world applications.
What You'll Do
Optimize ML models for latency, memory usage, and inference speed
Quantize, prune, and convert models (e.g., to ONNX, TensorRT, TFLite) for deployment on various platforms (cloud, edge, mobile)
Benchmark and profile model performance across different environments
Package and deploy models as REST APIs, batch jobs, or streaming services using tools like FastAPI, Flask, or gRPC
Implement CI/CD pipelines for automated testing and deployment of ML models
Ensure scalability and reliability of ML services in production environments
What You'll Bring
Strong proficiency in Python and PyTorch
Experience with model optimization tools (e.g., ONNX, TensorRT, TFLite, TVM)
Experience with model inference optimization and quantization
Solid understanding of containerization and orchestration (Docker, Kubernetes)
Familiarity with cloud platforms (AWS, GCP, Azure) and serverless deployments
Strong grasp of software engineering principles and CI/CD pipelines
Experience deploying models to edge devices or mobile platforms
Knowledge of data serialization formats (e.g., protobuf, Avro)
Exposure to observability tools (e.g., Prometheus, Grafana) for ML monitoring
Compensation and Benefits Benefits
Aetna PPO and HSA plans > 2 medical plan options with $0 payroll deduction
Family-building, fertility, adoption and surrogacy benefits
Dental (including orthodontic coverage) and vision plans, both have options with a $0 paycheck contribution
Company Paid (Health Savings Account) HSA Contribution when enrolled in the High Deductible Aetna medical plan with HSA
Healthcare and Dependent Care Flexible Spending Accounts (FSA)
401(k) with employer match, Employee Stock Purchase Plans, and other financial benefits
Company paid Basic Life, AD&D, short-term and long-term disability insurance
Employee Assistance Program
Sick and Vacation time (Flex time for salary positions), and Paid Holidays
Back-up childcare and parenting support resources
Voluntary benefits to include: critical illness, hospital indemnity, accident insurance, theft & legal services, and pet insurance
Weight Loss and Tobacco Cessation Programs
Tesla Babies program
Commuter benefits
Employee discounts and perks program
Expected Compensation $140,000 - $420,000/annual salary + cash and stock awards + benefits
Pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. The total compensation package for this position may also include other elements dependent on the position offered. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.
#J-18808-Ljbffr
What You'll Do
Optimize ML models for latency, memory usage, and inference speed
Quantize, prune, and convert models (e.g., to ONNX, TensorRT, TFLite) for deployment on various platforms (cloud, edge, mobile)
Benchmark and profile model performance across different environments
Package and deploy models as REST APIs, batch jobs, or streaming services using tools like FastAPI, Flask, or gRPC
Implement CI/CD pipelines for automated testing and deployment of ML models
Ensure scalability and reliability of ML services in production environments
What You'll Bring
Strong proficiency in Python and PyTorch
Experience with model optimization tools (e.g., ONNX, TensorRT, TFLite, TVM)
Experience with model inference optimization and quantization
Solid understanding of containerization and orchestration (Docker, Kubernetes)
Familiarity with cloud platforms (AWS, GCP, Azure) and serverless deployments
Strong grasp of software engineering principles and CI/CD pipelines
Experience deploying models to edge devices or mobile platforms
Knowledge of data serialization formats (e.g., protobuf, Avro)
Exposure to observability tools (e.g., Prometheus, Grafana) for ML monitoring
Compensation and Benefits Benefits
Aetna PPO and HSA plans > 2 medical plan options with $0 payroll deduction
Family-building, fertility, adoption and surrogacy benefits
Dental (including orthodontic coverage) and vision plans, both have options with a $0 paycheck contribution
Company Paid (Health Savings Account) HSA Contribution when enrolled in the High Deductible Aetna medical plan with HSA
Healthcare and Dependent Care Flexible Spending Accounts (FSA)
401(k) with employer match, Employee Stock Purchase Plans, and other financial benefits
Company paid Basic Life, AD&D, short-term and long-term disability insurance
Employee Assistance Program
Sick and Vacation time (Flex time for salary positions), and Paid Holidays
Back-up childcare and parenting support resources
Voluntary benefits to include: critical illness, hospital indemnity, accident insurance, theft & legal services, and pet insurance
Weight Loss and Tobacco Cessation Programs
Tesla Babies program
Commuter benefits
Employee discounts and perks program
Expected Compensation $140,000 - $420,000/annual salary + cash and stock awards + benefits
Pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. The total compensation package for this position may also include other elements dependent on the position offered. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.
#J-18808-Ljbffr