Software Engineer, AI Hardware Infrastructure Job at Tesla in Palo Alto
Tesla, Palo Alto, California, United States
Software Engineer, AI Hardware Infrastructure
Get AI-powered advice on this job and more exclusive features.
What To Expect
As a member of the AIHW Infra team, you will play a critical role in supporting Tesla's AI hardware initiatives by developing automation, infrastructure, and services. Join a dynamic team of engineers dedicated to accelerating workloads through collaboration with AI HW design teams and High-Performance Computing (HPC) groups. Your primary focus will be building robust infrastructure solutions, while also assisting in debugging performance bottlenecks and root-causing cluster issues as needed. The ideal candidate is a proactive engineer with a passion for creating scalable, efficient systems.
What You'll Do
- Develop Python libraries to automate, monitor, measure, and troubleshoot workflows on AI hardware infrastructure
- Spending approximately 50% of time building automation and infrastructure, primarily in Python and other languages, with the remaining time focused on debugging, experimentation, and resolving infrastructure challenges and performance bottlenecks
- Creating and maintaining tools for infrastructure, automation, observability, and reporting to ensure system reliability and performance
- Collaborating with AI HW design and HPC teams to identify, debug, and resolve performance bottlenecks and cluster issues
- Supporting internal users by triaging errors, root-causing issues, and providing effective, maintainable solutions
- Proactively addressing potential infrastructure challenges to minimize user impact and enhance scalability and efficiency of AI hardware workloads
What You'll Bring
- Degree in Engineering, Computer Science, or equivalent experience with evidence of exceptional ability and practical software engineering expertise
- Strong proficiency in Python and adaptability to learn new languages and frameworks
- Extensive familiarity with Linux administration and internals
- Experience or strong interest in automation, observability, and infrastructure development and deployment
- Ability to collaborate effectively with cross-functional teams to debug and optimize complex systems
Benefits
- Medical plans and plan options with $0 payroll deduction
- Family-building, fertility, adoption and surrogacy benefits
- Dental (including orthodontic coverage) and vision plans, both have options with a $0 paycheck contribution
- Company Paid HSA Contribution when enrolled in the High-Deductible medical plan with HSA
- Healthcare and Dependent Care Flexible Spending Accounts (FSA)
- 401(k) with employer match, Employee Stock Purchase Plans, and other financial benefits
- Company paid Basic Life, AD&D
- Short-term and long-term disability insurance (90 day waiting period)
- Employee Assistance Program
- Sick and Vacation time (Flex time for salary positions, Accrued hours for Hourly positions), and Paid Holidays
- Back-up childcare and parenting support resources
- Voluntary benefits: critical illness, hospital indemnity, accident insurance, theft & legal services, and pet insurance
- Weight Loss and Tobacco Cessation Programs
- Tesla Babies program
- Commuter benefits
- Employee discounts and perks program
Expected Compensation
$132,000 - $300,000/annual salary + cash and stock awards + benefits.
Pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. The total compensation package for this position may also include other elements dependent on the position offered. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.
Seniority level
- Entry level
Employment type
- Full-time
Job function
- Engineering and Information Technology
- Industries: Motor Vehicle Manufacturing, Renewable Energy Semiconductor Manufacturing, and Utilities