Microsoft Corporation
Join to apply for the
Lead AI Software Architect
role at
Microsoft 6 days ago Be among the first 25 applicants Join to apply for the
Lead AI Software Architect
role at
Microsoft Get AI-powered advice on this job and more exclusive features. Do you want to be at the forefront of innovating the latest Inference systems to propel Microsofts cloud growth? Are you seeking a unique career opportunity that combines technical capabilities, cross team collaboration, with business insight and strategy?
Microsofts mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to achieve our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.
Join the Strategic Planning and Architecture (SPARC) team within Microsofts Azure Hardware Systems and Infrastructure (AHSI) organization, the team behind Microsofts expanding Cloud Infrastructure and for powering Microsofts Intelligent Cloud mission. Microsoft delivers more than 200 online services to more than one billion individuals worldwide and AHSI is the team behind our expanding cloud infrastructure. We deliver the core infrastructure and foundational technologies for Microsoft's cloud businesses including Microsoft Azure, Bing, MSN, Office 365, OneDrive, Skype, Teams and Xbox Live.
We are looking for
Lead AI Software Architect
to join our team!
Responsibilities
Lead the SW architectural design, development, and deployment of the future AI inference infrastructure optimized for Microsofts AI cloud. Collaborate closely with hardware architecture, compiler, systems, simulation/perf optimization to ensure seamless integration and optimized performance. Define and execute strategies for inference , cost optimizations, workload balancing, and memory optimization. Mentor and guide the software engineering team, setting clear technical directions and providing architectural oversight. Evaluate, select, and integrate third-party libraries and open-source frameworks (e.g., TensorRT, TVM, PyTorch, ONNX) for optimized inference performance. Act as a technical liaison between hardware engineers and software teams to communicate requirements, constraints, and opportunities for co-design. Identify performance bottlenecks and opportunities to intersect future hardware and system roadmap planning, influencing strategic direction. Ensure robust software quality and implement best practices for software engineering, testing, and continuous integration.
Qualifications
Required/minimum qualifications
Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience. 7+ years of industry experience, with at least 5 years in AI inference software stack development and architecture. 5+ years of experience in designing and optimizing software stacks for specialized AI hardware, including accelerators, GPUs, or custom ASICs. 3+ years of experience building infrastructure and identify the opportunities for end2end Perf/TCO optimization for business critical AI workloads 3+ years of experience with AI inference frameworks and compiler toolchains such as TensorRT, ONNX Runtime, MLIR, or similar.
Other Requirements
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Preferred Qualifications
Familiarity with open source AI inference SW stacks like vLLM, Dynamo, sglang. Experience contributing to open-source AI frameworks or compiler projects. Previous experience in leading the AI software stack for an early-stage hardware startup or novel hardware project. Publications, patents, or other recognized contributions in the field of AI inference software architecture or acceleration Exceptional leadership, communication, and collaboration skills with a proven track record of guiding technical teams. Excellent understanding of hardware-software interaction, memory hierarchies, compute kernels, and data movement optimization. Proficiency in C++, Python, and experience with low-level programming, performance optimization, and system-level integration.
Software Engineering IC6 - The typical base pay range for this role across the U.S. is USD $163,000 - $296,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $220,800 - $331,200 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft will accept applications for the role until Septmeber 9th, 2025.
#AHSI #SPARC
Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Seniority level
Seniority level
Not Applicable Employment type
Employment type
Full-time Job function
Job function
Engineering and Information Technology Industries
Software Development Referrals increase your chances of interviewing at Microsoft by 2x Sign in to set job alerts for Lead Software Architect roles.
Senior Software Architect (global-Remote-Non-US)
Austin, TX $141,200.00-$338,500.00 1 day ago Austin, TX $209,000.00-$359,150.00 1 week ago Staff Software Architect (US Remote/Hybrid, CT or ET Timezones)
Austin, TX $120,275.31-$163,230.78 4 weeks ago Senior Mainframe Software Engineer - Global Supply Chain
Senior Hardware Modeling Simulation SDE, AWS Machine Learning Accelerators
Austin, TX $151,300.00-$261,500.00 6 days ago Austin, TX $184,000.00-$356,500.00 1 week ago Sr. Technical Consultant (Developer), ServiceNow HR and Workplace Service Delivery Solutions
Austin, TX $135,300.00-$236,800.00 2 weeks ago Grid Systems Senior Solution Architect -1898 & Co.
Grid Systems Senior Solution Architect -1898 & Co.
Austin, Texas Metropolitan Area 2 days ago Austin, TX $170,000.00-$220,000.00 3 weeks ago Austin, TX $185,000.00-$225,000.00 1 week ago Senior Software Engineer, Full Stack Web Development
Senior Software Engineer (AI Fintech Startup)
Senior Software Engineer, Frontend Web Development
Austin, TX $95,200.00-$168,700.00 2 weeks ago Senior Software Engineer, Infrastructure, Platforms Infrastructure Engineering
Austin, TX $166,000.00-$244,000.00 1 week ago Austin, TX $150,000.00-$200,000.00 2 months ago Senior Full Stack Software Engineer - Core Product, Poe (Remote)
Austin, TX $155,656.00-$234,201.00 1 year ago Austin, TX $152,100.00-$232,900.00 22 hours ago Austin, TX $74,800.00-$178,200.00 3 days ago Senior Principal Software Engineer - AI Products
Austin, TX $216,000.00-$312,550.00 4 days ago Were unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr
Lead AI Software Architect
role at
Microsoft 6 days ago Be among the first 25 applicants Join to apply for the
Lead AI Software Architect
role at
Microsoft Get AI-powered advice on this job and more exclusive features. Do you want to be at the forefront of innovating the latest Inference systems to propel Microsofts cloud growth? Are you seeking a unique career opportunity that combines technical capabilities, cross team collaboration, with business insight and strategy?
Microsofts mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to achieve our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.
Join the Strategic Planning and Architecture (SPARC) team within Microsofts Azure Hardware Systems and Infrastructure (AHSI) organization, the team behind Microsofts expanding Cloud Infrastructure and for powering Microsofts Intelligent Cloud mission. Microsoft delivers more than 200 online services to more than one billion individuals worldwide and AHSI is the team behind our expanding cloud infrastructure. We deliver the core infrastructure and foundational technologies for Microsoft's cloud businesses including Microsoft Azure, Bing, MSN, Office 365, OneDrive, Skype, Teams and Xbox Live.
We are looking for
Lead AI Software Architect
to join our team!
Responsibilities
Lead the SW architectural design, development, and deployment of the future AI inference infrastructure optimized for Microsofts AI cloud. Collaborate closely with hardware architecture, compiler, systems, simulation/perf optimization to ensure seamless integration and optimized performance. Define and execute strategies for inference , cost optimizations, workload balancing, and memory optimization. Mentor and guide the software engineering team, setting clear technical directions and providing architectural oversight. Evaluate, select, and integrate third-party libraries and open-source frameworks (e.g., TensorRT, TVM, PyTorch, ONNX) for optimized inference performance. Act as a technical liaison between hardware engineers and software teams to communicate requirements, constraints, and opportunities for co-design. Identify performance bottlenecks and opportunities to intersect future hardware and system roadmap planning, influencing strategic direction. Ensure robust software quality and implement best practices for software engineering, testing, and continuous integration.
Qualifications
Required/minimum qualifications
Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience. 7+ years of industry experience, with at least 5 years in AI inference software stack development and architecture. 5+ years of experience in designing and optimizing software stacks for specialized AI hardware, including accelerators, GPUs, or custom ASICs. 3+ years of experience building infrastructure and identify the opportunities for end2end Perf/TCO optimization for business critical AI workloads 3+ years of experience with AI inference frameworks and compiler toolchains such as TensorRT, ONNX Runtime, MLIR, or similar.
Other Requirements
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Preferred Qualifications
Familiarity with open source AI inference SW stacks like vLLM, Dynamo, sglang. Experience contributing to open-source AI frameworks or compiler projects. Previous experience in leading the AI software stack for an early-stage hardware startup or novel hardware project. Publications, patents, or other recognized contributions in the field of AI inference software architecture or acceleration Exceptional leadership, communication, and collaboration skills with a proven track record of guiding technical teams. Excellent understanding of hardware-software interaction, memory hierarchies, compute kernels, and data movement optimization. Proficiency in C++, Python, and experience with low-level programming, performance optimization, and system-level integration.
Software Engineering IC6 - The typical base pay range for this role across the U.S. is USD $163,000 - $296,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $220,800 - $331,200 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft will accept applications for the role until Septmeber 9th, 2025.
#AHSI #SPARC
Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Seniority level
Seniority level
Not Applicable Employment type
Employment type
Full-time Job function
Job function
Engineering and Information Technology Industries
Software Development Referrals increase your chances of interviewing at Microsoft by 2x Sign in to set job alerts for Lead Software Architect roles.
Senior Software Architect (global-Remote-Non-US)
Austin, TX $141,200.00-$338,500.00 1 day ago Austin, TX $209,000.00-$359,150.00 1 week ago Staff Software Architect (US Remote/Hybrid, CT or ET Timezones)
Austin, TX $120,275.31-$163,230.78 4 weeks ago Senior Mainframe Software Engineer - Global Supply Chain
Senior Hardware Modeling Simulation SDE, AWS Machine Learning Accelerators
Austin, TX $151,300.00-$261,500.00 6 days ago Austin, TX $184,000.00-$356,500.00 1 week ago Sr. Technical Consultant (Developer), ServiceNow HR and Workplace Service Delivery Solutions
Austin, TX $135,300.00-$236,800.00 2 weeks ago Grid Systems Senior Solution Architect -1898 & Co.
Grid Systems Senior Solution Architect -1898 & Co.
Austin, Texas Metropolitan Area 2 days ago Austin, TX $170,000.00-$220,000.00 3 weeks ago Austin, TX $185,000.00-$225,000.00 1 week ago Senior Software Engineer, Full Stack Web Development
Senior Software Engineer (AI Fintech Startup)
Senior Software Engineer, Frontend Web Development
Austin, TX $95,200.00-$168,700.00 2 weeks ago Senior Software Engineer, Infrastructure, Platforms Infrastructure Engineering
Austin, TX $166,000.00-$244,000.00 1 week ago Austin, TX $150,000.00-$200,000.00 2 months ago Senior Full Stack Software Engineer - Core Product, Poe (Remote)
Austin, TX $155,656.00-$234,201.00 1 year ago Austin, TX $152,100.00-$232,900.00 22 hours ago Austin, TX $74,800.00-$178,200.00 3 days ago Senior Principal Software Engineer - AI Products
Austin, TX $216,000.00-$312,550.00 4 days ago Were unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr