NVIDIA
Senior Test Architect
We are seeking a highly skilled and hard‑working Senior Test Architect to join our Enterprise Software QA team. This role offers an outstanding opportunity to leave your mark on the design, construction, optimisation, and testing of our flagship super‑computers and data‑center offerings.
What You’ll Be Doing
Define End‑to‑End Test Strategy: Own and drive the overall test architecture and validation strategy for firmware across multiple NVIDIA platforms—from pre‑silicon simulation and emulation to post‑silicon bring‑up and production readiness. Develop test plans aligned with product deliverables and customer use cases.
Architect Scalable Test Infrastructure: Design and implement modular, reusable test frameworks and automation harnesses that support functional, integration, stress, regression, power, security, and performance testing. Ensure test infrastructure scales efficiently across hundreds of systems in parallel.
Engage with Engineering Teams: Work closely with firmware developers, hardware architects, silicon validation, platform QA, and system software teams to ensure comprehensive test coverage. Influence early design decisions to optimise testability and automation readiness.
Own Firmware Quality Metrics: Define quality KPIs such as code coverage, system uptime, bug escape rate, and validation completeness. Establish dashboards and reporting mechanisms to track progress and drive data‑driven decision‑making.
Drive Root Cause Analysis and Debugging: Lead complex issue investigations that span firmware, software, and hardware layers. Develop and document debug methodologies and tools to improve diagnosis efficiency across the team.
Innovate in Lab Automation and CI/CD: Partner with DevOps and infrastructure teams to enhance test automation pipelines, integrate continuous testing into nightly and pre‑merge workflows, and ensure fast and reliable release qualification.
Enable Productisation and Customer Readiness: Validate real‑world use cases, customer configurations, and production scenarios. Contribute to release gates and sign‑off criteria to ensure firmware is ready for deployment in systems critical to the mission.
Mentor, Lead, Explore, and Adopt Emerging Technologies: Serve as a technical mentor and coach to firmware QA engineers and junior test developers. Foster a culture of quality, innovation, and continuous learning across the organization. Stay up to date on trends in embedded validation, test automation frameworks, and industry standards. Champion the adoption of new tools, methodologies, and best practices to raise the quality bar.
Boost Team Efficiency with AI: Demonstrate proven experience using AI‑powered tools and copilots to accelerate test development, automate repetitive validation workflows, and streamline debugging and root‑cause analysis.
What We Need To See
B.S./M.S./PHD in Electrical Engineering, Computer Engineering, Computer Science, or related field.
12+ years of experience in software/firmware testing, with a focus on embedded or low‑level systems.
Strong knowledge of system architecture, boot processes, SoCs, I2C/SPI/PCIe interfaces, and embedded controllers.
Proven experience designing test frameworks and infrastructure in Python, C/C++, or similar languages.
Expertise with platform standards for security, telemetry and manageability (NIST, DMTF); hands‑on experience with server platform, network, storage, cluster configuration and debugging.
Background with platform telemetry, datacenter node lifecycle management/support including CPU/GPU workloads; proficiency in scripting languages such as Python.
Expertise in administering, operating, and configuring Kubernetes and Envoy.
Validated experience in CI/CD tools such as GitLab and Jenkins and the GitOps model.
Experience with lab automation, HW‑in‑the‑loop testing, and CI/CD pipelines.
Strong debugging, problem‑solving, and analytical skills.
Excellent communication and collaboration skills; experience working in a globally distributed team is a plus.
Ways To Stand Out From The Crowd
Experience with NVIDIA platforms such as DGX, HGX, Grace Hopper systems.
Exposure to security validation, compliance (e.g., FIPS, BMC security), or thermal/power validation.
Prior role as a test architect or technical lead for large‑scale firmware or embedded validation programs.
Contributions to open‑source testing tools or frameworks with strong knowledge of cloud‑scale validation, infrastructure automation, or virtualization.
Prior experience using AI tools to design test plans, identify test gaps, automation and failure analysis.
Benefits Competitive salary and benefits package, flexible work environment, and equity. Base salary for Level 5 ranges from $200,000 to $322,000; for Level 6 from $248,000 to $391,000. Full‑time remote or on‑site options available depending on location.
Applications Applications will be accepted until July 29, 2025.
EEO Statement NVIDIA is committed to fostering a diverse work environment and is proud to be an equal‑opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.
#J-18808-Ljbffr
What You’ll Be Doing
Define End‑to‑End Test Strategy: Own and drive the overall test architecture and validation strategy for firmware across multiple NVIDIA platforms—from pre‑silicon simulation and emulation to post‑silicon bring‑up and production readiness. Develop test plans aligned with product deliverables and customer use cases.
Architect Scalable Test Infrastructure: Design and implement modular, reusable test frameworks and automation harnesses that support functional, integration, stress, regression, power, security, and performance testing. Ensure test infrastructure scales efficiently across hundreds of systems in parallel.
Engage with Engineering Teams: Work closely with firmware developers, hardware architects, silicon validation, platform QA, and system software teams to ensure comprehensive test coverage. Influence early design decisions to optimise testability and automation readiness.
Own Firmware Quality Metrics: Define quality KPIs such as code coverage, system uptime, bug escape rate, and validation completeness. Establish dashboards and reporting mechanisms to track progress and drive data‑driven decision‑making.
Drive Root Cause Analysis and Debugging: Lead complex issue investigations that span firmware, software, and hardware layers. Develop and document debug methodologies and tools to improve diagnosis efficiency across the team.
Innovate in Lab Automation and CI/CD: Partner with DevOps and infrastructure teams to enhance test automation pipelines, integrate continuous testing into nightly and pre‑merge workflows, and ensure fast and reliable release qualification.
Enable Productisation and Customer Readiness: Validate real‑world use cases, customer configurations, and production scenarios. Contribute to release gates and sign‑off criteria to ensure firmware is ready for deployment in systems critical to the mission.
Mentor, Lead, Explore, and Adopt Emerging Technologies: Serve as a technical mentor and coach to firmware QA engineers and junior test developers. Foster a culture of quality, innovation, and continuous learning across the organization. Stay up to date on trends in embedded validation, test automation frameworks, and industry standards. Champion the adoption of new tools, methodologies, and best practices to raise the quality bar.
Boost Team Efficiency with AI: Demonstrate proven experience using AI‑powered tools and copilots to accelerate test development, automate repetitive validation workflows, and streamline debugging and root‑cause analysis.
What We Need To See
B.S./M.S./PHD in Electrical Engineering, Computer Engineering, Computer Science, or related field.
12+ years of experience in software/firmware testing, with a focus on embedded or low‑level systems.
Strong knowledge of system architecture, boot processes, SoCs, I2C/SPI/PCIe interfaces, and embedded controllers.
Proven experience designing test frameworks and infrastructure in Python, C/C++, or similar languages.
Expertise with platform standards for security, telemetry and manageability (NIST, DMTF); hands‑on experience with server platform, network, storage, cluster configuration and debugging.
Background with platform telemetry, datacenter node lifecycle management/support including CPU/GPU workloads; proficiency in scripting languages such as Python.
Expertise in administering, operating, and configuring Kubernetes and Envoy.
Validated experience in CI/CD tools such as GitLab and Jenkins and the GitOps model.
Experience with lab automation, HW‑in‑the‑loop testing, and CI/CD pipelines.
Strong debugging, problem‑solving, and analytical skills.
Excellent communication and collaboration skills; experience working in a globally distributed team is a plus.
Ways To Stand Out From The Crowd
Experience with NVIDIA platforms such as DGX, HGX, Grace Hopper systems.
Exposure to security validation, compliance (e.g., FIPS, BMC security), or thermal/power validation.
Prior role as a test architect or technical lead for large‑scale firmware or embedded validation programs.
Contributions to open‑source testing tools or frameworks with strong knowledge of cloud‑scale validation, infrastructure automation, or virtualization.
Prior experience using AI tools to design test plans, identify test gaps, automation and failure analysis.
Benefits Competitive salary and benefits package, flexible work environment, and equity. Base salary for Level 5 ranges from $200,000 to $322,000; for Level 6 from $248,000 to $391,000. Full‑time remote or on‑site options available depending on location.
Applications Applications will be accepted until July 29, 2025.
EEO Statement NVIDIA is committed to fostering a diverse work environment and is proud to be an equal‑opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.
#J-18808-Ljbffr