Logo
MedStar Health

Sr. System Level Debug Engineer- GPU

MedStar Health, Austin, Texas, us, 78716

Save Job

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance_

THE TEAM: AMD's Data Center GPU organization is transforming the industry with our AI based Graphic Processors. Our primary objective is to design exceptional products that drive the evolution of computing experiences, serving as the cornerstone for enterprise Data Centers, (AI) Artificial Intelligence, HPC and Embedded systems. If this resonates with you, come and joining our Data Center GPU organization where we are building amazing AI powered products with amazing people. THE ROLE: AMD is looking for a

lead systems engineer

to provide thought leadership and subject matter

expertise to our growing team. As a key contributor, you will have

a strong

technical background to contribute

to

all aspects of the software development process

.

We have competitive benefit packages and an award-winning culture. Join us!

The Datacenter Graphics and Accelerated Computing

(DCGPU) organization is

looking for an experience

d

system

level

debug engineer

.

Individual will be part of a

team that as to bring-up,

validate

and ensure the platform being used is fully

validated

: including electrical, power,

networking

and

SOC.

Individual will

be required

to lead and document the plan for

validating

the system itself as well put in documentation for unique steps to enable it

.

Individual

will need to be able to drive to root

closure

any issues

encountered

and communicate with the different

Functional and

IP layers for resolution.

THE PERSON: You are

a highly

motivated

hands-on leader

with

a strong

development background, problem solving mentality, excellent communication skills, ability to prioritize tasks a

long with

willingness to learn and adapt. Excellent teamwork skills and capable of

leading a highly technical team

.

Experience in

debugging of

complex HW/FW issues

is

a must

, understand the flow of a GPU through the different layers of a system

and be able to

validate

the items connecting to the GPU SOC (

pcie

,

vr's

, RMs,

retimers

, HBM, internal networking)

.

Communication Is essential in working with different owners of the

functional

code stack as

well as

the ability to drive issues via phone calls, chat messages

, e-mails

.

Hands on experience with Hardware in a

DataCenter

environment

will be

required

. KEY RESPONSIBILITIES: D

ebug / triage engineer

and understanding of industry tools for root causing complex issues Understanding of GPU/System level HW and SW flow Ability to probe parts of a board

; check electrical and power currents and

validate

a system Provide leadership

for driving to root cause issues Communicate / Document flows and methods of

brin

g

-up, boot-up, system initialization and

debug Lead technical presentations demonstrating a good understanding of application, data, infrastructure, architecture expertise and application systems design Collaborate with application, and infrastructure architects and be responsible for the defining-designing-delivering of the technical architectures, patterns, technical quality, risks, fitness for purpose and operability of technical architecture solutions Be a leader and mentor to the operation team; be hands-on and lead by example Be able to hand-on troubleshoot and solve the technical issues; own the problem and drive for resolution Able to proactively support team culture that fosters knowledge sharing, excellence, and collaboration PREFERRED EXPERIENCE: Significant experience in SoC and/or System debug of complex issues Develop / Document debug capabilities on a given SOC and System Go-to-person for debugging of issues for the Production level Platform validation Collaborate with internal teams on root causing issues, finding optimum resolutions Hands-on experience in using industry debug tools, scopes as well examine board level power Proven experience with C/C++ Demonstrable e

xperience in facilitating Agile

,

Scrum or Kanban Skilled in scripting languages such as Perl,

Ruby,

and Shell script Proficient with revision control (GIT, SVN and CVS) Experience crafting and supporting cloud environments, including IaaS and PaaS Database development, PostgreSQL, Oracle, MS SQL Server Good balance of hardware,

architecture,

and software expertise Proven ability to drive resolution of critical problems within a lab, Datacenter Relationship with external customers/partners and able to help resolve problems in their Data Center Relationship with external customers/partners on ability to work manufacturing issues/failures Relationship with external customers/partners on ability to define rqmts for manufacturing validation ACADEMIC CREDENTIALS: Bachelor's/Master's degree in Computer Science or related field strongly preferred

+ m

inimum

8

yrs experience in

S

ystem or SOC level debug and triag

e LOCATION : Austin, TX #LI-SL2 #LI-HYBRID

Benefits offered are described:

AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.

#J-18808-Ljbffr