T-MOBILE USA, Inc.
At T-Mobile, we invest in YOU! Our Total Rewards Package ensures that employees get the same big love we give our customers. All team members receive a competitive base salary and compensation package - this is Total Rewards. Employees enjoy multiple wealth-building opportunities through our annual stock grant, employee stock purchase plan, 401(k), and access to free, year-round money coaches. That's how we're UNSTOPPABLE for our employees!
Job Overview
At T-Mobile, we don't just build technology - we empower people. We believe in investing in YOU - your growth, your leadership, and your impact. We're unstoppable when individuals like you come together to solve bold challenges, inspire innovation, and create systems that power the future.
As a Principal Platform Engineer, you'll be a key technical leader shaping the platform foundations that power T-Mobile's Device Finance system. You will define the strategic platform architecture, lead modernization of legacy and cloud workloads, and deliver high-resiliency, high-security, high-performance capabilities that are data-driven and AI-empowered. Your vision and hands-on leadership will accelerate time to market, elevate developer productivity, and optimize cost and reliability at scale.
Job Responsibilities
Set platform strategy & reference architecture for computing, data, integration, observability, AIOps, and security across multiple clouds and on-prem environments.
Define and architect scalable, high-performance solutions and shared platform services; set technical direction across development teams.
Lead platform modernization (legacy refactoring, replatforming to Kubernetes/containers, enterprise-wide architecture optimization).
Establish coding and system design standards for microservices (Spring Boot), APIs (REST/gRPC), event-driven patterns, idempotency, and zero-downtime releases.
Own DevOps enablement: GitLab/GitHub CI/CD, artifact strategy, blue/green & canary deployments, rollout/rollback guardrails.
Drive platform performance and reliability: SLOs/error budgets, capacity planning, autoscaling, disaster recovery, and cost-efficient infrastructure scaling.
Implement end-to-end observability and AIOps:
Build data pipelines for metrics, logs, traces, events, and business KPIs (OpenTelemetry).
Deploy anomaly detection, dynamic thresholds, noise reduction, and event correlation to cut alert fatigue and reduce MTTD/MTTR.
Create automated triage and remediation runbooks; integrate with incident and ChatOps workflows for rapid resolution.
Generate AI/LLM-assisted incident summaries, RCA drafts, and runbook suggestions to speed learning and response.
Champion security & compliance: threat modeling, secrets & identity, RBAC/ABAC, auditability (e.g., SOX/PCI), secure SDLC controls, and policy guardrails for AI/AIOps use.
Lead integrations with third‑party platforms and internal systems (e.g., OFSLL and adjacent financial services), API gateways, and shared services.
Steward data platforms: performance and resiliency for Oracle DB and MongoDB, caching strategies, schema/versioning governance.
Enable AI/ML and developer tooling: platform capabilities for model serving/AI agents, golden paths, templates, and innersource components that boost dev velocity.
Advance the technology roadmap: evaluate, pilot, and adopt new platform capabilities across AWS/Azure/GCP/OCI, optimizing for reliability and cost.
Run technical governance: design reviews, security/ops standards, architectural decision records; mentor and coach engineers across teams.
Education & Work Experience (Required/Preferred)
Bachelor's degree in Computer Science, Software/Computer Engineering, or related field (required); Master's in CS/Data/Systems (preferred).
10+ years building and operating platform‑scale, cloud‑native systems; 5+ years leading architecture/technical strategy across multiple teams.
Hands‑on expertise in Java/Spring Boot microservices, distributed systems, and API design (REST/gRPC).
3+ years implementing CI/CD with GitLab/GitHub and IaC; strong experience with Kubernetes/containers and at least one major cloud (AWS, Azure, GCP, or OCI).
Demonstrated experience with Oracle (SQL) and MongoDB (NoSQL) performance and resilience.
Experience enabling AI/ML workloads or platform features (model serving, AI agents, feature stores) is preferred.
AIOps exposure preferred: deploying anomaly detection, correlation, automated remediation, LLM/agent‑assisted ops, and SLO‑driven alerting.
1+ year mentoring/coaching engineers; proven cross‑functional leadership with Product/TPM/Architecture/SRE/SecOps.
Knowledge, Skills and Abilities
Strategic platform thinking: align platform capabilities with business outcomes; create pragmatic roadmaps.
Technical leadership: lead cross‑functional engineering initiatives; influence without authority; scale best practices.
Platform engineering depth: Spring Boot, Kubernetes, Kafka, RabbitMQ, Oracle DB, MongoDB; network fundamentals and performance tuning.
Observability & operations excellence: design for reliability; instrument services; optimize SLOs; drive MTTD/MTTR down.
Security by design: deep understanding of platform security, secrets, identity, data protection, and compliance in regulated finance, including AI/AIOps guardrails and auditability.
Cost & performance optimization: measure and tune infra parameters, right‑size capacity, and drive efficiency at scale.
Collaboration & communication: clear written/spoken communication; stakeholder management across products, finance, and technology.
Innovation mindset: evaluate and integrate new cloud/platform services and AIOps services; enable AI/ML and modern developer experience.
Licenses and Certifications
Certified Information Systems Security Professional (CISSP) – preferred.
AWS Certified Solutions Architect – preferred.
Certified Kubernetes Administrator (CKA) – preferred.
At least 18 years of age
Legally authorized to work in the United States
Travel Required (Yes/No): No
DOT Regulated Position (Yes/No): No
Safety Sensitive Position (Yes/No): No
Base Pay Range: $120,500 - $217,500
Corporate Bonus Target: 20%
Benefits
At T-Mobile, our benefits exemplify the spirit of One Team, Together! Full and part‑time employees have access to the same benefits when eligible, including medical, dental, and vision insurance, a flexible spending account, 401(k), employee stock grants, employee stock purchase plan, paid time off, and up to 12 paid holidays, etc.
Equal Opportunity
T-Mobile USA, Inc. is an Equal Opportunity Employer. All decisions concerning the employment relationship will be made without regard to age, race, ethnicity, color, religion, creed, sex, sexual orientation, gender identity or expression, national origin, religious affiliation, marital status, citizenship status, veteran status, the presence of any physical or mental disability, or any other status or characteristic protected by federal, state, or local law. Discrimination, retaliation or harassment based upon any of these factors is wholly inconsistent with how we do business and will not be tolerated.
Accommodations
For individuals with disabilities needing reasonable accommodation, please email ApplicantAccommodation@t-mobile.com or call 1-844-873-9500.