Logo
Ozmo

Principal Software Architect

Ozmo, Raleigh, North Carolina, United States, 27601

Save Job

Ozmo is seeking an experienced Principal Software Architect to lead the design of AI‑first architectures that power our next‑generation SaaS platform. In this role, you’ll define how we integrate intelligent systems into production at scale: RAG and agentic workflows, multi‑tenant AI infrastructure, and customer‑facing AI capabilities. You’ll collaborate across product, engineering, and customer teams to translate AI opportunities into deployable architectures that balance innovation with reliability, security, and cost efficiency.

As a growing technology company, Ozmo is transforming how enterprises leverage AI to deliver automated outcomes at scale. We’re building a team of engineers who thrive on solving hard technical problems, shipping production systems that matter, and defining what’s possible with AI in enterprise software. If you’re passionate about architecting intelligent systems at scale and want to shape the future of AI‑enabled SaaS, this is the role for you.

What you will do:

Lead the design, documentation, and communication of end‑to‑end system architectures that align with Ozmo’s strategic goals, including AI‑first capabilities that handle non‑deterministic outputs, continuous evaluation, and graceful fallbacks

Architect scalable AI inference and retrieval stacks (vector/graph stores, embedding pipelines, RAG systems, caching, batching, streaming), balancing accuracy, cost, and latency targets.

Define and own architectural standards for AI‑enabled systems, including agent orchestration, memory/state management, guardrails, model versioning, evaluation gates, and safe rollout practices

Architect multi‑tenant AI patterns across the platform, including per‑customer data isolation, access control, model configuration, data residency, and cost allocation

Collaborate with customers’ technical teams to evaluate requirements, architect integrations, and ensure seamless deployment aligned with their existing systems

Partner with product, data, ML, and engineering leaders to translate AI opportunities into deployable system designs; guide tradeoffs between deterministic services and model‑driven components

Create reusable reference architectures and "golden paths" for teams shipping AI features, accelerating delivery while maintaining quality standards

Mentor engineers and foster a culture of innovation, experimentation, and technical excellence

Ensure AI systems adhere to Ozmo’s reliability, security, and compliance requirements, including PII handling, policy enforcement, auditability, and incident response for AI components

Contribute to long‑term technical roadmaps and architectural reviews that include AI‑first design principles.

What you will bring to the role:

7+ years architecting and delivering enterprise‑scale SaaS platforms, including multi‑tenant systems, distributed architectures, and complex integrations

Hands‑on experience designing AI‑enabled systems: RAG architectures, LLM integrations, agentic workflows, vector databases, and AI evaluation frameworks

Deep expertise balancing AI system tradeoffs: accuracy vs. latency vs. cost, deterministic vs. model‑driven components, and build vs. buy decisions

Strong foundation in distributed systems and cloud‑native architecture (AWS, Azure, or GCP): microservices, event‑driven architectures, serverless patterns, API design, domain‑driven design (DDD) principles, and AI infrastructure patterns including model hosting, inference optimization, observability for non‑deterministic systems, and security/compliance

Deep expertise in multi‑tenant SaaS architecture, including data isolation, per‑customer configuration, cost allocation, and scalability patterns for AI workloads

Proven experience leading AI‑enabled architecture initiatives and mentoring technical teams on intelligent system design

Track record creating reusable reference architectures that accelerate team delivery while maintaining quality and security standards

Strong communication skills with ability to translate complex technical concepts for diverse audiences: engineers, executives, and customers

Passion for continuous learning, experimentation, and improving technical systems at scale

While a Bachelor’s degree is preferred, we place greater value on proven, relevant experience

Perks of being an Ozmonaut:

Medical, vision, dental and life insurance along with short‑ and long‑term disability

Plenty of paid time off (PTO) that grows the longer you’re with Ozmo, as well as paid holidays

401k to save for retirement with employer matching

Paid maternity and bonding leave for new parents

Paid pawternity leave when you bring a new pet into your life

One‑month sabbatical after you have been with Ozmo for five years

Flexible, hybrid work arrangements to support your best work

Pay Range:

$180,000+ based on a number of factors and commensurate with qualifications and experience.

This is a fantastic opportunity to join a dynamic team while working remotely within the United States.

Applicants must be legally authorized to work in the U.S. Ozmo does not sponsor employment visas. Ozmo participates in the E‑verify program. Ozmo does not discriminate in employment opportunities or practices on the basis of race, color, religion, sex, sexual orientation, gender identity, gender expression, pregnancy, marital status, national origin, age, disability, or any other characteristic protected by law.

#J-18808-Ljbffr