CADDi
For security reasons, the candidate must be a U.S. Citizen or a Permanent Resident (Green Card).
Overview As a Site Reliability Engineer at CADDi, you will build and secure infrastructure supporting our AI platform, safeguarding U.S. customer data, and supporting the Aerospace and Defense Industrial Base. You will own U.S. operations while collaborating with a global team of 150+ engineers in a fast‑paced, high‑growth environment.
What your days will look like
Design, implement, and operate highly available, scalable, and fault‑tolerant infrastructure primarily on GCP, with multi‑cloud deployments.
Lead Terraform‑based infrastructure development with security best practices, encrypted state management, and governance tools.
Build robust CI/CD pipelines, integrate automated security testing, vulnerability scanning, and compliance checks throughout the development lifecycle.
Implement observability with Prometheus, Grafana, and ELK; define SLOs/SLIs, manage error budgets, and lead incident response with blameless post‑mortems.
Navigate complex regulatory requirements for U.S. Aerospace and Defense Industrial Base, collaborating with security and legal teams on expanding compliance standards.
Reduce operational toil through Python, Go, or Bash automation; follow‑the‑sun collaboration, with primary responsibility for U.S. platform incidents and operations.
Requirements
Bachelor’s degree in Computer Science, Engineering, or equivalent experience.
2+ years in Site Reliability Engineering, DevOps, or Systems Engineering with cloud‑based SaaS platforms.
Deep Terraform and Infrastructure‑as‑Code expertise with security best practices.
Proficiency in Python and other scripting/programming languages.
Modern CI/CD experience (GitHub Actions, GitLab CI, Jenkins, ArgoCD, Spinnaker) including AI/ML workloads.
Strong cloud platform experience, preferably GCP (AWS, Azure experience also valuable).
Experience building and optimizing containers (Docker) and configuring orchestration (Kubernetes).
Monitoring tools experience (Datadog, Prometheus, Grafana, etc.).
Experience in regulated industries (Aerospace & Defense, Finance, Healthcare) with secure platform building.
DevSecOps principles and security integration experience.
Security‑first development mindset with understanding of secure infrastructure practices.
Strong problem‑solving and communication skills for distributed team environments.
What would have us dialing your number immediately
Hyper‑growth startup experience.
AI Safety experience.
MLOps and AI/ML infrastructure security experience.
What you will get in return
Competitive salary in the Chicago market with company‑paid healthcare benefits, 401(k) matching, generous time off, and work/life balance.
In‑depth experience in various aspects of the international tech startup environment in Chicago.
Opportunity to contribute to developing and implementing a winning strategy as a foundational member.
Exposure to cross‑functional collaboration and leaders within a growing startup environment.
Chance to directly impact customer satisfaction, retention, and business growth, helping multiple manufacturing businesses succeed in the U.S.
Benefits
Comprehensive health, dental, and vision insurance (100% company‑covered).
Base salary $110,000 to $150,000 annually based on experience and skills.
Stock options plan.
401(k) plan with 4% company match.
15 days paid time off, five sick days, and ten company holidays.
Company culture with lunches, events, healthy snacks, and quarterly celebrations.
Professional development opportunities, conferences, and learning initiatives.
Commute and parking benefits; referral bonuses.
We are a diverse and inclusive workplace that values your unique talents and perspectives. We are committed to building a team that reflects the communities we serve.
Ready to join a passionate team and make a real difference in the future of manufacturing in the U.S.? Apply today and let’s talk.
#J-18808-Ljbffr
Overview As a Site Reliability Engineer at CADDi, you will build and secure infrastructure supporting our AI platform, safeguarding U.S. customer data, and supporting the Aerospace and Defense Industrial Base. You will own U.S. operations while collaborating with a global team of 150+ engineers in a fast‑paced, high‑growth environment.
What your days will look like
Design, implement, and operate highly available, scalable, and fault‑tolerant infrastructure primarily on GCP, with multi‑cloud deployments.
Lead Terraform‑based infrastructure development with security best practices, encrypted state management, and governance tools.
Build robust CI/CD pipelines, integrate automated security testing, vulnerability scanning, and compliance checks throughout the development lifecycle.
Implement observability with Prometheus, Grafana, and ELK; define SLOs/SLIs, manage error budgets, and lead incident response with blameless post‑mortems.
Navigate complex regulatory requirements for U.S. Aerospace and Defense Industrial Base, collaborating with security and legal teams on expanding compliance standards.
Reduce operational toil through Python, Go, or Bash automation; follow‑the‑sun collaboration, with primary responsibility for U.S. platform incidents and operations.
Requirements
Bachelor’s degree in Computer Science, Engineering, or equivalent experience.
2+ years in Site Reliability Engineering, DevOps, or Systems Engineering with cloud‑based SaaS platforms.
Deep Terraform and Infrastructure‑as‑Code expertise with security best practices.
Proficiency in Python and other scripting/programming languages.
Modern CI/CD experience (GitHub Actions, GitLab CI, Jenkins, ArgoCD, Spinnaker) including AI/ML workloads.
Strong cloud platform experience, preferably GCP (AWS, Azure experience also valuable).
Experience building and optimizing containers (Docker) and configuring orchestration (Kubernetes).
Monitoring tools experience (Datadog, Prometheus, Grafana, etc.).
Experience in regulated industries (Aerospace & Defense, Finance, Healthcare) with secure platform building.
DevSecOps principles and security integration experience.
Security‑first development mindset with understanding of secure infrastructure practices.
Strong problem‑solving and communication skills for distributed team environments.
What would have us dialing your number immediately
Hyper‑growth startup experience.
AI Safety experience.
MLOps and AI/ML infrastructure security experience.
What you will get in return
Competitive salary in the Chicago market with company‑paid healthcare benefits, 401(k) matching, generous time off, and work/life balance.
In‑depth experience in various aspects of the international tech startup environment in Chicago.
Opportunity to contribute to developing and implementing a winning strategy as a foundational member.
Exposure to cross‑functional collaboration and leaders within a growing startup environment.
Chance to directly impact customer satisfaction, retention, and business growth, helping multiple manufacturing businesses succeed in the U.S.
Benefits
Comprehensive health, dental, and vision insurance (100% company‑covered).
Base salary $110,000 to $150,000 annually based on experience and skills.
Stock options plan.
401(k) plan with 4% company match.
15 days paid time off, five sick days, and ten company holidays.
Company culture with lunches, events, healthy snacks, and quarterly celebrations.
Professional development opportunities, conferences, and learning initiatives.
Commute and parking benefits; referral bonuses.
We are a diverse and inclusive workplace that values your unique talents and perspectives. We are committed to building a team that reflects the communities we serve.
Ready to join a passionate team and make a real difference in the future of manufacturing in the U.S.? Apply today and let’s talk.
#J-18808-Ljbffr