Motion Recruitment
This is an opportunity to join a fast-paced infrastructure team focused on scaling cloud-native systems that support complex AI and data workloads. This is a full-time role based in New York City, working with AWS, Kubernetes, Helm, Terraform, Datadog, and scripting in Bash and Python to ensure reliability, automation, and observability across systems. You'll be part of a cutting-edge environment operating at the intersection of fintech and AI, helping build platforms that power smarter financial decision-making.
As a Site Reliability Engineer, you’ll be responsible for designing and maintaining infrastructure, improving monitoring, and automating systems using Infrastructure as Code. You’ll work cross-functionally with development and operations teams, taking ownership of systems that support core product delivery. If you're passionate about cloud infrastructure, Kubernetes, and building tools that empower developers, this is an excellent opportunity to learn, grow, and make a high-impact contribution.
Required Skills & Experience
3–5 years of experience with AWS and/or Azure cloud platforms 2–3 years managing Kubernetes clusters in production 2–3 years of Helm experience for Kubernetes package management 2–3 years working with Datadog or similar monitoring tools 3–5 years of Linux system administration and shell scripting 2–3 years of experience with Terraform or similar IaC tools Desired Skills & Experience
Experience with MLOps monitoring and observability PostgreSQL, Elasticsearch, or vector DBs like Qdrant Familiarity with AWS GuardDuty, CloudWatch, and CloudTrail Certifications in AWS, Azure, or Kubernetes Experience with GCP or distributed tracing tools What You Will Be Doing
Tech Breakdown 70% Cloud Infrastructure (AWS, Kubernetes, Terraform, Helm) 30% Monitoring, Automation & Observability (Datadog, Python, Bash) Daily Responsibilities 80% Hands-On Engineering & Infrastructure Automation 20% Team Collaboration & Cross-Functional Support The Offer
Bonus Eligible You Will Receive the Following Benefits
Medical, Dental, and Vision Insurance Vacation Time Stock Options Applicants must be currently authorized to work in the US on a full-time basis now and in the future.
#J-18808-Ljbffr
3–5 years of experience with AWS and/or Azure cloud platforms 2–3 years managing Kubernetes clusters in production 2–3 years of Helm experience for Kubernetes package management 2–3 years working with Datadog or similar monitoring tools 3–5 years of Linux system administration and shell scripting 2–3 years of experience with Terraform or similar IaC tools Desired Skills & Experience
Experience with MLOps monitoring and observability PostgreSQL, Elasticsearch, or vector DBs like Qdrant Familiarity with AWS GuardDuty, CloudWatch, and CloudTrail Certifications in AWS, Azure, or Kubernetes Experience with GCP or distributed tracing tools What You Will Be Doing
Tech Breakdown 70% Cloud Infrastructure (AWS, Kubernetes, Terraform, Helm) 30% Monitoring, Automation & Observability (Datadog, Python, Bash) Daily Responsibilities 80% Hands-On Engineering & Infrastructure Automation 20% Team Collaboration & Cross-Functional Support The Offer
Bonus Eligible You Will Receive the Following Benefits
Medical, Dental, and Vision Insurance Vacation Time Stock Options Applicants must be currently authorized to work in the US on a full-time basis now and in the future.
#J-18808-Ljbffr