Advantage Technical
Sr Site Reliability Engineer
Advantage Technical, Jersey City, New Jersey, United States, 07390
Job Type
W2 contract, 6 month+ contract
Location Jersey City, NJ (Onsite daily)
Title Sr SRE
As a Senior Site Reliability Engineer, you'll bring software engineering practices to operations - building the reliability framework, defining Service Level Objectives (SLOs), and automating toil away. You'll own the health and performance of container platforms (EKS and OpenShift), middleware platforms (Kafka, Redis), and the CI/CD and observability pipelines that power modern, distributed applications.
Key Responsibilities
Platform Operations
Administer and optimize Kubernetes clusters - Amazon EKS and Red Hat OpenShift
Manage platform lifecycle, upgrades, scaling, and security controls
Middleware Management
Operate and tune event platforms like Apache Kafka
Administer in-memory data stores like Redis Enterprise clusters
Administer and maintain the Red Hat 3scale API Gateway platform
Automation
Fine-tune Infrastructure-as-Code (IaC) pipelines and platform components
Automate manual operations through IaC & configuration management tools/platforms
Observability & Instrumentation
Design and implement monitoring dashboards and alerts with Prometheus, Grafana, ELK stack, and Splunk
Instrument Java, Node.js, and Python distributed apps - embed tracing, metrics, and logs at code-level to meet SLOs
Reliability Engineering
Define SLIs/SLOs and manage error budgets - use data-driven insights to balance reliability and feature velocity
Lead on-call rotations and incident response, and conduct blameless root cause analyses to drive continuous improvement
Performance & Capacity
Forecast and right-size resource usage across clusters and middleware
Profile and tune application performance (CPU, memory, GC, threading) in production
Required Skills & Qualifications
12+ years of overall industry experience
6+ years in SRE, DevOps, Platform, or Production Engineering roles
Kubernetes, EKS, and/or OpenShift administration certification (Certified Kubernetes Administrator (CKA), Red Hat Certified OpenShift Administrator, or equivalent)
Hands-on with Kubernetes internals, networking, Helm charts, and Operators
Middleware expertise: Deploying, scaling, and securing Kafka and Redis clusters
Strong IaC toolchain experience: Helm, ArgoCD, Terraform, Ansible or equivalent tools/platforms
Observability mastery: Prometheus, Grafana, ELK/Splunk or equivalent tools/platforms
Experience enforcing container security and policy governance using tools like OPA/Gatekeeper, Kyverno, and scanners such as Trivy, Clair, and Snyk, integrated with CI/CD and admission controls for automated compliance
Experience implementing Kubernetes network segmentation using NetworkPolicy and/or Calico to secure east-west traffic and minimize blast radius, protecting service reliability
Programming/scripting proficiency in Python, shell scripting, Groovy, or a similar automation language
Demonstrable experience instrumenting distributed applications (Java, Node.js, Python) with metrics, logs, and tracing libraries
Proven track record of running large-scale production systems with minimal downtime
Strong analytical, debugging, communication, and collaboration skills
Nice-to-Have
Service mesh experience (Istio, Linkerd)
Chaos engineering foundations (Chaos Monkey, LitmusChaos)
Familiarity with security/compliance in regulated environments
Experience with any API gateway platform (e.g., Red Hat 3scale API Gateway)
What Makes This Role Unique
You'll be the architect of reliability guardrails - building automation and pipelines that free developers and engineers from manual ops
You'll define and enforce SLO-driven releases, leveraging error budgets to strike the right balance between innovation and uptime
You'll own end-to-end instrumentation: from container runtime metrics through Kafka-backed event flows to application-level traces in code
The base pay range above represents the low and high end of the base compensation range we reasonably expect to pay for this position. Actual base compensation will vary and may be above or below the range based on a range of factors including, but not limited to, geographic location, actual experience, and job performance. This job posting is not a promise of any specific pay for any specific employee.
The range listed is just one component of the total compensation package for our employees. Based on the details of your position, we provide a variety of benefits to our employees, including medical, dental, and vision plans, pre‑tax savings plans, pre‑tax parking and commuter plans, supplemental health and welfare plans, a retirement savings plan, an employee assistance program, pet insurance, and paid holidays. Other rewards may include short‑term incentives and paid time off.
After you have applied, download our Staffmark Group WorkNOW App to receive real‑time job offers and apply for additional opportunities. You can download it from the App Store or get it on Google Play.
About Advantage Technical: With company roots going back over 30 years, Advantage Technical is an engineering and information technology services company and a national leader in the provision of technical resources today. These services include Staff Augmentation, Direct Placement, Project Resourcing and Outsourcing - delivered from 40 key market locations, by over 3500 specialized contractors, to over 500 clients across North America. Advantage Technical is a Best of Staffing Diamond Award winner for both Clients and Talent. For more information about the industries and services offered by Advantage Technical, please visit AdvantageTechnical.com.
Advantage Technical is committed to providing equal employment opportunity for all persons regardless of race, color, religion (including religious dress and grooming practices), sex, sexual orientation, gender, gender identity, gender expression, age, marital status, national origin, ancestry, citizenship status, pregnancy, medical condition, genetic information, mental and physical disability, political affiliation, union membership, status as a parent, military or veteran status or other non‑merit based factors. We will provide reasonable accommodations throughout the application, interviewing and employment process. If you require a reasonable accommodation, contact your local branch. Advantage Technical is an E‑Verify employer. This policy is applicable to all phases of the employment relationship, including hiring, transfers, promotions, training, terminations, working conditions, compensation, benefits, and other terms and conditions of employment.
All employees are directed to familiarize themselves with this policy and to act in accordance with it. All decisions with respect to employment matters and other phases of employer‑temporary employee relationships will be in keeping with this policy and in accordance with all applicable laws and regulations.
To read our Privacy Notice for Candidates and Employees/Contractors, please refer to our Privacy Notice for Candidates and Employees/Contractors.
By applying for this job, you agree that you may receive both AI‑generated and non‑AI generated calls, text messages, or emails from Staffmark Group and/or its affiliates, and contracted partners. Frequency varies for text messages. Message and data rates may apply. Carriers are not liable for delayed or undelivered messages. You can reply STOP to cancel and HELP for help. You can access our general Privacy Policy at Privacy Policy - Staffmark.