Delphi-US, LLC - Peacemakers in the Talent War
Sr. Cloud Reliability Engineer
Delphi-US, LLC - Peacemakers in the Talent War, Richmond, Virginia, United States, 23214
Job Title
Sr. Cloud Reliability Engineer (Contract) Hybrid (US Citizen Only) Job Description
We are seeking a highly skilled Senior Cloud Reliability Engineer to join our Site Reliability Engineering (SRE) Service within the Cloud Solutions & Services department. This role is responsible for ensuring the reliability, scalability, and performance of our cloud foundational environments. You will leverage software engineering principles, automation, and observability practices to enhance system resilience while supporting critical cloud networking infrastructure. Responsibilities
Works part of cloud foundational platform squads focused on Cloud Networks to demonstrate and champion site reliability culture and practices and exerts technical influence throughout your team Develops and maintains automations, scripts and code associated with automating manual work, improving reliability and stability of the cloud platform Develops, integrates and maintains synthetics (canaries) code to establish health of the services Leads SLIs, SLOs, Error budgets efforts in collaboration with product team to instrument, visualize for proactively managing the stability of cloud platforms Implement observability (logs, metrics, traces) and monitoring for Cloud Network components like VPC, VPN Tunnels, GWLB, and Transit Gateway using tools like SevOne, Grafana, Dynatrace, AWS CloudWatch and AWS Canary Respond to and resolve incidents in a timely manner Use Infrastructure as Code (IaC) tools like Terraform to manage AWS resources. Develops reusable artifacts and software utilities to industrialize SRE practices across FRS Qualifications
5-7 years of extensive experience in end-to-end enterprise software development life cycle experience including maintenance and support 3+ years of experience in Observability and SRE practices. 3+ years of experience in Cloud Networking domain (experience with Routers, Firewalls, Load Balancers, etc) Bachelor’s degree in computer science, Information Systems, or equivalent background or equivalent experience. Extensive knowledge and experience of working in AWS environments Knowledge of Azure is a plus Strong Software development experience in Cloud with one of the languages: Python or GoLang Experience with observability, open telemetry, and in one or more of the tools like Dynatrace, Prometheus, Grafana, AWS CloudWatch, AWS Canary, AWS event bridge Expertise in automating the TOIL Working experience in Agile and Scaled Agile environments Experience supporting infrastructure for large multi-services applications Knowledge of secure coding standards and banking environment is a plus. Desirable to have AWS Certifications (AWS Certified Solutions Architect and AWS Certified SysOps Administrator) About Delphi-US
Delphi-US is a national recruiting firm based in Newport, Rhode Island. We specialize in IT, Engineering and Professional Staffing services for organizations across the United States of America. Our mission is simple: We are Peacemakers in the Talent War. Delphi attracts our nation’s best and brightest Technical & Professional Talent and position our nation’s finest employers to further their competitive advantages. We accomplish this with a proprietary skill-based and cultural matching process that results in higher qualified submissions, hiring with purpose, and employer/employee mutual success. You’ll find our team is supremely experienced, friendly, professional and ready to advocate on your behalf. Armed with best in class talent, Delphi has an indelible understanding of employer expectations and we deliver lasting results. Seniority level
Mid-Senior level Employment type
Contract Job function
Engineering and Information Technology
#J-18808-Ljbffr
Sr. Cloud Reliability Engineer (Contract) Hybrid (US Citizen Only) Job Description
We are seeking a highly skilled Senior Cloud Reliability Engineer to join our Site Reliability Engineering (SRE) Service within the Cloud Solutions & Services department. This role is responsible for ensuring the reliability, scalability, and performance of our cloud foundational environments. You will leverage software engineering principles, automation, and observability practices to enhance system resilience while supporting critical cloud networking infrastructure. Responsibilities
Works part of cloud foundational platform squads focused on Cloud Networks to demonstrate and champion site reliability culture and practices and exerts technical influence throughout your team Develops and maintains automations, scripts and code associated with automating manual work, improving reliability and stability of the cloud platform Develops, integrates and maintains synthetics (canaries) code to establish health of the services Leads SLIs, SLOs, Error budgets efforts in collaboration with product team to instrument, visualize for proactively managing the stability of cloud platforms Implement observability (logs, metrics, traces) and monitoring for Cloud Network components like VPC, VPN Tunnels, GWLB, and Transit Gateway using tools like SevOne, Grafana, Dynatrace, AWS CloudWatch and AWS Canary Respond to and resolve incidents in a timely manner Use Infrastructure as Code (IaC) tools like Terraform to manage AWS resources. Develops reusable artifacts and software utilities to industrialize SRE practices across FRS Qualifications
5-7 years of extensive experience in end-to-end enterprise software development life cycle experience including maintenance and support 3+ years of experience in Observability and SRE practices. 3+ years of experience in Cloud Networking domain (experience with Routers, Firewalls, Load Balancers, etc) Bachelor’s degree in computer science, Information Systems, or equivalent background or equivalent experience. Extensive knowledge and experience of working in AWS environments Knowledge of Azure is a plus Strong Software development experience in Cloud with one of the languages: Python or GoLang Experience with observability, open telemetry, and in one or more of the tools like Dynatrace, Prometheus, Grafana, AWS CloudWatch, AWS Canary, AWS event bridge Expertise in automating the TOIL Working experience in Agile and Scaled Agile environments Experience supporting infrastructure for large multi-services applications Knowledge of secure coding standards and banking environment is a plus. Desirable to have AWS Certifications (AWS Certified Solutions Architect and AWS Certified SysOps Administrator) About Delphi-US
Delphi-US is a national recruiting firm based in Newport, Rhode Island. We specialize in IT, Engineering and Professional Staffing services for organizations across the United States of America. Our mission is simple: We are Peacemakers in the Talent War. Delphi attracts our nation’s best and brightest Technical & Professional Talent and position our nation’s finest employers to further their competitive advantages. We accomplish this with a proprietary skill-based and cultural matching process that results in higher qualified submissions, hiring with purpose, and employer/employee mutual success. You’ll find our team is supremely experienced, friendly, professional and ready to advocate on your behalf. Armed with best in class talent, Delphi has an indelible understanding of employer expectations and we deliver lasting results. Seniority level
Mid-Senior level Employment type
Contract Job function
Engineering and Information Technology
#J-18808-Ljbffr