Fidelity TalentSource

Site Reliability Lead

Fidelity TalentSource, Westlake, Texas, United States

Requirements: Must have:

A bachelor’s degree or equivalent experience in a technology-related field (e.g., Engineering, Computer Science) is required; a master’s degree is a plus. At least 8 years of hands-on experience in deploying and/or supporting highly distributed multi-tiered systems at scale. A minimum of 2 years of experience in Cloud development (AWS) and migration skills; experience building and operating resilient platforms in AWS cloud environments is essential. 2-4 years of software development experience using Python, NodeJS, or Java with a focus on SDLC and automation. Proven hands-on experience with container orchestration, preferably Kubernetes. Experience in operating and implementing distributed and highly concurrent service-based systems. Responsibilities: We aim to enhance our Enterprise Infrastructure by merging Operations Excellence with Development Experience to deliver services that are high scale, highly available, and resilient through automation and Infrastructure as Code. I am looking for a Site Reliability Engineer (SRE) who can apply best practices in Resiliency Engineering, Automation, Observability & Chaos Testing to ensure reliability within our ecosystem. You will play a vital role in helping teams scale by providing production insights, automating operations, offering developer guidance, and generating real-time metrics. You’ll be part of the Production Services team, a centralized support services organization within Fidelity’s Enterprise Infrastructure & Operations group, supporting over 3000 applications across various business units. Company: We appreciate candidates with a systems thinking approach who are passionate about collaborating with diverse teams and overcoming various challenges. I value individuals who can automate processes using various scripting languages (such as Python and Shell scripting) and manage systems through infrastructure as code tools (like IAM, ARM, Terraform, and Chef). Candidates should possess a solid understanding of Cloud Computing and DevOps concepts, including CI/CD pipelines, along with hands-on experience with one or more observability tools (like Prometheus, Grafana, ELK/OpenSearch, OpenTelemetry, and Datadog). I am seeking someone with proven experience in maintaining the scalability and resiliency of complex environments and implementing advanced observability practices at scale. Strong communication skills are essential for effectively reaching both technical and non-technical audiences. Fidelity promotes a hybrid working model, combining the benefits of onsite and offsite experiences to enhance team collaboration and business strategy. As a proud part of Fidelity Investments, we embrace a culture of diversity, equity, and inclusion, where we respect and value the unique perspectives of all our associates. If you’re looking to advance your career in a dynamic and inclusive environment, we encourage you to consider joining us. Fidelity is an equal opportunity employer and welcomes applicants from diverse backgrounds.

#J-18808-Ljbffr