Logo
Tech Cratic

Lead Site Reliability Engineer

Tech Cratic, Marlborough, Massachusetts, us, 01752

Save Job

Technology has revolutionized how we approach job hunting, and this book streamlines the process into a fast, efficient system that works. Instead of relying on outdated advice,

The 2-Hour Job Search

focuses on strategies that deliver real results in less time. #1 Best Seller:

The 2-Hour Job Search , with

568 ratings

and an impressive

4.6 out of 5 stars . Don’t miss out on this game-changing guide.

Order your copy now for expert job search tips! Job title:

Lead Site Reliability Engineer Company:

BJ’s Wholesale Club Job description:

Join our team of more than 34,000 team members, supporting our members and communities in our Club Support Center, 235+ clubs and eight distribution centers. BJ’s Wholesale Club offers a collaborative and inclusive environment where all team members can learn, grow and be their authentic selves. Together, we’re committed to providing outstanding service and convenience to our members, helping them save on the products and services they need for their families and homes. The Benefits of working at BJ’s:

BJ’s pays weekly Eligible for free BJ’s Inner Circle and Supplemental membership(s)* Generous time off programs to support busy lifestyles* (Vacation, Personal, Holiday, Sick, Bereavement Leave, Jury Duty) Benefit plans for your changing needs* (Three medical plans**, Health Savings Account (HSA), two dental plans, vision plan, flexible spending) 401(k) plan with company match (must be at least 18 years old) *Eligibility requirements vary by position. **Medical plans vary by location. Responsibilities:

Design and manage Java-based microservices, bash scripts, Redis, High-Availability design, adhering to SRE principles. Maintain system integrity and meet service level objectives (SLOs) and indicators (SLIs) in high-pressure environments. Identify and resolve issues proactively using observability tools like New Relic, Scalyr/Splunk, bash scripts, and Python scripts. Lead initiatives to improve systems and implement SRE best practices. Conduct root-cause analyses for incidents and generate RCA reports. Apply software engineering principles to operational challenges and system performance. Ensure infrastructure availability, performance, security, and efficiency following SRE guidelines. Design and maintain production monitoring systems for timely issue detection. Troubleshoot performance issues using various tools and methodologies. Enhance application and environment security measures. Support multi-cloud environments with SRE strategies. Automate tasks to improve operational workflows. Follow change management and version control best practices. Promote a SRE mindset organization-wide. Qualifications:

Bachelor’s Degree in Computer Science or related field, or foreign equivalent. Curiosity and self-drive to solve complex challenges and drive change. Excellent communication skills for interaction with management, developers, and leadership. Ability to adapt and learn new technologies quickly. Minimum of 5 years of experience in SRE or related roles. Job Conditions:

Work within a diverse, global team environment. Participate in cross-training across regions. Rotate in an on-call schedule to support 24/7 availability. The estimated starting salary for this position is $109,000.00 per year. Actual salaries depend on various factors including location, education, experience, and qualifications.

#J-18808-Ljbffr