General Motors of Canada
Job Description
Hybrid: This role is categorized as hybrid. This means the successful candidate is expected to report to Austin, TX, Warren, MI, or Mountain View, CA three times per week, at minimum.
The Software Engineering Site Reliability Engineer (SRE) is responsible for ensuring the reliability, scalability, and performance of software systems. Their job profile includes:
- Monitoring the performance and availability of software systems, identifying and resolving issues, and implementing proactive measures to prevent future incidents.
- Developing and maintaining automation tools and infrastructure to streamline software deployment, configuration management, and system monitoring.
- Analyzing system performance, identifying bottlenecks, and implementing optimizations to improve the efficiency and scalability of software systems.
- Responding to incidents, conducting root cause analysis, and implementing corrective actions to prevent similar incidents in the future.
- Collaborating with software development teams to ensure that reliability and scalability considerations are incorporated into the software design and implementation.
- Identifying opportunities for process improvement, implementing best practices, and driving initiatives to enhance the reliability and performance of software systems.
Your Skills & Qualifications (Required Qualifications):
- 8+ years of relevant professional experience, with a strong foundation in computer science.
- Bachelor's degree in Computer Science or a related field, or equivalent work experience.
- Proficiency in at least one programming language (e.g., Python, Go, Java) and familiarity with multiple language ecosystems.
- Solid understanding of operating systems, networking, distributed systems, databases, and storage architectures.
- Deep understanding of how code runs on underlying hardware, including operating systems, algorithms, and data structures. Ability to optimize or troubleshoot code by understanding its execution and the impact on system resources.
- Proven experience in automating manual processes, building deployment pipelines, or managing configuration systems.
- Experience handling production incidents, including root cause analysis, mitigation, and working through complex system failures.
- Strong communication skills, with an ability to explain technical concepts to both engineering and business stakeholders.
- Commitment to collaborative problem-solving and shared ownership of services.
Preferred Qualifications:
- Experience with cloud platforms (AWS, GCP, Azure).
- Familiarity with container orchestration systems like Kubernetes.
- A track record of managing or developing distributed systems.
- Prior experience with Java in production.
This job may be eligible for relocation benefits.
- Compensation: The expected base compensation for this role is $195,000 - $298,800. Actual base compensation within this range will vary based on relevant factors.
- Bonus Potential: An incentive pay program offers payouts based on company performance, job level, and individual performance.
Benefits include health and wellbeing programs, medical, dental, vision, HSA, FSA, retirement plans, paid time off, tuition assistance, GM vehicle discounts, and more.
#J-18808-Ljbffr