Athena
Role Overview
The Infrastructure Staff Engineer will spearhead Athena’s infrastructure initiatives, focusing on creating a robust, scalable, and efficient environment that supports the organization's computing needs. This role demands a high level of expertise in cloud platforms along with a deep understanding of distributed systems, site reliability engineering (SRE), observability, and DevOps practices.
Responsibilities
- Infrastructure Strategy: Develop and implement an infrastructure strategy that leverages GCP’s offerings to meet the current and future demands of Athena's product and service offerings.
- Cloud Management: Lead efforts in cloud resource management, ensuring cost-effective and efficient utilization of GCP services.
- Distributed Systems: Design and optimize distributed systems for high availability, scalability, and resilience.
- SRE and DevOps Integration: Incorporate SRE and DevOps methodologies into the infrastructure management process, focusing on automation, continuous integration, and continuous deployment (CI/CD).
- Observability: Establish comprehensive monitoring and observability frameworks to proactively detect and resolve infrastructure issues.
- Tooling: Identify and implement development tools that enhance productivity and streamline workflows.
- Cross-Functional Collaboration: Work closely with software engineering teams to ensure infrastructure supports the seamless deployment and operation of applications.
- Mentorship and Leadership: Mentor and provide technical guidance to infrastructure team members, promoting skill development and knowledge sharing.
Qualifications
- Experience: 10+ years of experience in infrastructure engineering, with a strong emphasis on GCP and distributed systems.
- Cloud Expertise: Proficiency with GCP services, tools, and best practices for cloud infrastructure management.
- SRE and DevOps Practices: Extensive experience with SRE and DevOps principles, including infrastructure as code (IaC), CI/CD pipelines, automation, and configuration management.
- Observability: In-depth knowledge of logging, monitoring, alerting, and telemetry solutions.
- Technical Acumen: Hands-on experience with containerization and orchestration technologies such as Docker and Kubernetes.
- Problem-Solving: Excellent problem-solving skills, with the ability to devise and implement efficient solutions to complex infrastructure challenges.
- Communication: Strong communication skills, capable of effectively collaborating with various stakeholders and conveying technical concepts to non-technical audiences.
- Leadership Skills: Proven leadership abilities, with experience guiding and growing a team of infrastructure engineers.
- Education: A bachelor's or master's degree in Computer Science, Engineering, or a relevant field. Relevant industry certifications in GCP or cloud architecture are highly desirable.
Seniority level
- Mid-Senior level
Employment type
- Full-time
Job function
- Information Technology
Industries
- Business Consulting and Services